intfloat/multilingual-e5-large

intfloat committed on Aug 7, 2023

Commit c505dce · 1 Parent(s): 9228c45

Update README.md

Files changed (1)

README.md +8 -0

@@ -6095,6 +6095,14 @@ Here are some rules of thumb:
 Different versions of `transformers` and `pytorch` could cause negligible but non-zero performance differences.
+**3. Why are the cosine similarity scores distributed between roughly 0.7 and 1.0?**
+This is known and expected behavior, because we use a low temperature of 0.01 for the InfoNCE contrastive loss.
+For text embedding tasks such as text retrieval or semantic similarity,
+what matters is the relative order of the scores rather than their absolute values,
+so this should not be an issue.
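
Note on the added FAQ entry: the point about relative order can be illustrated with a minimal sketch. It assumes plain NumPy and uses hypothetical score values chosen to sit in the 0.7 to 1.0 band; none of the numbers come from the model, and the helper names (`cosine_scores`, `rank`, `rescale`) are illustrative only. It shows that rescaling compressed scores leaves the ranking unchanged, and that dividing by a temperature of 0.01 turns small cosine gaps into large logit gaps.

```python
import numpy as np


def cosine_scores(query, docs):
    """Cosine similarity between one query vector and each row of docs."""
    query = query / np.linalg.norm(query)
    docs = docs / np.linalg.norm(docs, axis=1, keepdims=True)
    return docs @ query


def rank(scores):
    """Candidate indices ordered from most to least similar."""
    return np.argsort(-scores, kind="stable")


def rescale(scores):
    """Min-max rescale to [0, 1]; cosmetic only, the order is preserved."""
    lo, hi = scores.min(), scores.max()
    if hi == lo:
        return np.zeros_like(scores)
    return (scores - lo) / (hi - lo)


# Hypothetical cosine scores in the compressed band described above.
scores = np.array([0.78, 0.91, 0.74, 0.85])

# The ranking is identical before and after rescaling.
order = rank(scores)
assert np.array_equal(order, rank(rescale(scores)))

# With temperature 0.01 (the value stated in the FAQ), a cosine gap of
# 0.06 becomes a logit gap of about 6, so the softmax is sharply peaked.
temperature = 0.01
logits = scores / temperature
probs = np.exp(logits - logits.max())  # subtract max for numerical stability
probs /= probs.sum()
```

If absolute values are needed for display or thresholding, a threshold calibrated on held-out query and document pairs is likely a safer choice than a fixed cutoff such as 0.5.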
 ## Citation
 If you find our paper or models helpful, please consider citing them as follows: