NIFE models Collection Nearly Inference Free Embedding (NIFE) models trained using pyNIFE: github.com/stephantul/pynife • 2 items • Updated Nov 4, 2025 • 2
view article Article Introducing MTEB v2: Evaluation of embedding and retrieval systems for more than just text isaacchung • Oct 20, 2025 • 38
HUME: Measuring the Human-Model Performance Gap in Text Embedding Task Paper • 2510.10062 • Published Oct 11, 2025 • 10
Mxbai-large-v1 EmbedPress Collection Large datasets of mxbai-large-v1 embeddings with their truncated texts. Useful for distillation • 13 items • Updated Oct 24, 2025 • 2
view article Article Introducing RTEB: A New Standard for Retrieval Evaluation +4 fzliu, KennethEnevoldsen, Samoed, isaacchung, tomaarsen, fzoll • Oct 1, 2025 • 144
The Ultimate Collection of Code Classifiers Collection 🔥 15 classifiers, 124M parameters, one per programming language— for assessing the educational value of GitHub code • 15 items • Updated May 5, 2025 • 16
view article Article Introducing EuroBERT: A High-Performance Multilingual Encoder Model EuroBERT • Mar 10, 2025 • 147
view article Article Agentic RAG Stack (1/5) - Index and retrieve documents for vector search using Sentence Transformers and DuckDB davidberenstein1957 • Jan 27, 2025 • 22
POTION Collection These are the flagship POTION models. Load them and use them with model2vec (https://github.com/MinishLab/model2vec) or sentence-transformers • 8 items • Updated about 1 month ago • 15
NanoBEIR 🍺 Collection A collection of smaller versions of BEIR datasets with 50 queries and up to 10K documents each. • 13 items • Updated Sep 11, 2024 • 27
view article Article Model2Vec: Distill a Small Fast Model from any Sentence Transformer Pringled • Oct 14, 2024 • 104
Model2Vec base models Collection These are the Minishlab Model2Vec base models. Load them and use them with model2vec (https://github.com/MinishLab/model2vec) or sentence-transformers • 11 items • Updated Mar 15 • 9