pplx-embed Collection Diffusion-Pretrained Dense and Contextual Embeddings • 7 items • Updated 16 days ago • 87
jina-embeddings-v5-text: Task-Targeted Embedding Distillation Paper • 2602.15547 • Published 25 days ago • 26
view article Article **ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models?** 23 days ago • 18
ColBERT-Zero 🐶 Collection First large-scale fully pre-trained ColBERT model using only public data, outperforming GTE-ModernColBERT and GTE-ModernBERT • 10 items • Updated 12 days ago • 18
ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models Paper • 2602.16609 • Published 24 days ago • 6
OriOn Collection Visual long document VLMs based on Mistral-Small-3.1-24B-Instruct-2503 and Qwen3-VL-32B-Instruct • 4 items • Updated 12 days ago • 4
view article Article LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search tooling about 1 month ago • 50
LateOn-Code 💻 Collection State-of-the-art late interaction code retrieval models • 6 items • Updated 12 days ago • 16
view article Article LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family Jan 19 • 88
NanoBEIR datasets Collection These datasets are compatible with the (Sparse)NanoBEIREvaluator with Sentence Transformers v5.2+. Also CrossEncoderNanoBEIREvaluator if bm25 column • 16 items • Updated 12 days ago • 14
view article Article TurkColBERT: A Benchmark of Dense and Late-Interaction Models for Turkish Information Retrieval Dec 4, 2025 • 19
view article Article LightOnOCR-1B: The Case for End-to-End and Efficient Domain-Specific Vision-Language Models for OCR Oct 23, 2025 • 73
Simple Projection Variants Improve ColBERT Performance Paper • 2510.12327 • Published Oct 14, 2025 • 7
Fantastic (small) Retrievers and How to Train Them: mxbai-edge-colbert-v0 Tech Report Paper • 2510.14880 • Published Oct 16, 2025 • 19
view article Article Welcome EmbeddingGemma, Google's new efficient embedding model +4 Sep 4, 2025 • 273
Seq vs Seq: An Open Suite of Paired Encoders and Decoders Paper • 2507.11412 • Published Jul 15, 2025 • 31