DINOv3 Collection DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 • 13 items • Updated Aug 21, 2025 • 450
Running on CPU Upgrade Featured 2.82k The Smol Training Playbook 📚 2.82k The secrets to building world-class LLMs
Molmo Collection Artifacts for open multimodal language models. • 5 items • Updated 18 days ago • 309
Pix2Poly: A Sequence Prediction Method for End-to-end Polygonal Building Footprint Extraction from Remote Sensing Imagery Paper • 2412.07899 • Published Dec 10, 2024 • 1 • 1
Vision Language Leaderboards Collection This collection has all the vision language leaderboards. • 7 items • Updated Aug 24, 2024 • 21