[dataset] image-text datasets
  mlfoundations/datacomp_1b • Updated Aug 21, 2023 • 1.39B • 9.23k • 38
  UCSC-VLAA/Recap-DataComp-1B • Updated Jan 9, 2025 • 1.88B • 6.73k • 197
  laion/relaion2B-multi-research-safe • Updated Jul 3, 2024 • 2.06B • 248 • 47
  imthanhlv/laion2B-multi-Vietnamese-subset • Updated Sep 12, 2023 • 22 • 3
[dataset] embeddings-and-retrieval-learning
  Datasets for training embeddings and retrieval models
  unicamp-dl/mmarco • Updated Mar 6, 2024 • 2.01k • 90
  miracl/miracl-corpus • Updated Jan 5, 2023 • 77.2M • 2.78k • 52