Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
josefonte
's Collections
Benchmarks
Benchmarks
updated
Mar 28, 2025
collection of datasets used to train and test MLMMs (VLMs)
Upvote
-
AI4Math/MathVerse
Viewer
•
Updated
May 15, 2025
•
4.73k
•
2.62k
•
67
MMMU/MMMU
Viewer
•
Updated
Sep 19, 2024
•
11.6k
•
55.7k
•
307
MMMU/MMMU_Pro
Viewer
•
Updated
Mar 8, 2025
•
5.19k
•
8.77k
•
41
AI4Math/MathVista
Viewer
•
Updated
Feb 11, 2024
•
6.14k
•
11.5k
•
199
MathLLMs/MathVision
Viewer
•
Updated
Nov 27, 2025
•
3.34k
•
8.36k
•
112
TIGER-Lab/MEGA-Bench
Viewer
•
Updated
May 7, 2025
•
7.69k
•
1.4k
•
23
lmms-lab/MMBench_EN
Viewer
•
Updated
Mar 8, 2024
•
11.1k
•
312
•
5
Lin-Chen/MMStar
Viewer
•
Updated
Apr 7, 2024
•
1.5k
•
13k
•
45
lmms-lab/MME
Viewer
•
Updated
Dec 23, 2023
•
2.37k
•
28k
•
26
MUIRBENCH/MUIRBENCH
Viewer
•
Updated
Jul 1, 2024
•
2.6k
•
1.68k
•
16
BLINK-Benchmark/BLINK
Viewer
•
Updated
Sep 3, 2025
•
3.81k
•
7.81k
•
36
OpenGVLab/CRPE
Viewer
•
Updated
Mar 21, 2024
•
544
•
679
•
9
ByteDance/MTVQA
Viewer
•
Updated
May 30, 2024
•
8.79k
•
474
•
41
lmms-lab/RealWorldQA
Viewer
•
Updated
Apr 13, 2024
•
765
•
4.94k
•
5
yifanzhang114/MME-RealWorld
Preview
•
Updated
Nov 14, 2024
•
994
•
21
lmms-lab/MMVet
Viewer
•
Updated
Mar 8, 2024
•
218
•
1.25k
•
4
mistralai/MM-MT-Bench
Viewer
•
Updated
Oct 10, 2024
•
92
•
822
•
25
edinburgh-dawg/mmlu-redux
Viewer
•
Updated
Feb 9, 2025
•
3k
•
2.82k
•
37
TIGER-Lab/MMLU-Pro
Viewer
•
Updated
Oct 25, 2025
•
12.1k
•
67.8k
•
403
Idavidrein/gpqa
Viewer
•
Updated
Mar 28, 2024
•
1.25k
•
67k
•
325
openai/gsm8k
Benchmark
•
Updated
14 days ago
•
17.6k
•
415k
•
1.09k
openai/openai_humaneval
Viewer
•
Updated
Jan 4, 2024
•
164
•
126k
•
360
nuprl/MultiPL-E
Viewer
•
Updated
Jul 15, 2025
•
12.7k
•
64.8k
•
60
google/IFEval
Viewer
•
Updated
Aug 14, 2024
•
541
•
44.1k
•
117
opendatalab/OmniDocBench
Viewer
•
Updated
Sep 26, 2025
•
1.36k
•
8.65k
•
59
wulipc/CC-OCR
Viewer
•
Updated
Dec 27, 2024
•
7.06k
•
1.42k
•
5
lmms-lab/ai2d
Viewer
•
Updated
Mar 26, 2024
•
3.09k
•
7.37k
•
17
lmms-lab/textvqa
Viewer
•
Updated
Mar 8, 2024
•
45.3k
•
19.2k
•
23
lmms-lab/DocVQA
Viewer
•
Updated
Apr 18, 2024
•
16.6k
•
22.8k
•
66
HuggingFaceM4/ChartQA
Viewer
•
Updated
Mar 5, 2024
•
32.7k
•
7.41k
•
58
princeton-nlp/CharXiv
Viewer
•
Updated
Sep 27, 2024
•
2.32k
•
3.55k
•
45
AILab-CVC/SEED-Bench-2-plus
Viewer
•
Updated
Apr 27, 2024
•
555
•
77
•
5
echo840/OCRBench
Viewer
•
Updated
Dec 18, 2024
•
1k
•
15.6k
•
17
lmms-lab/OCRBench-v2
Viewer
•
Updated
Feb 9, 2025
•
10k
•
1.2k
•
12
Upvote
-
Share collection
View history
Collection guide
Browse collections