Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Evaluation datasets
community
Activity Feed
Follow
75
AI & ML interests
None defined yet.
Recent Activity
alozowski
authored
a paper
8 days ago
YourBench: Easy Custom Evaluation Sets for Everyone
SaylorTwift
new
activity
15 days ago
OpenEvals/SimpleQA:
adds_eval_yaml
SaylorTwift
updated
a dataset
15 days ago
OpenEvals/SimpleQA
View all activity
Team members
8
lighteval
's datasets
192
Sort: Recently updated
lighteval/piqa
Viewer
•
Updated
26 days ago
•
21k
•
662
•
1
lighteval/logiqa_harness
Updated
Aug 19
•
29
lighteval/sacrebleu_manual
Viewer
•
Updated
Aug 19
•
936k
•
9.58k
lighteval/lextreme
Viewer
•
Updated
Aug 19
•
194k
•
728
lighteval/bbh
Viewer
•
Updated
Aug 18
•
78.3k
•
610
•
1
lighteval/synthetic_reasoning
Viewer
•
Updated
Aug 18
•
33k
•
867
•
7
lighteval/covid_dialogue
Viewer
•
Updated
Aug 18
•
614
•
87
•
1
lighteval/numeracy
Viewer
•
Updated
Aug 18
•
1.6k
•
281
•
1
lighteval/synthetic_reasoning_natural
Viewer
•
Updated
Aug 18
•
22k
•
122
•
15
lighteval/hendrycks_ethics
Viewer
•
Updated
Aug 18
•
116k
•
188
lighteval/civil_comments_helm
Viewer
•
Updated
Aug 18
•
623k
•
1.47k
•
1
lighteval/TwitterAAE
Viewer
•
Updated
Aug 18
•
100k
•
1.53k
lighteval/EntityMatching
Viewer
•
Updated
Aug 18
•
153k
•
446
•
7
lighteval/me_q_sum
Viewer
•
Updated
Aug 18
•
1.5k
•
14
lighteval/DyckLanguage
Viewer
•
Updated
Aug 18
•
1.51k
•
158
lighteval/lexglue
Viewer
•
Updated
Aug 18
•
473k
•
707
lighteval/wmt_14
Viewer
•
Updated
Aug 18
•
126k
•
247
lighteval/copyright_helm
Viewer
•
Updated
Aug 18
•
17.8k
•
164
lighteval/med_dialog
Viewer
•
Updated
Aug 18
•
257k
•
162
•
8
lighteval/mutual_harness
Viewer
•
Updated
Aug 18
•
17.7k
•
45
•
2
lighteval/boolq_helm
Viewer
•
Updated
Aug 18
•
12.7k
•
641
•
2
lighteval/legal_summarization
Viewer
•
Updated
Aug 18
•
26.9k
•
254
•
25
lighteval/med_paragraph_simplification
Viewer
•
Updated
Aug 18
•
4.46k
•
97
lighteval/code_generation_lite
Viewer
•
Updated
Aug 15
•
12.8k
•
4.72k
•
1
lighteval/lsat_qa
Viewer
•
Updated
Aug 14
•
459
•
230
•
4
lighteval/wikifact
Viewer
•
Updated
Aug 14
•
58.4k
•
1.53k
•
2
lighteval/bigbench_helm
Viewer
•
Updated
Aug 14
•
22.3k
•
1.53k
lighteval/bold_helm
Viewer
•
Updated
Aug 14
•
4.58k
•
148
lighteval/bbq_helm
Viewer
•
Updated
Aug 14
•
11.9k
•
594
•
4
lighteval/winograd_wsc
Viewer
•
Updated
Aug 13
•
558
•
36
Previous
1
2
3
...
7
Next