1 27 5

Daniel Khashabi

danyaljj

danyaljj

AI & ML interests

None yet

Recent Activity

liked a model about 13 hours ago

allenai/unifiedqa-t5-base

liked a dataset 10 days ago

NIH-CARD/CARDBiomedBench

upvoted a paper about 1 month ago

ChiKhaPo: A Large-Scale Multilingual Benchmark for Evaluating Lexical Comprehension and Generation in Large Language Models

View all activity

Organizations

liked a model about 13 hours ago

allenai/unifiedqa-t5-base

Updated Jan 24, 2023 • 613 • 12

liked a dataset 10 days ago

NIH-CARD/CARDBiomedBench

Viewer • Updated Jul 21, 2025 • 68.2k • 52 • 5

upvoted a paper about 1 month ago

ChiKhaPo: A Large-Scale Multilingual Benchmark for Evaluating Lexical Comprehension and Generation in Large Language Models

Paper • 2510.16928 • Published Oct 19, 2025 • 4

upvoted a paper about 2 months ago

Genomic Next-Token Predictors are In-Context Learners

Paper • 2511.12797 • Published Nov 16, 2025 • 7

commented a paper about 2 months ago

Genomic Next-Token Predictors are In-Context Learners

Paper • 2511.12797 • Published Nov 16, 2025 • 7 •

upvoted a paper 2 months ago

SynthTextEval: Synthetic Text Data Generation and Evaluation for High-Stakes Domains

Paper • 2507.07229 • Published Jul 9, 2025 • 11

authored a paper 2 months ago

World-in-World: World Models in a Closed-Loop World

Paper • 2510.18135 • Published Oct 20, 2025 • 76

upvoted 2 papers 2 months ago

World-in-World: World Models in a Closed-Loop World

Paper • 2510.18135 • Published Oct 20, 2025 • 76

MedScore: Generalizable Factuality Evaluation of Free-Form Medical Answers by Domain-adapted Claim Decomposition and Verification

Paper • 2505.18452 • Published May 24, 2025 • 4

liked a dataset 2 months ago

ash56/ShiftySpeech

Viewer • Updated Oct 24, 2025 • 3M • 818 • 21

upvoted 3 papers 3 months ago

liked a Space 4 months ago

World In World Prototype

🥇

Duplicate this leaderboard to initialize your own!

upvoted a paper 4 months ago

mmBERT: A Modern Multilingual Encoder with Annealed Language Learning

Paper • 2509.06888 • Published Sep 8, 2025 • 12

authored a paper 4 months ago

Jailbreak Distillation: Renewable Safety Benchmarking

Paper • 2505.22037 • Published May 28, 2025 • 1

upvoted 3 papers 4 months ago

The Trickle-down Impact of Reward (In-)consistency on RLHF

Paper • 2309.16155 • Published Sep 28, 2023 • 1