Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Safetensors

Model size

1B params

Tensor type

F16

Model tree for eren23/DistiLabelOrca-TinyLLama-1.1B

Adapters

Quantizations

normalized accuracy on AI2 Reasoning Challenge (25-Shot)
test set Open LLM Leaderboard

36.180
normalized accuracy on HellaSwag (10-Shot)
validation set Open LLM Leaderboard

61.150
accuracy on MMLU (5-Shot)
test set Open LLM Leaderboard

25.090
mc2 on TruthfulQA (0-shot)
validation set Open LLM Leaderboard

38.050
accuracy on Winogrande (5-shot)
validation set Open LLM Leaderboard

60.850
accuracy on GSM8k (5-shot)
test set Open LLM Leaderboard

1.670