# Open LLM Leaderboard Evaluation Results
Detailed results can be found here.
| Benchmark | Metric | Split | Value |
|---|---|---|---|
| Avg. | | | 73.85 |
| AI2 Reasoning Challenge (25-shot) | normalized accuracy | test | 70.48 |
| HellaSwag (10-shot) | normalized accuracy | validation | 88.76 |
| MMLU (5-shot) | accuracy | test | 66.94 |
| TruthfulQA (0-shot) | mc2 | validation | 67.01 |
| Winogrande (5-shot) | accuracy | validation | 83.50 |
| GSM8k (5-shot) | accuracy | test | 66.41 |
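
These scores come from the Open LLM Leaderboard, which runs EleutherAI's lm-evaluation-harness. The sketch below shows one way to re-run a single benchmark locally; the `simple_evaluate` call, the `arc_challenge` task name, and the dtype choice are assumptions based on lm-eval v0.4 conventions, not something this card specifies.

```python
# Sketch: re-run one leaderboard benchmark locally with lm-evaluation-harness.
# Assumptions (not from this card): lm-eval v0.4+ installed via `pip install lm-eval`,
# and the leaderboard's 25-shot setting for ARC-Challenge.
import lm_eval

results = lm_eval.simple_evaluate(
    model="hf",  # Hugging Face transformers backend
    model_args="pretrained=vicgalle/OpenBeagle-11B,dtype=bfloat16",
    tasks=["arc_challenge"],  # AI2 Reasoning Challenge
    num_fewshot=25,           # matches the leaderboard's 25-shot setting
    batch_size=8,
)
# Prints the acc / acc_norm entries for the task; the leaderboard reports
# normalized accuracy (acc_norm), which should land near 70.48 for this model.
print(results["results"]["arc_challenge"])
```

Because the leaderboard uses a different shot count per benchmark, each task needs its own `simple_evaluate` call with the matching `num_fewshot`.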
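
The card does not include usage code. As a minimal inference sketch assuming the standard transformers causal-LM API (the prompt format and any chat template are assumptions, not documented here):

```python
# Sketch: load vicgalle/OpenBeagle-11B with the generic transformers API.
# Assumption: standard causal-LM usage; this code is not taken from the card.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "vicgalle/OpenBeagle-11B"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # 11B parameters: bf16 keeps memory manageable
    device_map="auto",
)

inputs = tokenizer("Briefly explain the ARC benchmark.", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```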