license: apache-2.0

This model is instruction-finetuned on the Open-Platypus dataset: https://huggingface.co/datasets/garage-bAInd/Open-Platypus

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

| Metric              | Value |
|---------------------|-------|
| Avg.                | 53.64 |
| ARC (25-shot)       | 62.37 |
| HellaSwag (10-shot) | 85.08 |
| MMLU (5-shot)       | 63.79 |
| TruthfulQA (0-shot) | 47.33 |
| Winogrande (5-shot) | 77.66 |
| GSM8K (5-shot)      | 17.29 |
| DROP (3-shot)       | 21.93 |
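
The reported average appears to be the unweighted mean of the seven benchmark scores listed above. That is an assumption checked only against the numbers in this table, not a statement of how the leaderboard computes it. A minimal sketch of the check, where the `scores` dictionary and the `mean_score` helper are illustrative names:

```python
# Scores copied from the results listed above (the "Avg." row is excluded).
scores = {
    "ARC (25-shot)": 62.37,
    "HellaSwag (10-shot)": 85.08,
    "MMLU (5-shot)": 63.79,
    "TruthfulQA (0-shot)": 47.33,
    "Winogrande (5-shot)": 77.66,
    "GSM8K (5-shot)": 17.29,
    "DROP (3-shot)": 21.93,
}


def mean_score(values):
    """Return the unweighted arithmetic mean of the given scores."""
    values = list(values)
    return sum(values) / len(values)


# Assumption: every benchmark carries equal weight in the average.
average = mean_score(scores.values())
print(f"Avg. {average:.2f}")  # prints "Avg. 53.64"
```

Rounded to two decimals, the result matches the listed `Avg.` value.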

Support My Work

Building LLMs takes time and resources; if you find my work interesting, your support would be epic! Buy Me A Coffee