Weyaxi's picture
Adding Evaluation Results
227e63c
|
raw
history blame
2.52 kB
metadata
license: cc-by-nc-4.0

Buy Me A Coffee

Merge of ehartford/dolphin-2.1-mistral-7b and Open-Orca/Mistral-7B-OpenOrca using ties merge.

Weights

Density

Quantizationed versions

Quantizationed versions of this model is available thanks to TheBloke.

GPTQ
GGUF
AWQ

Evaluation Results (Open LLM Leaderboard)

Metric Value
Avg. 53.0
ARC (25-shot) 63.91
HellaSwag (10-shot) 84.26
MMLU (5-shot) 62.66
TruthfulQA (0-shot) 53.84
Winogrande (5-shot) 78.22
GSM8K (5-shot) 19.94
DROP (3-shot) 8.17

Open LLM Leaderboard Evaluation Results (Details)

Metric Value
Avg. 53.0
ARC (25-shot) 63.91
HellaSwag (10-shot) 84.26
MMLU (5-shot) 62.66
TruthfulQA (0-shot) 53.84
Winogrande (5-shot) 78.22
GSM8K (5-shot) 19.94
DROP (3-shot) 8.17