neuralmagic/Llama-2-7b-ultrachat200k-pruned_70-quantized-deepsparse

Text Generation

Update README.md

#1

by alexmarques - opened Mar 18

base: refs/heads/main ← from: refs/pr/1

Files changed (1)

README.md +1 -7

@@ -48,13 +48,7 @@ Model evaluation metrics and results.
 | Benchmark                                      | Metric        | Llama-2-7b-ultrachat  | Llama-2-7b-pruned70-retrained-ultrachat-quant-ds |
 |------------------------------------------------|---------------|-------------|-------------------------------|
-| [MMLU](https://arxiv.org/abs/2009.03300)       | 5-shot, top-1 | xxxx        | xxxx                          |
-| [HellaSwag](https://arxiv.org/abs/1905.07830)  | 0-shot        | xxxx        | xxxx                          |
-| [WinoGrande](https://arxiv.org/abs/1907.10641) | partial score | xxxx        | xxxx                          |
-| [ARC-c](https://arxiv.org/abs/1911.01547)      |               | xxxx        | xxxx                          |
-| [TruthfulQA](https://arxiv.org/abs/2109.07958) | 5-shot        | xxxx        | xxxx                          |
-| [HumanEval](https://arxiv.org/abs/2107.03374)  | pass@1        | xxxx        | xxxx                          |
-| [GSM8K](https://arxiv.org/abs/2110.14168)      | maj@1         | xxxx        | xxxx                          |
+| [AlpacaEval](https://arxiv.org/abs/2107.03374) ([Llama-2-7b-chat-hf](https://huggingface.co/meta-llama/Llama-2-70b-chat-hf) evaluator) | Win rate  | 57.6% | 57.1% |
 ## Help