Update README.md
README.md
CHANGED
@@ -46,7 +46,7 @@ Here is the performance of this model across benchmarks explored in our paper [H
 
 | MMLU 0-shot | MMLU 5-shot | GSM Direct | GSM CoT | BBH Direct | BBH CoT | TydiQA Gold-Passage | TydiQA Closed-book | Codex-Eval Pass@1 | Codex-Eval Pass@10 | AlpacaFarm vs Davinci-003 | Average |
 |:-----------:|:-----------:|:----------:|:-------:|:----------:|:-------:|:-------------------:|:------------------:|:-----------------:|:------------------:|:-------------------------:|:-------:|
-
+| 34.7 | 34.5 | 6.5 | 7.5 | 29.6 | 30.5 | 36.7 | 10.5 | 16.5 | 29.2 | 17.5 | 22.6 |
 
 If you use this model, please cite our work, the llama paper, and the original dataset:
 