Update README.md
README.md

Dataset of highly filtered and curated question and answer pairs. Release TBD.

`lilloukas/Platypus-30B` was instruction fine-tuned using LoRA on 4 A100 80GB GPUs. For training details and inference instructions, please see the [Platypus-30B](https://github.com/arielnlee/Platypus-30B.git) GitHub repo.
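
For a quick local test outside the scripts in that repo, the merged weights on the Hub can be loaded with the standard Hugging Face `transformers` API. The sketch below is only an illustration, not the repo's official inference recipe: the Alpaca-style prompt template and the generation settings are assumptions, and a 30B model in float16 needs roughly 60 GB of GPU memory, so the weights are sharded across whatever GPUs are available.
```
# Minimal inference sketch (assumed setup, not the official Platypus-30B inference script).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "lilloukas/Platypus-30B"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # ~60 GB of weights in half precision
    device_map="auto",          # shard across available GPUs (requires accelerate)
)

# The Alpaca-style prompt below is an assumption; check the Platypus-30B repo for the
# template actually used during training.
prompt = "### Instruction:\nSummarize what instruction fine-tuning does.\n\n### Response:\n"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

with torch.no_grad():
    output = model.generate(**inputs, max_new_tokens=256, do_sample=False)

print(tokenizer.decode(output[0], skip_special_tokens=True))
```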
## Reproducing Evaluation Results
Install LM Evaluation Harness:
```
git clone https://github.com/EleutherAI/lm-evaluation-harness
cd lm-evaluation-harness
pip install -e .
```
Each task was evaluated on a single A100 80GB GPU.

ARC:
```
python main.py --model hf-causal-experimental --model_args pretrained=lilloukas/Platypus-30B --tasks arc_challenge --batch_size 1 --no_cache --write_out --output_path results/Platypus-30B/arc_challenge_25shot.json --device cuda --num_fewshot 25
```

HellaSwag:
```
python main.py --model hf-causal-experimental --model_args pretrained=lilloukas/Platypus-30B --tasks hellaswag --batch_size 1 --no_cache --write_out --output_path results/Platypus-30B/hellaswag_10shot.json --device cuda --num_fewshot 10
```

MMLU:
```
python main.py --model hf-causal-experimental --model_args pretrained=lilloukas/Platypus-30B --tasks hendrycksTest-* --batch_size 1 --no_cache --write_out --output_path results/Platypus-30B/mmlu_5shot.json --device cuda --num_fewshot 5
```

TruthfulQA:
```
python main.py --model hf-causal-experimental --model_args pretrained=lilloukas/Platypus-30B --tasks truthfulqa_mc --batch_size 1 --no_cache --write_out --output_path results/Platypus-30B/truthfulqa_0shot.json --device cuda
```
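
Each run above writes its metrics to the JSON file passed via `--output_path`. The exact JSON layout depends on the harness version; the sketch below assumes a top-level `results` mapping from task name to metric values, as written by this (older) harness, and simply prints whatever numeric metrics it finds.
```
# Summarize the JSON files written by --output_path.
# Assumes a top-level "results" key as in the older lm-evaluation-harness output;
# adjust the key names if your harness version writes a different structure.
import json
from pathlib import Path

for path in sorted(Path("results/Platypus-30B").glob("*.json")):
    data = json.loads(path.read_text())
    print(f"== {path.name} ==")
    for task, metrics in data.get("results", {}).items():
        for name, value in metrics.items():
            if isinstance(value, float):
                print(f"  {task:<30} {name:<12} {value:.4f}")
```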