---
license: apache-2.0
language:
- en
- de
- es
- fr
tags:
- sft
inference: false
datasets:
- OpenAssistant/oasst1
---
# Open-Assistant Llama2 70B SFT OASST

This model is a fine-tuning of the [Llama2 70B](https://huggingface.co/meta-llama/Llama-2-70b-hf) LLM.
It was trained on a mixture of OASST top-1 conversation threads.

## Model Details

- **Finetuned from:** [Llama2 70B](https://huggingface.co/meta-llama/Llama-2-70b-hf)
- **Model type:** Causal decoder-only transformer language model
- **Language:** English, German, Spanish, French (and limited capabilities in Italian, Portuguese, Polish, Dutch, Romanian, Czech, Swedish)
- **License:** Apache 2.0
- **Contact:** [Open-Assistant Discord](https://ykilcher.com/open-assistant-discord)

## Prompting

Two special tokens are used to mark the beginning of user and assistant turns: `<|prompter|>` and `<|assistant|>`. Each turn ends with a `</s>` token.

Input prompt example:

```
<|prompter|>What is a meme, and what's the history behind this word?</s><|assistant|>
```

The input ends with the `<|assistant|>` token to signal that the model should start generating the assistant reply.

## [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)

Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_jordiclive__Llama-2-70b-oasst-1-200).

| Metric              | Value |
|---------------------|-------|
| Avg.                | 57.11 |
| ARC (25-shot)       | 67.66 |
| HellaSwag (10-shot) | 87.24 |
| MMLU (5-shot)       | 69.95 |
| TruthfulQA (0-shot) | 51.28 |
| Winogrande (5-shot) | 84.14 |
| GSM8K (5-shot)      | 32.75 |
| DROP (3-shot)       | 6.73  |
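
As a usage illustration, here is a minimal inference sketch that applies the prompt format described above with the `transformers` library. It assumes the model is published under the repo id `jordiclive/Llama-2-70b-oasst-1-200` (taken from the leaderboard details link) and that you have the hardware for a 70B model; adjust the repo id, dtype, and generation parameters for your setup.

```python
# Minimal sketch, not an official example. Assumes the repo id below is correct
# and that enough GPU memory (or offloading) is available for a 70B model.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "jordiclive/Llama-2-70b-oasst-1-200"  # assumed repo id

tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype=torch.float16,  # 70B weights typically need multiple GPUs
    device_map="auto",
)

# Build the prompt in the format described above: the user turn is wrapped in
# <|prompter|> ... </s>, and the trailing <|assistant|> token signals the model
# to start generating the assistant reply.
question = "What is a meme, and what's the history behind this word?"
prompt = f"<|prompter|>{question}</s><|assistant|>"

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output_ids = model.generate(
    **inputs,
    max_new_tokens=256,
    do_sample=True,
    top_p=0.9,
    temperature=0.8,
)

# Decode only the newly generated tokens, skipping the prompt and special tokens.
reply = tokenizer.decode(
    output_ids[0][inputs["input_ids"].shape[1]:],
    skip_special_tokens=True,
)
print(reply)
```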