metadata

license: apache-2.0
language:
  - en
  - de
  - es
  - fr
tags:
  - sft
inference: false
datasets:
  - OpenAssistant/oasst1

Open-Assistant Llama2 70B SFT OASST

This model is a fine-tuned version of the Llama2 70B LLM. It was fine-tuned on a mixture of OASST top-1 threads.

Model Details

Finetuned from: Llama2 70B
Model type: Causal decoder-only transformer language model
Languages: English, German, Spanish, French (with limited capabilities in Italian, Portuguese, Polish, Dutch, Romanian, Czech, Swedish)
License: Apache 2.0
Contact: Open-Assistant Discord

Prompting

Two special tokens are used to mark the beginning of user and assistant turns: <|prompter|> and <|assistant|>. Each turn ends with a </s> token.

Input prompt example:

<|prompter|>What is a meme, and what's the history behind this word?</s><|assistant|>

The input ends with the <|assistant|> token to signal that the model should start generating the assistant reply.
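
The format described above can be assembled programmatically. The following is a minimal sketch in plain Python; the helper name `build_prompt` and the `(role, text)` turn representation are illustrative assumptions, not part of any Open-Assistant API. It assumes that earlier assistant replies in a multi-turn conversation are also terminated with `</s>`, as stated above for every turn.

```python
# Special tokens as described in the Prompting section.
PROMPTER = "<|prompter|>"
ASSISTANT = "<|assistant|>"
EOS = "</s>"


def build_prompt(turns):
    """Format a conversation into a single prompt string.

    `turns` is a list of (role, text) pairs, where role is either
    "prompter" or "assistant" (hypothetical representation chosen for
    this sketch). The last turn must come from the prompter so that
    the model is asked to produce the next assistant reply.
    """
    markers = {"prompter": PROMPTER, "assistant": ASSISTANT}
    if not turns or turns[-1][0] != "prompter":
        raise ValueError("the conversation must end with a prompter turn")

    parts = []
    for role, text in turns:
        if role not in markers:
            raise ValueError(f"unknown role: {role!r}")
        # Each turn starts with its role token and ends with </s>.
        parts.append(f"{markers[role]}{text}{EOS}")

    # A trailing <|assistant|> token signals the model to start replying.
    parts.append(ASSISTANT)
    return "".join(parts)


prompt = build_prompt(
    [("prompter", "What is a meme, and what's the history behind this word?")]
)
print(prompt)
```

For a single user turn this reproduces the input prompt example shown above. The resulting string would then be passed to whatever tokenizer and generation setup is used to run the model.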