arco-reflection / README.md
appvoid's picture
Update README.md
0e5c6c9 verified
|
raw
history blame
No virus
478 Bytes
metadata
base_model: appvoid/arco
language:
  - en
license: apache-2.0
tags:
  - text-generation-inference
  - transformers
  - unsloth
  - llama
  - trl
  - sft

this model is fine-tuned (and potentially overfitted) version of arco on a small reflection dataset.

the model works best with this format:

You are an AI system that returns a good <output> based on the reasoning made, always remember to return an <output> tag at the end. Instruction: <your prompt goes here>
<thinking>