arco-reflection / README.md
appvoid's picture
Update README.md
63b95e4 verified
|
raw
history blame
669 Bytes
metadata
base_model: appvoid/arco
language:
  - en
license: apache-2.0
tags:
  - text-generation-inference
  - transformers
  - unsloth
  - llama
  - trl
  - sft

this model is fine-tuned (and potentially overfitted) version of arco on a small reflection dataset.

the model works best with this format:

You are an AI system that returns a good <output> based on the reasoning made, always remember to return an <output> tag at the end. Instruction: <your prompt goes here>
<thinking>

as a mistake, the model is unable to understand when to stop so should be set as stop criteria to avoid the model continue generating text, further versions won't have this issue.