base_model: appvoid/arco | |
language: | |
- en | |
license: apache-2.0 | |
tags: | |
- text-generation-inference | |
- transformers | |
- unsloth | |
- llama | |
- trl | |
- sft | |
this model is fine-tuned (and potentially overfitted) version of arco on a small reflection dataset. | |
the model works best with this format: | |
``` | |
You are an AI system that returns a good <output> based on the reasoning made, always remember to return an <output> tag at the end. Instruction: <your prompt goes here> | |
<thinking> | |
``` |