---
language:
- en
- sw
license: apache-2.0
tags:
- text-generation-inference
- transformers
- unsloth
- mistral
- trl
base_model: LeroyDyer/Mixtral_AI_MiniTron_II
datasets:
- iamshnoo/alpaca-cleaned-swahili
library_name: transformers
---
# Uploaded model
- **Developed by:** LeroyDyer
- **License:** apache-2.0
- **Finetuned from model:** LeroyDyer/Mixtral_AI_MiniTron_II

This is a smaller model, which makes fine-tuning easier and faster.

It was created from a fresh, untrained model and has so far been trained only on Swahili. Training is still in progress.

It also runs and trains on a laptop without problems. When training on text corpora, keep the context length low, because a longer context forces the GPU to consume more memory; use short articles only. After more intensive training, the context can be extended again.
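One way to keep training samples short, as recommended above, is to split longer articles into small pieces before training. The following is only a minimal sketch: the helper name `chunk_words` and the limit of 256 words are hypothetical, and counting whitespace-separated words is a rough stand-in for the model's real token count, which depends on the tokenizer.

```python
from typing import List


def chunk_words(text: str, max_words: int = 256) -> List[str]:
    """Split text into consecutive chunks of at most max_words words.

    Words are whitespace-separated; this only approximates token count.
    """
    if max_words < 1:
        raise ValueError("max_words must be at least 1")
    words = text.split()
    # Step through the word list in strides of max_words.
    return [
        " ".join(words[i:i + max_words])
        for i in range(0, len(words), max_words)
    ]


# Placeholder text standing in for a long article (not real training data).
article = "neno " * 600
chunks = chunk_words(article, max_words=256)
print(len(chunks), [len(c.split()) for c in chunks])
```

Each chunk can then be treated as its own short training sample. For a precise limit, the chunks would need to be measured with the model's tokenizer rather than by word count.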
This Mistral model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Hugging Face's TRL library.
[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)