metadata
language:
- en
- sw
license: apache-2.0
tags:
- text-generation-inference
- transformers
- unsloth
- mistral
- trl
base_model: LeroyDyer/Mixtral_AI_MiniTron_II
datasets:
- iamshnoo/alpaca-cleaned-swahili
library_name: transformers
Uploaded model
- Developed by: LeroyDyer
- License: apache-2.0
- Finetuned from model : LeroyDyer/Mixtral_AI_MiniTron_II
This is a smaller model easier for fine tuning !! (faster) This model was created from a fresh untrained model and has only been trained with swahili : it is still training!
Plus it will run and train on the laptop no problem ! (only with text corpuses the context needs to be low as it will force the gpu to consume memory so small articles only; later after intensive training the context can be re-extended etc: )
This mistral model was trained 2x faster with Unsloth and Huggingface's TRL library.