This model is just a test of French semantics with Mixtral. Unfortunately, I will not grant access to anyone. To be deleted soon!
outputs
This model is a fine-tuned version of mistralai/Mixtral-8x7B-v0.1 on an unknown dataset.
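A minimal usage sketch, not an official snippet: it loads the base model and applies this repository as a PEFT adapter. The repo is gated, so a Hugging Face token with accepted access is assumed, and the prompt is only a placeholder. The pinned versions below come from the framework versions listed on this card.

```python
# Sketch: load the Mixtral base model, then attach this repo as a PEFT adapter.
# Assumes: pip install peft==0.11.1 transformers==4.41.2 torch==2.3.0 (versions from this card)
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base_id = "mistralai/Mixtral-8x7B-v0.1"
adapter_id = "AkimfromParis/Mixtrale-Houellebeck"  # this repo; gated access assumed

tokenizer = AutoTokenizer.from_pretrained(base_id)
base = AutoModelForCausalLM.from_pretrained(
    base_id,
    torch_dtype=torch.float16,  # Mixtral-8x7B needs substantial GPU memory
    device_map="auto",
)
model = PeftModel.from_pretrained(base, adapter_id)  # apply the fine-tuned adapter

inputs = tokenizer("Bonjour,", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=50)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```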
Model description
Training procedure
Training hyperparameters
The following hyperparameters were used during training (see the configuration sketch after the list):
- learning_rate: 0.0002
- train_batch_size: 2
- eval_batch_size: 8
- seed: 3407
- gradient_accumulation_steps: 4
- total_train_batch_size: 8
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- lr_scheduler_warmup_steps: 5
- training_steps: 60
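A hedged reconstruction, not the author's actual training script: these TrainingArguments mirror the hyperparameters listed above. The output_dir, dataset, and any LoRA/PEFT configuration are assumptions, since the card does not record them.

```python
# Sketch of the listed hyperparameters as transformers TrainingArguments.
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="outputs",           # assumed from the card title
    learning_rate=2e-4,
    per_device_train_batch_size=2,
    per_device_eval_batch_size=8,
    seed=3407,
    gradient_accumulation_steps=4,  # 2 * 4 = total train batch size of 8 (single device assumed)
    adam_beta1=0.9,                 # transformers defaults, stated explicitly
    adam_beta2=0.999,
    adam_epsilon=1e-8,
    lr_scheduler_type="linear",
    warmup_steps=5,
    max_steps=60,
)
```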
Training results
Framework versions
- PEFT 0.11.1
- Transformers 4.41.2
- Pytorch 2.3.0+cu121
- Datasets 2.20.0
- Tokenizers 0.19.1
Model tree for AkimfromParis/Mixtrale-Houellebeck
- Base model: mistralai/Mixtral-8x7B-v0.1
- Adapter: this model