Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
HuggingFaceH4
/
zephyr-orpo-141b-A35b-v0.1
like
260
Text Generation
Transformers
TensorBoard
Safetensors
argilla/distilabel-capybara-dpo-7k-binarized
mixtral
trl
orpo
Generated from Trainer
conversational
text-generation-inference
Inference Endpoints
arxiv:
2403.07691
arxiv:
2311.07911
License:
apache-2.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
15
Train
Deploy
Use this model
5351827
zephyr-orpo-141b-A35b-v0.1
/
README.md
Commit History
Update README.md
5351827
verified
lewtun
HF staff
commited on
Apr 11
Model save
cf74763
verified
lewtun
HF staff
commited on
Apr 10
End of training
6744fe7
verified
lewtun
HF staff
commited on
Apr 10
Model save
b08d449
verified
lewtun
HF staff
commited on
Apr 10