Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
DUAL-GPO
/
zephyr-7b-gpo-v1-i3
like
0
Follow
DUAL Group
2
PEFT
TensorBoard
Safetensors
HuggingFaceH4/ultrafeedback_binarized
mistral
alignment-handbook
Generated from Trainer
trl
dpo
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Use this model
main
zephyr-7b-gpo-v1-i3
Commit History
End of training
a803ed3
verified
lole25
commited on
May 13
Model save
d26c8c4
verified
lole25
commited on
May 13
Training in progress, step 3700
6ff621a
verified
lole25
commited on
May 13
Training in progress, step 3600
3257e25
verified
lole25
commited on
May 13
Training in progress, step 3500
cb764c2
verified
lole25
commited on
May 13
Training in progress, step 3200
321c35d
verified
lole25
commited on
May 13
Training in progress, step 3000
00ff340
verified
lole25
commited on
May 13
Training in progress, step 2900
4acb74c
verified
lole25
commited on
May 13
Training in progress, step 2800
052e481
verified
lole25
commited on
May 13
Training in progress, step 2700
f31c746
verified
lole25
commited on
May 13
Training in progress, step 2600
f3817d9
verified
lole25
commited on
May 13
Training in progress, step 2500
5883f85
verified
lole25
commited on
May 13
Training in progress, step 2400
0629400
verified
lole25
commited on
May 13
Training in progress, step 2300
57a461a
verified
lole25
commited on
May 13
Training in progress, step 2200
13b787c
verified
lole25
commited on
May 13
Training in progress, step 2100
e805ed5
verified
lole25
commited on
May 13
Training in progress, step 2000
4941c77
verified
lole25
commited on
May 13
Training in progress, step 1900
356e4a5
verified
lole25
commited on
May 13
Training in progress, step 1600
93efbbe
verified
lole25
commited on
May 13
Training in progress, step 1500
faaf6e4
verified
lole25
commited on
May 13
Training in progress, step 1400
3e23f00
verified
lole25
commited on
May 13
Training in progress, step 1300
de2fedc
verified
lole25
commited on
May 13
Training in progress, step 1200
9ce7b16
verified
lole25
commited on
May 13
Training in progress, step 1100
f802154
verified
lole25
commited on
May 13
Training in progress, step 1000
6019c75
verified
lole25
commited on
May 13
Training in progress, step 800
3229149
verified
lole25
commited on
May 13
Training in progress, step 700
3f0cfaa
verified
lole25
commited on
May 13
Training in progress, step 600
b3ac972
verified
lole25
commited on
May 13
Training in progress, step 400
b64d598
verified
lole25
commited on
May 13
Training in progress, step 100
86ab9f1
verified
lole25
commited on
May 13
initial commit
701e048
verified
lole25
commited on
May 13