Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
DUAL-GPO
/
zephyr-7b-gpo-v10-i1
like
0
Follow
DUAL Group
2
PEFT
TensorBoard
Safetensors
HuggingFaceH4/ultrafeedback_binarized
mistral
alignment-handbook
Generated from Trainer
trl
dpo
License:
mit
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Use this model
5abe628
zephyr-7b-gpo-v10-i1
Commit History
End of training
5abe628
verified
lole25
commited on
May 8
Model save
8fd7aad
verified
lole25
commited on
May 8
Training in progress, step 5200
53303be
verified
lole25
commited on
May 8
Training in progress, step 5100
7aa3b37
verified
lole25
commited on
May 8
Training in progress, step 5000
d30687d
verified
lole25
commited on
May 8
Training in progress, step 4900
337f968
verified
lole25
commited on
May 8
Training in progress, step 4500
0a9ca18
verified
lole25
commited on
May 8
Training in progress, step 4400
7e70c95
verified
lole25
commited on
May 8
Training in progress, step 4300
a7a62b4
verified
lole25
commited on
May 8
Training in progress, step 4200
b1f2917
verified
lole25
commited on
May 8
Training in progress, step 4000
3ad74a2
verified
lole25
commited on
May 8
Training in progress, step 3900
8df73d7
verified
lole25
commited on
May 8
Training in progress, step 3700
e37a2d8
verified
lole25
commited on
May 8
Training in progress, step 3600
feeedcf
verified
lole25
commited on
May 8
Training in progress, step 3400
8fd8863
verified
lole25
commited on
May 8
Training in progress, step 3100
b2e3f6b
verified
lole25
commited on
May 8
Training in progress, step 3000
bb3a41e
verified
lole25
commited on
May 8
Training in progress, step 2900
6cf3228
verified
lole25
commited on
May 8
Training in progress, step 2800
921e495
verified
lole25
commited on
May 8
Training in progress, step 2700
e017dc0
verified
lole25
commited on
May 8
Training in progress, step 2500
88adeec
verified
lole25
commited on
May 8
Training in progress, step 2400
9e7e3f8
verified
lole25
commited on
May 8
Training in progress, step 2200
ae0c124
verified
lole25
commited on
May 8
Training in progress, step 2100
cc09762
verified
lole25
commited on
May 8
Training in progress, step 2000
6d44ef4
verified
lole25
commited on
May 8
Training in progress, step 1900
a51c0bc
verified
lole25
commited on
May 8
Training in progress, step 1800
8d0292d
verified
lole25
commited on
May 8
Training in progress, step 1700
1fff136
verified
lole25
commited on
May 8
Training in progress, step 1500
ee73ebf
verified
lole25
commited on
May 8
Training in progress, step 1400
19cb440
verified
lole25
commited on
May 8
Training in progress, step 1300
a61e347
verified
lole25
commited on
May 8
Training in progress, step 1100
72959f4
verified
lole25
commited on
May 8
Training in progress, step 1000
825683c
verified
lole25
commited on
May 8
Training in progress, step 900
abcf6d2
verified
lole25
commited on
May 8
Training in progress, step 800
3c06c62
verified
lole25
commited on
May 8
Training in progress, step 600
edf0a10
verified
lole25
commited on
May 8
Training in progress, step 500
e888a28
verified
lole25
commited on
May 8
Training in progress, step 400
3c38793
verified
lole25
commited on
May 8
Training in progress, step 300
c924316
verified
lole25
commited on
May 8
Training in progress, step 100
f501631
verified
lole25
commited on
May 8
initial commit
9193d81
verified
lole25
commited on
May 8