Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
DUAL-GPO
/
zephyr-7b-gpo-v3-4-i2
like
0
Follow
DUAL Group
2
PEFT
TensorBoard
Safetensors
HuggingFaceH4/ultrafeedback_binarized
mistral
alignment-handbook
Generated from Trainer
trl
dpo
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Use this model
main
zephyr-7b-gpo-v3-4-i2
Commit History
End of training
346051e
verified
lole25
commited on
May 20
Model save
b969ec9
verified
lole25
commited on
May 20
Training in progress, step 5100
f638968
verified
lole25
commited on
May 20
Training in progress, step 5000
bd37ac5
verified
lole25
commited on
May 20
Training in progress, step 4900
2555f0a
verified
lole25
commited on
May 20
Training in progress, step 4800
8538820
verified
lole25
commited on
May 20
Training in progress, step 4700
e11310f
verified
lole25
commited on
May 20
Training in progress, step 4600
3564d4a
verified
lole25
commited on
May 20
Training in progress, step 4500
f9bd093
verified
lole25
commited on
May 20
Training in progress, step 4400
f967829
verified
lole25
commited on
May 20
Training in progress, step 4300
61501eb
verified
lole25
commited on
May 20
Training in progress, step 4200
8b6c859
verified
lole25
commited on
May 20
Training in progress, step 4000
246f26e
verified
lole25
commited on
May 20
Training in progress, step 3900
d296b6a
verified
lole25
commited on
May 20
Training in progress, step 3800
dbbd362
verified
lole25
commited on
May 20
Training in progress, step 3700
c5c8886
verified
lole25
commited on
May 20
Training in progress, step 3400
408994a
verified
lole25
commited on
May 20
Training in progress, step 3200
ea93b52
verified
lole25
commited on
May 20
Training in progress, step 3100
ca212d6
verified
lole25
commited on
May 20
Training in progress, step 3000
15cd0b9
verified
lole25
commited on
May 20
Training in progress, step 2900
93f87cb
verified
lole25
commited on
May 20
Training in progress, step 2700
e5743a6
verified
lole25
commited on
May 20
Training in progress, step 2600
c531ea3
verified
lole25
commited on
May 20
Training in progress, step 2500
6d6c1fe
verified
lole25
commited on
May 20
Training in progress, step 2400
b73d208
verified
lole25
commited on
May 20
Training in progress, step 2300
9c27330
verified
lole25
commited on
May 20
Training in progress, step 2200
3a4c7ba
verified
lole25
commited on
May 20
Training in progress, step 2000
0d81e50
verified
lole25
commited on
May 20
Training in progress, step 1900
8209483
verified
lole25
commited on
May 20
Training in progress, step 1800
db35546
verified
lole25
commited on
May 20
Training in progress, step 1700
078870b
verified
lole25
commited on
May 20
Training in progress, step 1600
fa728d9
verified
lole25
commited on
May 20
Training in progress, step 1400
70254c0
verified
lole25
commited on
May 20
Training in progress, step 1300
1d8a7e3
verified
lole25
commited on
May 20
Training in progress, step 1100
77fe91c
verified
lole25
commited on
May 20
Training in progress, step 1000
7d89551
verified
lole25
commited on
May 20
Training in progress, step 900
382e3f8
verified
lole25
commited on
May 20
Training in progress, step 700
2aa042f
verified
lole25
commited on
May 20
Training in progress, step 600
49bbd68
verified
lole25
commited on
May 20
Training in progress, step 500
c5624df
verified
lole25
commited on
May 20
initial commit
caa49ed
verified
lole25
commited on
May 20