nomadrp/dpo_model
PEFT
Safetensors
trl
dpo
Generated from Trainer
License: llama3.1
Commit History
Training in progress, step 6100 | 1abb5a7 | verified | nomadrp committed on Aug 22
Training in progress, step 6000 | 3a147f2 | verified | nomadrp committed on Aug 22
Training in progress, step 5500 | d7721b4 | verified | nomadrp committed on Aug 22
Training in progress, step 5000 | 3187fbc | verified | nomadrp committed on Aug 22
Training in progress, step 4500 | c9612a1 | verified | nomadrp committed on Aug 22
Training in progress, step 4000 | 272fd41 | verified | nomadrp committed on Aug 22
Training in progress, step 3500 | 70daf08 | verified | nomadrp committed on Aug 22
Training in progress, step 3000 | bc3c375 | verified | nomadrp committed on Aug 22
Training in progress, step 2500 | f187465 | verified | nomadrp committed on Aug 22
Training in progress, step 2000 | 0a6dcaf | verified | nomadrp committed on Aug 22
Training in progress, step 1500 | e8cbfcf | verified | nomadrp committed on Aug 22
Training in progress, step 1000 | a870dec | verified | nomadrp committed on Aug 22
Training in progress, step 500 | 5f6e5b3 | verified | nomadrp committed on Aug 22
initial commit | 580cd14 | verified | nomadrp committed on Aug 22