Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
QinLiuNLP
/
llama3-sudo-dpo-instruct-5epochs-jxkey
like
0
PEFT
TensorBoard
Safetensors
trl
dpo
Generated from Trainer
License:
llama3
Model card
Files
Files and versions
Metrics
Training metrics
Community
Use this model
main
llama3-sudo-dpo-instruct-5epochs-jxkey
/
adapter_model.safetensors
Commit History
Training in progress, step 1145
07bb642
verified
Qin Liu
commited on
Sep 11
Training in progress, step 1100
ce3ecaf
verified
Qin Liu
commited on
Sep 11
Training in progress, step 1000
5727986
verified
Qin Liu
commited on
Sep 11
Training in progress, step 900
e6f3b89
verified
Qin Liu
commited on
Sep 11
Training in progress, step 800
f7fe920
verified
Qin Liu
commited on
Sep 11
Training in progress, step 700
7c689bf
verified
Qin Liu
commited on
Sep 11
Training in progress, step 600
8b4ba65
verified
Qin Liu
commited on
Sep 11
Training in progress, step 500
005780e
verified
Qin Liu
commited on
Sep 11
Training in progress, step 400
93194e3
verified
Qin Liu
commited on
Sep 11
Training in progress, step 300
f34d21c
verified
Qin Liu
commited on
Sep 11
Training in progress, step 200
56c0945
verified
Qin Liu
commited on
Sep 11
Training in progress, step 100
f2faf78
verified
Qin Liu
commited on
Sep 11