Safetensors
llama
alignment-handbook
trl
dpo
Generated from Trainer
File size: 135 Bytes
23ecbd4
 
 
1
2
3
4
version https://git-lfs.github.com/spec/v1
oid sha256:c5f4766cfd3db5d1c582bf949beafeaf1e72d5788b8b22858f5d704e54e150f7
size 3852615520