Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Xiaodong
/
llava_dpo_17k_flash-attn_DPO_iter3_8k_data
like
0
Safetensors
Xiaodong/DPO-iter3-data-8k
Model card
Files
Files and versions
Community
Edit model card
wandb:
https://wandb.ai/xiaodongwang/llava-next-jf-4A100/runs/6d830d21/overview
Downloads last month
-
Downloads are not tracked for this model.
How to track
Inference API
Unable to determine this model's library. Check the
docs
.
Model tree for
Xiaodong/llava_dpo_17k_flash-attn_DPO_iter3_8k_data
Base model
Xiaodong/Next-DPO-iter1
Finetuned
Xiaodong/Next-DPO-iter2
Finetuned
(
1
)
this model
Dataset used to train
Xiaodong/llava_dpo_17k_flash-attn_DPO_iter3_8k_data
Xiaodong/DPO-iter3-data-8k
Viewer
•
Updated
Oct 13
•
8k
•
11
Collection including
Xiaodong/llava_dpo_17k_flash-attn_DPO_iter3_8k_data
VLMM-DPO-data
Collection
some dpo perference data
•
8 items
•
Updated
18 days ago