How to finetune this model?

by Labmem009 - opened Sep 9

Sep 9

Interesting work!
Could u pls offer the finetune code or script, so I can use this jp-VLM in downstream tasks?
Thanks a lot!

AXCXEPT

Owner Sep 10

This first improves the FullFinetuning of the language part using the very common DPOTrainer.
This is trl's DPOTrainer.
BETA was done with 0.1 and all we need to do is create a dataset, Chosen and Rejected, and use the dataset with Score.

After that, the images were trained using STIC.
https://github.com/yihedeng9/STIC

Reference.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment