Video Input
#26 opened 37 minutes ago
by
Dafeng9952
How to use visual grounding with this model ?
#25 opened about 3 hours ago
by
r4hul77
How to get embeddings for Image-Text Retrieval?
#23 opened about 17 hours ago
by
wanghaofan
Why EXACTLY this model is not available in Europe?
4
#22 opened about 20 hours ago
by
MoonRide
model.resize_token_embeddings() method is broken - resizes embedding table but not lm_head
#21 opened 1 day ago
by
alexpeys
Chat template is removed in the base variant. Can we still use chat template to formulate the prompt?
3
#12 opened 2 days ago
by
hxgy610
Position of <image> token in prompt for fine-tuning
3
#2 opened 8 days ago
by
hxgy610