meta-llama
/

Llama-3.2-11B-Vision

Image-Text-to-Text

Inference Endpoints

Model card Files Files and versions Community

Resources

View closed (19)

Video Input

#26 opened 37 minutes ago by

How to use visual grounding with this model ?

#25 opened about 3 hours ago by

How to get embeddings for Image-Text Retrieval?

#23 opened about 17 hours ago by

Why EXACTLY this model is not available in Europe?

#22 opened about 20 hours ago by

model.resize_token_embeddings() method is broken - resizes embedding table but not lm_head

#21 opened 1 day ago by

Chat template is removed in the base variant. Can we still use chat template to formulate the prompt?

#12 opened 2 days ago by

Position of <image> token in prompt for fine-tuning

#2 opened 8 days ago by