---

license: apache-2.0
---