Visual Question Answering
Transformers
English
videollama2_mistral
text-generation
multimodal large language model
large video-language model
Inference Endpoints
ClownRat
initial commit
634b54d verified
metadata
license: apache-2.0