Qwen/Qwen2-VL-7B-Instruct
Tags: Image-Text-to-Text, Transformers, Safetensors, English, qwen2_vl, multimodal, conversational, Inference Endpoints
arXiv: 2409.12191, 2308.12966
License: apache-2.0
Commit History: Qwen2-VL-7B-Instruct/README.md (branch: main)
Update README.md (51c4743, verified) - bluelike committed on Sep 21
add vcr results (cacb254, verified) - bluelike committed on Sep 3
Update README.md (#6) (d776c71, verified) - chenkq and jklj077 committed on Sep 2
add correct pipeline tag (#4) (ccd09ac, verified) - chenkq and RaushanTurganbay (HF staff) committed on Aug 31
Update README.md (#3) (ed24a23, verified) - chenkq and reach-vb (HF staff) committed on Aug 30
Update README.md (8f9fc0b, verified) - JustinLin610 committed on Aug 29
Update README.md (6424504, verified) - JustinLin610 committed on Aug 29
Update README.md (6010982, verified) - bluelike committed on Aug 29
Update README.md (b6241d7, verified) - JustinLin610 committed on Aug 29
Update README.md (1399c6f, verified) - bluelike committed on Aug 29
Create README.md (e1b32dd, verified) - bluelike committed on Aug 29