Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
NexaAIDev
/
omnivision-968M
like
121
Follow
Nexa AI
118
GGUF
multimodal
conversational
GGUF
Image-Text-to-Text
Inference Endpoints
License:
apache-2.0
Model card
Files
Files and versions
Community
3
Deploy
Use this model
New discussion
New pull request
Resources
PR & discussions documentation
Code of Conduct
Hub documentation
All
Discussions
Pull requests
View closed (0)
Text/vision parameter split
1
#3 opened 1 day ago by
AlexThompson
How do you encode an image in only 81 tokens?
1
#2 opened 1 day ago by
ChristineLai
about ocr
1
#1 opened 1 day ago by
MiaHawthorne