jinaai
/

jina-clip-implementation

🇪🇺 Region: EU

Model card Files Files and versions Community

Edit model card

YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

Jina CLIP

The Jina CLIP implementation is hosted in this repository. The model uses:

the EVA 02 architecture for the vision tower
the Jina BERT with Flash Attention model as a text tower

To use the Jina CLIP model, the following packages are required:

torch
timm
transformers
einops
xformers to use x-attention
flash-attn to use flash attention
apex to use fused layer normalization

Downloads last month: -; Downloads are not tracked for this model. How to track

Inference API

Unable to determine this model's library. Check the docs .