YAML Metadata
Warning:
empty or missing yaml metadata in repo card
(https://huggingface.co/docs/hub/model-cards#model-card-metadata)
Jina CLIP
The Jina CLIP implementation is hosted in this repository. The model uses:
- the EVA 02 architecture for the vision tower
- the Jina BERT with Flash Attention model as a text tower
To use the Jina CLIP model, the following packages are required:
torch
timm
transformers
einops
xformers
to use x-attentionflash-attn
to use flash attentionapex
to use fused layer normalization