Multimodal Embeddings - a marcusinthesky Collection

marcusinthesky 's Collections

DS

Open-vocabulary object detection (OVD).

Multi-modal Mamba

Multimodal Embeddings

Tiny VLM Decoder

PeFT

Decoder Upcycled to Embeddings

Multimodal Embeddings

updated 18 days ago

MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions

Paper • 2403.19651 • Published Mar 28 • 23
No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance

Paper • 2404.04125 • Published Apr 4 • 27
Scaling (Down) CLIP: A Comprehensive Analysis of Data, Architecture, and Training Strategies

Paper • 2404.08197 • Published Apr 12 • 27
Gecko: Versatile Text Embeddings Distilled from Large Language Models

Paper • 2403.20327 • Published Mar 29 • 47
OpenGVLab/InternVL-14B-224px

Image Feature Extraction • Updated Aug 23 • 1.85k • 33
Alibaba-NLP/gte-large-en-v1.5

Sentence Similarity • Updated Aug 9 • 2.78M • 172
jinaai/jina-embeddings-v2-base-en

Feature Extraction • Updated Aug 6 • 80.3k • 691
castorini/repllama-v1.1-mrl-7b-lora-passage

Feature Extraction • Updated May 12 • 19 • 5
McGill-NLP/LLM2Vec-Sheared-LLaMA-mntp

Sentence Similarity • Updated May 21 • 7.08k • 4
BAAI/bge-visualized

Updated Mar 18 • 36
royokong/e5-v

Image-Text-to-Text • Updated 7 days ago • 8.99k • 18
TIGER-Lab/VLM2Vec-Full

Text Generation • Updated 5 days ago • 13.5k • 7
openbmb/VisRAG-Ret

Feature Extraction • Updated 2 days ago • 1.47k • 47