Alexander Visheratin

visheratin

Articles

Data exploration and filtering with Nomic Atlas

Breaking resolution curse of vision-language models

Posts 5

Post

3162

Yesterday, xAI announced Grok-1.5 Vision - https://x.ai/blog/grok-1.5v. But more importantly, they also released a new VLM benchmark dataset - RealWorldQA. The only problem was that they released it as a ZIP archive. I fixed that! Now you can use it in your evaluations as a regular HF dataset: visheratin/realworldqa
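As a rough illustration of using the dataset in an evaluation, here is a minimal sketch built on the `datasets` library. Only the repository name `visheratin/realworldqa` comes from the post. The helper names `load_realworldqa` and `exact_match_accuracy` are hypothetical, the `answer` column mentioned in the comments is an assumption that should be checked against the dataset viewer, and exact-match scoring is just one simple choice rather than the benchmark's official protocol.

```python
from typing import Sequence


def load_realworldqa():
    """Download the dataset from the Hugging Face Hub (needs network access)."""
    # Imported lazily so the scoring helper below works without `datasets` installed.
    from datasets import load_dataset  # pip install datasets

    # With no split argument this returns a DatasetDict keyed by split name.
    return load_dataset("visheratin/realworldqa")


def exact_match_accuracy(predictions: Sequence[str], references: Sequence[str]) -> float:
    """Fraction of predictions equal to the reference, ignoring case and outer whitespace."""
    if len(predictions) != len(references):
        raise ValueError("predictions and references must have the same length")
    if not references:
        raise ValueError("cannot score an empty evaluation set")
    matches = sum(
        pred.strip().lower() == ref.strip().lower()
        for pred, ref in zip(predictions, references)
    )
    return matches / len(references)


if __name__ == "__main__":
    dataset = load_realworldqa()
    # Printing the DatasetDict lists the available splits and column names,
    # which is the place to confirm the assumed `answer` column.
    print(dataset)
```

Once the column names are confirmed, the model's answers and the reference answers from the dataset can be passed to `exact_match_accuracy` as two equally long lists of strings.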

Post

1913

Look at the beauty in the video — four different embeddings on the same map! In another community blog post, I explore how you can use Nomic Atlas to view and clean your dataset. You can check it out here - https://huggingface.co/blog/visheratin/nomic-data-cleaning

Papers 1

arxiv:2309.01859

Spaces 2

Mc Llava 3b

Laion Nllb

Models 18

visheratin/nllb-siglip-i18n

Zero-Shot Image Classification • Updated Jun 3 • 14

visheratin/nllb-clip-large-siglip

Zero-Shot Image Classification • Updated May 3 • 1.24k • 2

visheratin/nllb-clip-base-siglip

Zero-Shot Image Classification • Updated May 3 • 1.23k • 1

visheratin/mc-llava-3b-ft

Feature Extraction • Updated Mar 24 • 2

visheratin/nllb-siglip-mrl-large

Zero-Shot Image Classification • Updated Mar 10 • 2.02k • 11

visheratin/nllb-siglip-mrl-base

Zero-Shot Image Classification • Updated Mar 10 • 2.12k • 8

visheratin/MC-LLaVA-3b

Updated Feb 28 • 231 • 83

visheratin/nllb-clip-large-oc

Zero-Shot Image Classification • Updated Oct 24, 2023 • 85 • 2

visheratin/nllb-clip-base-oc

Zero-Shot Image Classification • Updated Oct 24, 2023 • 68 • 1

visheratin/nllb-clip-base

Updated Oct 11, 2023 • 139 • 4

Datasets 11

visheratin/documentation-images

Viewer • Updated Apr 16 • 1 • 5.14k

visheratin/realworldqa

Viewer • Updated Apr 13 • 765 • 134 • 30

visheratin/laion-coco-nllb

Viewer • Updated Apr 11 • 894k • 1.11k • 39

visheratin/nllb-coco-long

Viewer • Updated Apr 9 • 45.7k • 78

visheratin/SVIT

Viewer • Updated Mar 31 • 108k • 43

visheratin/google_landmarks_photos

Viewer • Updated Mar 19 • 1.27M • 53 • 2

visheratin/object_questions

Viewer • Updated Mar 17 • 132k • 51

visheratin/uber_text_qa

Viewer • Updated Mar 16 • 9.98k • 66 • 1

visheratin/google_landmarks_places

Viewer • Updated Mar 16 • 35.1k • 80 • 2

visheratin/unsplash-caption-questions-init

Viewer • Updated Feb 28 • 24.9k • 45