@bwang0911 on Hugging Face: "In the vector search setup, we normally combine a fast embedding model and an…"

Hugging Face

Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Back to feed

bwang0911

posted an update Apr 19

Post

2915

In the vector search setup, we normally combine a fast embedding model and an accurate but slow reranker model.

The newly released @jinaai rerankers are small in size and almost as accurate as our base reranker. This means given a time constraint, it can scoring more candidate documents from embedding models and have a better chance to feed LLM the correct context for RAG generation.

These models are available on Huggingface and has been integrated into the latest SentenceTransformers 2.7.0. Check it out!

jinaai/jina-reranker-v1-turbo-en
jinaai/jina-reranker-v1-tiny-en

tomaarsen

Apr 19

I quite enjoy the speed of these, well done.

In this post

bwang0911 Bo Wang
tomaarsen Tom Aarsen