
Guille Pérez-Torró

guishe

https://www.linkedin.com/in/guipetor/

GuishePerez

AI & ML interests

Information Retrieval, Few-Shot Learning, Named Entity Recognition, Named Entity Disambiguation, Semantic Search, Aspect-based Sentiment Analysis

Organizations

None yet

guishe's activity

upvoted a collection 6 days ago

SmolLM2

Collection

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 8 items • Updated 6 days ago • 160

upvoted a paper 6 days ago

HelpSteer2-Preference: Complementing Ratings with Preferences

Paper • 2410.01257 • Published Oct 2 • 19

upvoted an article 23 days ago

Article

How to build a custom text classifier without days of human labeling

24 days ago • 54

upvoted 3 articles about 2 months ago

Article

🪆 Introduction to Matryoshka Embedding Models

Feb 23 • 54

Article

Taxonomy Completion with Embedding Quantization and an LLM-based Pipeline: A Case Study in Computational Linguistics

Jul 22 • 4

Article

Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval

Mar 22 • 61

upvoted a collection 3 months ago

4bit Instruct Models

Collection

18 items • Updated 28 days ago • 25

upvoted 2 articles 3 months ago

Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

Jul 23 • 213

Article

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

Jul 29 • 237

upvoted 2 collections 3 months ago

Zeroshot Classifiers

Collection

These are my current best zeroshot classifiers. Some of my older models are downloaded more often, but the models in this collection are newer/better. • 11 items • Updated Apr 3 • 112

ReLiK: Retrieve, Read and LinK

Collection

A blazing fast and lightweight Information Extraction model for Entity Linking and Relation Extraction. • 20 items • Updated Aug 8 • 20

upvoted a paper 4 months ago

Nomic Embed: Training a Reproducible Long Context Text Embedder

Paper • 2402.01613 • Published Feb 2 • 14

upvoted a collection 5 months ago

Instruction Pre-Training

Collection

8 items • Updated Jun 21 • 26

upvoted 7 papers 5 months ago

Mixture-of-Agents Enhances Large Language Model Capabilities

Paper • 2406.04692 • Published Jun 7 • 55

Demonstrate-Search-Predict: Composing retrieval and language models for knowledge-intensive NLP

Paper • 2212.14024 • Published Dec 28, 2022 • 3

DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines

Paper • 2310.03714 • Published Oct 5, 2023 • 30

DSPy Assertions: Computational Constraints for Self-Refining Language Model Pipelines

Paper • 2312.13382 • Published Dec 20, 2023 • 3

Judging LLM-as-a-judge with MT-Bench and Chatbot Arena

Paper • 2306.05685 • Published Jun 9, 2023 • 29

MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark

Paper • 2406.01574 • Published Jun 3 • 42

Improving Text Embeddings with Large Language Models

Paper • 2401.00368 • Published Dec 31, 2023 • 79