Reza2kn (Reza Sayar)

upvoted a paper about 18 hours ago

Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution

Paper • 2409.12191 • Published 1 day ago • 39

upvoted a collection 1 day ago

Moshi v0.1 Release

Collection

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated 1 day ago • 134

upvoted a collection 12 days ago

xLAM models

Collection

xLAM: A Family of Large Action Models to Empower AI Agent Systems • 9 items • Updated 11 days ago • 40

upvoted a paper 20 days ago

Generative Verifiers: Reward Modeling as Next-Token Prediction

Paper • 2408.15240 • Published 23 days ago • 12

upvoted a paper 22 days ago

LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs

Paper • 2408.13467 • Published 27 days ago • 23

upvoted 5 articles about 1 month ago

Article

Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging

By

•

Aug 19

• 72

Article

Tool Use, Unified

Aug 12

• 49

Article

Welcome FalconMamba: The first strong attention-free 7B model

Aug 12

• 96

Article

Your AI, Everywhere

By

•

Aug 9

• 10

Article

The case for specialized pre-training: ultra-fast foundation models for dedicated tasks

By

•

Aug 4

• 24

upvoted 4 articles 2 months ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16

• 242

Article

The Rise of Agentic Data Generation

By

•

Jul 15

• 74

Article

Unleash ML Power on iOS: Apple Silicon Optimization Secrets

By

•

Jul 18

• 4

Article

How we leveraged distilabel to create an Argilla 2.0 Chatbot

Jul 16

• 30

upvoted a collection 2 months ago

Product Catalog Generator

Collection

Product Catalog Generator for Persian products which is hosted by Basalam • 7 items • Updated 13 days ago • 8

upvoted an article 2 months ago

Article

The Great LLM Showdown: Amy's Quest for the Perfect LLM

By

•

Jul 9

• 12

upvoted a collection 2 months ago

Text-to-Speech

Collection

4 items • Updated 1 day ago • 1

upvoted a collection 3 months ago

Probably function calling datasets

Collection

Created using the https://huggingface.co/spaces/librarian-bots/dataset-column-search-api Space. • 39 items • Updated Jul 17 • 35

upvoted a paper 3 months ago

Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing

Paper • 2406.08464 • Published Jun 12 • 61

upvoted a collection 3 months ago

mHuBERT-147 models

Collection

Compact yet powerful multilingual speech representation models based on the HuBERT architecture. • 3 items • Updated Jun 4 • 5

upvoted 4 articles 3 months ago

Article

Fine-Tune Wav2Vec2 for English ASR with 🤗 Transformers

Mar 12, 2021

• 13

Article

BrAIn: next generation neurons?

By

•

Jun 5

• 15

Article

Introducing NPC-Playground, a 3D playground to interact with LLM-powered NPCs

Jun 5

• 17

Article

Sales Forecasting with Image Regression

By

•

May 24

• 6

upvoted 2 articles 4 months ago

Article

Fine-Tune W2V2-Bert for low-resource ASR with 🤗 Transformers

Jan 19

• 11

Article

License to Call: Introducing Transformers Agents 2.0

May 13

• 108

upvoted a collection 4 months ago

PaliGemma Release

Collection

Pretrained and mix checkpoints for PaliGemma • 16 items • Updated Jul 31 • 133

upvoted an article 5 months ago

Article

Synthetic data: save money, time and carbon with open source

Feb 16

• 45

upvoted a paper 5 months ago

WildChat: 1M ChatGPT Interaction Logs in the Wild

Paper • 2405.01470 • Published May 2 • 59

upvoted 3 articles 5 months ago

Article

Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints

May 1

• 61

Article

Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval

Mar 22

• 55

Article

Total noob’s intro to Hugging Face Transformers

Mar 22

• 38

upvoted a paper 5 months ago

Iterative Reasoning Preference Optimization

Paper • 2404.19733 • Published Apr 30 • 46

upvoted 3 articles 5 months ago

Article

Improving Prompt Consistency with Structured Generations

Apr 30

• 52

Article

🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets

By

•

Jun 4

• 67

Article

LLM Comparison/Test: Llama 3 Instruct 70B + 8B HF/GGUF/EXL2 (20 versions tested and compared!)

By

•

Apr 24

• 55

upvoted a paper 5 months ago

Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models

Paper • 2404.12387 • Published Apr 18 • 38

upvoted a paper 7 months ago

StructLM: Towards Building Generalist Models for Structured Knowledge Grounding

Paper • 2402.16671 • Published Feb 26 • 26

upvoted a collection 10 months ago

Seamless Communication

Collection

A significant step towards removing language barriers through expressive, fast and high-quality AI translation. • 16 items • Updated Jan 16 • 144

Reza Sayar PRO

AI & ML interests

Organizations

Reza2kn's activity

Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging

Tool Use, Unified

Welcome FalconMamba: The first strong attention-free 7B model

Your AI, Everywhere

The case for specialized pre-training: ultra-fast foundation models for dedicated tasks

SmolLM - blazingly fast and remarkably powerful

The Rise of Agentic Data Generation

Unleash ML Power on iOS: Apple Silicon Optimization Secrets

How we leveraged distilabel to create an Argilla 2.0 Chatbot

The Great LLM Showdown: Amy's Quest for the Perfect LLM

Fine-Tune Wav2Vec2 for English ASR with 🤗 Transformers

BrAIn: next generation neurons?

Introducing NPC-Playground, a 3D playground to interact with LLM-powered NPCs

Sales Forecasting with Image Regression

Fine-Tune W2V2-Bert for low-resource ASR with 🤗 Transformers

License to Call: Introducing Transformers Agents 2.0

Synthetic data: save money, time and carbon with open source

Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints

Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval

Total noob’s intro to Hugging Face Transformers

Improving Prompt Consistency with Structured Generations

🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets

LLM Comparison/Test: Llama 3 Instruct 70B + 8B HF/GGUF/EXL2 (20 versions tested and compared!)