Daniel Korat

danielkorat

AI & ML interests

Inference acceleration, Low-resource NLP, Few-shot learning

Articles

Accelerate StarCoder with 🤗 Optimum Intel on Xeon: Q8/Q4 and Speculative Decoding

Jan 30

• 4

SetFitABSA: Few-Shot Aspect Based Sentiment Analysis using SetFit

Dec 6, 2023

• 6

SetFit: Efficient Few-Shot Learning Without Prompts

Sep 26, 2022

• 18

danielkorat's activity

upvoted 3 articles about 1 month ago

Article

Assisted Generation: a new direction toward low-latency text generation

May 11, 2023

• 28

Article

Blazing Fast SetFit Inference with 🤗 Optimum Intel on Xeon

Apr 3

• 9

Article

Faster Assisted Generation with Dynamic Speculation

Oct 8

• 30

upvoted an article 3 months ago

Article

SetFit: Efficient Few-Shot Learning Without Prompts

Sep 26, 2022

• 18

upvoted a paper 3 months ago

RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation

Paper • 2408.02545 • Published Aug 5 • 33

upvoted an article 4 months ago

Article

Our Transformers Code Agent beats the GAIA benchmark!

Jul 1

• 46

upvoted an article 5 months ago

Article

Training and Finetuning Embedding Models with Sentence Transformers v3

May 28

• 155

upvoted 2 papers 6 months ago

Accelerating Speculative Decoding using Dynamic Speculation Length

Paper • 2405.04304 • Published May 7 • 2

Distributed Speculative Inference of Large Language Models

Paper • 2405.14105 • Published May 23 • 16

upvoted 2 articles 6 months ago

Article

Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon

May 9

• 11

Article

Introducing the Open Leaderboard for Hebrew LLMs!

May 5

• 32

upvoted a paper 9 months ago

Improving Classification Performance With Human Feedback: Label a few, we label the rest

Paper • 2401.09555 • Published Jan 17 • 6

upvoted a paper over 1 year ago

H_2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models

Paper • 2306.14048 • Published Jun 24, 2023 • 11

Articles

Universal Assisted Generation: Faster Decoding with Any Assistant Model
