steiner-preview Collection Reasoning models trained on synthetic data using reinforcement learning. • 3 items • Updated 21 days ago • 23
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models in 7 sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated Sep 18 • 307
How Do Your Code LLMs Perform? Empowering Code Instruction Tuning with High-Quality Data Paper • 2409.03810 • Published Sep 5 • 30
To Code, or Not To Code? Exploring Impact of Code in Pre-training Paper • 2408.10914 • Published Aug 20 • 40
LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs Paper • 2408.07055 • Published Aug 13 • 65
Minitron Collection A family of compressed models obtained via pruning and knowledge distillation • 9 items • Updated Oct 3 • 59
AgentInstruct: Toward Generative Teaching with Agentic Flows Paper • 2407.03502 • Published Jul 3 • 44
Scaling Synthetic Data Creation with 1,000,000,000 Personas Paper • 2406.20094 • Published Jun 28 • 94
Article BM25 for Python: Achieving high performance while simplifying dependencies with *BM25S*⚡ By xhluca • Jul 9 • 36
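The BM25S article above covers a fast, lightweight BM25 implementation for Python. As a rough illustration only, here is a minimal sketch of how the bm25s package is typically used, assuming its documented tokenize/index/retrieve API; the toy corpus and query are made up for this example.

```python
import bm25s

# Tiny illustrative corpus (not from the article).
corpus = [
    "a cat is a feline and likes to purr",
    "a dog is the human's best friend and loves to play",
    "a bird is a beautiful animal that can fly",
]

# Tokenize the corpus (optionally dropping English stopwords) and build the index.
retriever = bm25s.BM25()
retriever.index(bm25s.tokenize(corpus, stopwords="en"))

# Retrieve the top-1 document for a query; results holds the indices of the
# best-matching documents and scores their BM25 scores.
results, scores = retriever.retrieve(bm25s.tokenize("does the fish purr like a cat?"), k=1)
print(corpus[results[0, 0]], scores[0, 0])
```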
Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models Paper • 2406.13542 • Published Jun 19 • 16
Article BigCodeBench: Benchmarking Large Language Models on Solving Practical and Challenging Programming Tasks • Jun 18 • 39