liminalism (Lim)

upvoted a paper 6 months ago

Better & Faster Large Language Models via Multi-token Prediction

Paper • 2404.19737 • Published Apr 30 • 73

upvoted 2 papers 7 months ago

Make Your LLM Fully Utilize the Context

Paper • 2404.16811 • Published Apr 25 • 52

OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework

Paper • 2404.14619 • Published Apr 22 • 124

upvoted a collection 7 months ago

OpenELM Instruct Models

Collection

4 items • Updated Oct 4 • 113

upvoted 3 papers 7 months ago

FlowMind: Automatic Workflow Generation with LLMs

Paper • 2404.13050 • Published Mar 17 • 32

LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders

Paper • 2404.05961 • Published Apr 9 • 64

RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation

Paper • 2403.05313 • Published Mar 8 • 9

upvoted an article 7 months ago

Article

CodeGemma - an official Google release for code LLMs

Apr 9

• 99

upvoted a paper 7 months ago

ReFT: Representation Finetuning for Language Models

Paper • 2404.03592 • Published Apr 4 • 89

upvoted 2 collections 7 months ago

🤖 Agents

Collection

16 items • Updated Sep 6 • 34

MoEs papers reading list

Collection

60 items • Updated 6 days ago • 134

upvoted a paper 7 months ago

Jamba: A Hybrid Transformer-Mamba Language Model

Paper • 2403.19887 • Published Mar 28 • 104

upvoted 2 papers 8 months ago

LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement

Paper • 2403.15042 • Published Mar 22 • 25

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Paper • 2403.09611 • Published Mar 14 • 124

upvoted a paper 10 months ago

Mixtral of Experts

Paper • 2401.04088 • Published Jan 8 • 157

upvoted a paper 11 months ago

LLM in a flash: Efficient Large Language Model Inference with Limited Memory

Paper • 2312.11514 • Published Dec 12, 2023 • 258

Lim

AI & ML interests

Organizations

liminalism's activity

Better & Faster Large Language Models via Multi-token Prediction

Make Your LLM Fully Utilize the Context

OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework

OpenELM Instruct Models

FlowMind: Automatic Workflow Generation with LLMs

LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders

RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation

CodeGemma - an official Google release for code LLMs

ReFT: Representation Finetuning for Language Models

🤖 Agents

MoEs papers reading list

Jamba: A Hybrid Transformer-Mamba Language Model

LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Mixtral of Experts

LLM in a flash: Efficient Large Language Model Inference with Limited Memory