MobileLLM Collection: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) • https://arxiv.org/abs/2402.14905 • 8 items
VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models • Paper 2409.17066 • Published Sep 25, 2024
High Fidelity Text-Guided Music Generation and Editing via Single-Stage Flow Matching • Paper 2407.03648 • Published Jul 4, 2024
nGPT: Normalized Transformer with Representation Learning on the Hypersphere • Paper 2410.01131 • Published Oct 1, 2024
On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability • Paper 2409.19924 • Published Sep 30, 2024
TICKing All the Boxes: Generated Checklists Improve LLM Evaluation and Generation • Paper 2410.03608 • Published Oct 4, 2024
Thinking LLMs: General Instruction Following with Thought Generation • Paper 2410.10630 • Published Oct 2024
Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models • Paper 2410.07985 • Published Oct 2024
Connecting the Dots: LLMs can Infer and Verbalize Latent Structure from Disparate Training Data • Paper 2406.14546 • Published Jun 20, 2024
Imagine yourself: Tuning-Free Personalized Image Generation • Paper 2409.13346 • Published Sep 20, 2024
A Preliminary Study of o1 in Medicine: Are We Closer to an AI Doctor? • Paper 2409.15277 • Published Sep 23, 2024
Seeing Faces in Things: A Model and Dataset for Pareidolia • Paper 2409.16143 • Published Sep 24, 2024
MonoFormer: One Transformer for Both Diffusion and Autoregression • Paper 2409.16280 • Published Sep 24, 2024
OmniBench: Towards The Future of Universal Omni-Language Models • Paper 2409.15272 • Published Sep 23, 2024