Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models Paper • 2411.04996 • Published 2 days ago • 33
HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems Paper • 2411.02959 • Published 5 days ago • 52
Fish-Speech: Leveraging Large Language Models for Advanced Multilingual Text-to-Speech Synthesis Paper • 2411.01156 • Published 8 days ago • 4
Personalization of Large Language Models: A Survey Paper • 2411.00027 • Published 12 days ago • 28
Starbucks: Improved Training for 2D Matryoshka Embeddings Paper • 2410.13230 • Published 24 days ago • 9
Harnessing Webpage UIs for Text-Rich Visual Understanding Paper • 2410.13824 • Published 23 days ago • 29
LightRAG: Simple and Fast Retrieval-Augmented Generation Paper • 2410.05779 • Published Oct 8 • 1
MLLM as Retriever: Interactively Learning Multimodal Retrieval for Embodied Agents Paper • 2410.03450 • Published Oct 4 • 32
TableRAG: Million-Token Table Understanding with Language Models Paper • 2410.04739 • Published Oct 7 • 1
Horizon-Length Prediction: Advancing Fill-in-the-Middle Capabilities for Code Generation with Lookahead Planning Paper • 2410.03103 • Published Oct 4 • 6
Tutor CoPilot: A Human-AI Approach for Scaling Real-Time Expertise Paper • 2410.03017 • Published Oct 3 • 25
EmbedLLM: Learning Compact Representations of Large Language Models Paper • 2410.02223 • Published Oct 3 • 3
Distilling an End-to-End Voice Assistant Without Instruction Training Data Paper • 2410.02678 • Published Oct 3 • 22
L-CiteEval: Do Long-Context Models Truly Leverage Context for Responding? Paper • 2410.02115 • Published Oct 3 • 10
From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical Debugging Paper • 2410.01215 • Published Oct 2 • 30
MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models Paper • 2409.17481 • Published Sep 26 • 46