Addition is All You Need for Energy-efficient Language Models Paper • 2410.00907 • Published 6 days ago • 82
Phantom of Latent for Large Language and Vision Models Paper • 2409.14713 • Published 15 days ago • 27
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning Paper • 2409.12183 • Published 19 days ago • 35
Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale Paper • 2409.08264 • Published 25 days ago • 42
MemoRAG: Moving towards Next-Gen RAG Via Memory-Inspired Knowledge Discovery Paper • 2409.05591 • Published 28 days ago • 26
LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference Paper • 2407.14057 • Published Jul 19 • 43