Remember, Retrieve and Generate: Understanding Infinite Visual Concepts as Your Personalized Assistant Paper • 2410.13360 • Published 23 days ago • 8
MedMobile: A mobile-sized language model with expert-level clinical capabilities Paper • 2410.09019 • Published 28 days ago • 8
Failing Forward: Improving Generative Error Correction for ASR with Synthetic Data and Retrieval Augmentation Paper • 2410.13198 • Published 23 days ago • 9
BenTo: Benchmark Task Reduction with In-Context Transferability Paper • 2410.13804 • Published 22 days ago • 20
DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control Paper • 2410.13830 • Published 22 days ago • 23
Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens Paper • 2410.13863 • Published 22 days ago • 35
MoH: Multi-Head Attention as Mixture-of-Head Attention Paper • 2410.11842 • Published 24 days ago • 20
MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models Paper • 2410.13085 • Published 23 days ago • 20
PopAlign: Diversifying Contrasting Patterns for a More Comprehensive Alignment Paper • 2410.13785 • Published 22 days ago • 18
StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization Paper • 2410.08815 • Published 28 days ago • 41