MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models Paper • 2410.13085 • Published 21 days ago • 20
Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens Paper • 2410.13863 • Published 21 days ago • 35
PopAlign: Diversifying Contrasting Patterns for a More Comprehensive Alignment Paper • 2410.13785 • Published 21 days ago • 18
BenTo: Benchmark Task Reduction with In-Context Transferability Paper • 2410.13804 • Published 21 days ago • 20
MoH: Multi-Head Attention as Mixture-of-Head Attention Paper • 2410.11842 • Published 23 days ago • 20
DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control Paper • 2410.13830 • Published 21 days ago • 23