Training-free Regional Prompting for Diffusion Transformers Paper • 2411.02395 • Published 6 days ago • 22
How Far is Video Generation from World Model: A Physical Law Perspective Paper • 2411.02385 • Published 6 days ago • 27
OS-ATLAS: A Foundation Action Model for Generalist GUI Agents Paper • 2410.23218 • Published 11 days ago • 43
Flow-DPO: Improving LLM Mathematical Reasoning through Online Multi-Agent Learning Paper • 2410.22304 • Published 12 days ago • 14
GENOME: GenerativE Neuro-symbOlic visual reasoning by growing and reusing ModulEs Paper • 2311.04901 • Published Nov 8, 2023 • 7