Visual Question Decomposition on Multimodal Large Language Models Paper • 2409.19339 • Published 9 days ago • 7
Cottention: Linear Transformers With Cosine Attention Paper • 2409.18747 • Published 10 days ago • 14
INTRA: Interaction Relationship-aware Weakly Supervised Affordance Grounding Paper • 2409.06210 • Published 27 days ago • 24