VidPanos: Generative Panoramic Videos from Casual Panning Videos Paper β’ 2410.13832 β’ Published 23 days ago β’ 12
Llama-3.1-Nemotron-70B Collection SOTA models on Arena Hard and RewardBench as of 1 Oct 2024. β’ 6 items β’ Updated 25 days ago β’ 129
Semantic Image Inversion and Editing using Rectified Stochastic Differential Equations Paper β’ 2410.10792 β’ Published 26 days ago β’ 26
Animate-X: Universal Character Image Animation with Enhanced Motion Representation Paper β’ 2410.10306 β’ Published 27 days ago β’ 50
IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation Paper β’ 2410.07171 β’ Published Oct 9 β’ 41
TextToon: Real-Time Text Toonify Head Avatar from Single Video Paper β’ 2410.07160 β’ Published Sep 23 β’ 8
TweedieMix: Improving Multi-Concept Fusion for Diffusion-based Image/Video Generation Paper β’ 2410.05591 β’ Published Oct 8 β’ 13
π Leaderboards & Arenas ζθ‘ζ¦εθ―ζ΅εΊε Collection 19 items β’ Updated 17 days ago β’ 5
DreamWaltz-G: Expressive 3D Gaussian Avatars from Skeleton-Guided 2D Diffusion Paper β’ 2409.17145 β’ Published Sep 25 β’ 13
PhysGen: Rigid-Body Physics-Grounded Image-to-Video Generation Paper β’ 2409.18964 β’ Published Sep 27 β’ 25
LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D-awareness Paper β’ 2409.18125 β’ Published Sep 26 β’ 33
Synchronize Dual Hands for Physics-Based Dexterous Guitar Playing Paper β’ 2409.16629 β’ Published Sep 25 β’ 9
Molmo Collection Artifacts for open multimodal language models. β’ 5 items β’ Updated Sep 26 β’ 269
Time-MoE: Billion-Scale Time Series Foundation Models with Mixture of Experts Paper β’ 2409.16040 β’ Published Sep 24 β’ 13
MonoFormer: One Transformer for Both Diffusion and Autoregression Paper β’ 2409.16280 β’ Published Sep 24 β’ 17
MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling Paper β’ 2409.16160 β’ Published Sep 24 β’ 32