DELTA: Dense Efficient Long-range 3D Tracking for any video • Paper • 2410.24211 • Published 10 days ago • 8
Learning Video Representations without Natural Videos • Paper • 2410.24213 • Published 10 days ago • 14
A Pointer Network-based Approach for Joint Extraction and Detection of Multi-Label Multi-Class Intents • Paper • 2410.22476 • Published 12 days ago • 24
Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders • Paper • 2410.22366 • Published 13 days ago • 71
Toxicity of the Commons: Curating Open-Source Pre-Training Data • Paper • 2410.22587 • Published 11 days ago • 8
SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation • Paper • 2410.23277 • Published 11 days ago • 7
ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference • Paper • 2410.21465 • Published 13 days ago • 9
LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior • Paper • 2410.21264 • Published 13 days ago • 8
COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training • Paper • 2410.19313 • Published 16 days ago • 18
GrounDiT: Grounding Diffusion Transformers via Noisy Patch Transplantation • Paper • 2410.20474 • Published 14 days ago • 13
MarDini: Masked Autoregressive Diffusion for Video Generation at Scale • Paper • 2410.20280 • Published 14 days ago • 21
DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation • Paper • 2410.18666 • Published 17 days ago • 17
FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality • Paper • 2410.19355 • Published 16 days ago • 20
ROCKET-1: Master Open-World Interaction with Visual-Temporal Context Prompting • Paper • 2410.17856 • Published 18 days ago • 48
Allegro: Open the Black Box of Commercial-Level Video Generation Model • Paper • 2410.15458 • Published 21 days ago • 40
Stable Consistency Tuning: Understanding and Improving Consistency Models • Paper • 2410.18958 • Published 17 days ago • 9