XMHe's picture

51 168

XMHe

Northrend

·

AI & ML interests

None yet

Organizations

None yet

Northrend's activity

upvoted 6 papers 7 days ago

Minimum Entropy Coupling with Bottleneck

Paper • 2410.21666 • Published 12 days ago • 4

DELTA: Dense Efficient Long-range 3D Tracking for any video

Paper • 2410.24211 • Published 10 days ago • 8

Learning Video Representations without Natural Videos

Paper • 2410.24213 • Published 10 days ago • 14

SelfCodeAlign: Self-Alignment for Code Generation

Paper • 2410.24198 • Published 10 days ago • 20

A Pointer Network-based Approach for Joint Extraction and Detection of Multi-Label Multi-Class Intents

Paper • 2410.22476 • Published 12 days ago • 24

Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders

Paper • 2410.22366 • Published 13 days ago • 71

upvoted 2 papers 9 days ago

Toxicity of the Commons: Curating Open-Source Pre-Training Data

Paper • 2410.22587 • Published 11 days ago • 8

SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation

Paper • 2410.23277 • Published 11 days ago • 7

upvoted a paper 10 days ago

ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference

Paper • 2410.21465 • Published 13 days ago • 9

upvoted 6 papers 11 days ago

LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior

Paper • 2410.21264 • Published 13 days ago • 8

COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training

Paper • 2410.19313 • Published 16 days ago • 18

GrounDiT: Grounding Diffusion Transformers via Noisy Patch Transplantation

Paper • 2410.20474 • Published 14 days ago • 13

MarDini: Masked Autoregressive Diffusion for Video Generation at Scale

Paper • 2410.20280 • Published 14 days ago • 21

A Survey of Small Language Models

Paper • 2410.20011 • Published 15 days ago • 36

GPT-4o System Card

Paper • 2410.21276 • Published 16 days ago • 76

upvoted 3 papers 12 days ago

DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation

Paper • 2410.18666 • Published 17 days ago • 17

FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality

Paper • 2410.19355 • Published 16 days ago • 20

ROCKET-1: Master Open-World Interaction with Visual-Temporal Context Prompting

Paper • 2410.17856 • Published 18 days ago • 48

upvoted a paper 13 days ago

Allegro: Open the Black Box of Commercial-Level Video Generation Model

Paper • 2410.15458 • Published 21 days ago • 40

upvoted a paper 14 days ago

Stable Consistency Tuning: Understanding and Improving Consistency Models

Paper • 2410.18958 • Published 17 days ago • 9