- Random Field Augmentations for Self-Supervised Representation Learning
  Paper • 2311.03629 • Published • 6
- Levels of AGI: Operationalizing Progress on the Path to AGI
  Paper • 2311.02462 • Published • 33
- Idempotent Generative Network
  Paper • 2311.01462 • Published • 24
- E3 TTS: Easy End-to-End Diffusion-based Text to Speech
  Paper • 2311.00945 • Published • 14
Collections
Collections including paper arxiv:2311.00895
- Matryoshka Diffusion Models
  Paper • 2310.15111 • Published • 40
- Data Filtering Networks
  Paper • 2309.17425 • Published • 6
- FlashDecoding++: Faster Large Language Model Inference on GPUs
  Paper • 2311.01282 • Published • 35
- E3 TTS: Easy End-to-End Diffusion-based Text to Speech
  Paper • 2311.00945 • Published • 14

- From Sparse to Dense: GPT-4 Summarization with Chain of Density Prompting
  Paper • 2309.04269 • Published • 32
- FIAT: Fusing learning paradigms with Instruction-Accelerated Tuning
  Paper • 2309.04663 • Published • 5
- Effective Long-Context Scaling of Foundation Models
  Paper • 2309.16039 • Published • 30
- Progressive3D: Progressively Local Editing for Text-to-3D Content Creation with Complex Semantic Prompts
  Paper • 2310.11784 • Published • 10

- Large-Scale Automatic Audiobook Creation
  Paper • 2309.03926 • Published • 53
- FoleyGen: Visually-Guided Audio Generation
  Paper • 2309.10537 • Published • 8
- MusicAgent: An AI Agent for Music Understanding and Generation with Large Language Models
  Paper • 2310.11954 • Published • 24
- UniAudio: An Audio Foundation Model Toward Universal Audio Generation
  Paper • 2310.00704 • Published • 19