-
Large-scale Reinforcement Learning for Diffusion Models
Paper • 2401.12244 • Published • 28 -
Enhancing Diffusion Models with Text-Encoder Reinforcement Learning
Paper • 2311.15657 • Published • 2 -
PRDP: Proximal Reward Difference Prediction for Large-Scale Reward Finetuning of Diffusion Models
Paper • 2402.08714 • Published • 10 -
Real-World Fluid Directed Rigid Body Control via Deep Reinforcement Learning
Paper • 2402.06102 • Published • 4
Collections
Discover the best community collections!
Collections including paper arxiv:2402.03570
-
Diffusion World Model
Paper • 2402.03570 • Published • 7 -
Iterative Data Smoothing: Mitigating Reward Overfitting and Overoptimization in RLHF
Paper • 2401.16335 • Published • 1 -
Towards Efficient and Exact Optimization of Language Model Alignment
Paper • 2402.00856 • Published -
ODIN: Disentangled Reward Mitigates Hacking in RLHF
Paper • 2402.07319 • Published • 13
-
Faster Diffusion: Rethinking the Role of UNet Encoder in Diffusion Models
Paper • 2312.09608 • Published • 13 -
CodeFusion: A Pre-trained Diffusion Model for Code Generation
Paper • 2310.17680 • Published • 69 -
ZeroNVS: Zero-Shot 360-Degree View Synthesis from a Single Real Image
Paper • 2310.17994 • Published • 8 -
Progressive Knowledge Distillation Of Stable Diffusion XL Using Layer Level Loss
Paper • 2401.02677 • Published • 21
-
"I Want It That Way": Enabling Interactive Decision Support Using Large Language Models and Constraint Programming
Paper • 2312.06908 • Published • 5 -
CogAgent: A Visual Language Model for GUI Agents
Paper • 2312.08914 • Published • 29 -
Diffusion World Model
Paper • 2402.03570 • Published • 7 -
Fine-Tuned Language Models Generate Stable Inorganic Materials as Text
Paper • 2402.04379 • Published • 7