Why Does the Effective Context Length of LLMs Fall Short? Paper • 2410.18745 • Published 13 days ago • 16
Scaling Diffusion Language Models via Adaptation from Autoregressive Models Paper • 2410.17891 • Published 14 days ago • 15
Diffusion Model Alignment Using Direct Preference Optimization Paper • 2311.12908 • Published Nov 21, 2023 • 47