Yuheng Zhang's picture

1 2

Yuheng Zhang

MatouK98

AI & ML interests

None yet

Organizations

MatouK98's activity

commented a paper 4 months ago

Iterative Nash Policy Optimization: Aligning LLMs with General Preferences via No-Regret Learning

Paper • 2407.00617 • Published Jun 30 • 7 •