Critique-out-Loud Reward Models Paper: https://arxiv.org/abs/2408.11791 | Code: https://github.com/zankner/CLoud ankner/Llama3-8B-CLoud-RM Updated 25 days ago • 468 ankner/Llama3-8B-Classic-RM Updated 24 days ago • 149 ankner/Llama3-70B-CLoud-RM Updated 22 days ago • 8 • 1 ankner/Llama3-70B-Classic-RM Updated 22 days ago • 10