---
datasets:
- Anthropic/hh-rlhf
---
A stubby li'l gpt2 classifier trained on Anthropic's hh-rlhf dataset.
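
This card doesn't say how the training labels were built. As a rough sketch of one common setup (an assumption, not necessarily what was done here): each hh-rlhf row has a `chosen` and a `rejected` conversation, and a classifier can be trained by flattening each pair into two labeled texts. The helper name `pairs_to_examples`, the 1/0 label scheme, and the toy row are all made up for illustration.

```python
from typing import Dict, List, Tuple


def pairs_to_examples(rows: List[Dict[str, str]]) -> List[Tuple[str, int]]:
    """Flatten preference pairs into (text, label) classification examples.

    Assumed label scheme (not confirmed by the card):
    1 = the "chosen" conversation, 0 = the "rejected" one.
    """
    examples: List[Tuple[str, int]] = []
    for row in rows:
        examples.append((row["chosen"], 1))
        examples.append((row["rejected"], 0))
    return examples


# A made-up row in the same shape as hh-rlhf (real rows come from the dataset).
toy_rows = [
    {
        "chosen": "\n\nHuman: Hi\n\nAssistant: Hello! How can I help?",
        "rejected": "\n\nHuman: Hi\n\nAssistant: Go away.",
    },
]

examples = pairs_to_examples(toy_rows)
for text, label in examples:
    print(label, repr(text))
```

The resulting `(text, label)` pairs could then be tokenized and fed to any sequence classifier; the real rows would come from `Anthropic/hh-rlhf` instead of `toy_rows`.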