HuggingFaceH4/orca_dpo_pairs
Viewer
•
Updated
•
12.9k
•
149
•
24
A collection of datasets and models used for the Aligning LLMs with Direct Preference Optimization Methods blogpost