Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
umarigan
's Collections
DPO Dataset
Computer Vision Datasets
Domain Spec. Datasets
Turkish Datasets
TR Models
Turkish LLM Fine-Tune Datasets
DPO Dataset
updated
Mar 20
direct preference optimization related datasets
Upvote
-
argilla/reward-model-data-falcon
Viewer
•
Updated
Jun 7, 2023
•
7.4k
•
43
•
1
jondurbin/gutenberg-dpo-v0.1
Viewer
•
Updated
Jan 12
•
918
•
1.78k
•
124
ybisk/piqa
Updated
Jan 18
•
116k
•
85
Dahoas/rm-hh-rlhf
Viewer
•
Updated
Dec 22, 2022
•
89.5k
•
146
•
2
duxx/distilabel-intel-orca-dpo-pairs-tr
Viewer
•
Updated
Feb 5
•
3.98k
•
49
•
6
Dahoas/rm_instruct_helpful_preferences
Viewer
•
Updated
Mar 1, 2023
•
90.7k
•
42
•
4
Dahoas/1B_hh_sft_ppo_comparison
Viewer
•
Updated
Jan 26, 2023
•
100
•
8
abacusai/MetaMath_DPO_FewShot
Viewer
•
Updated
Feb 26
•
395k
•
127
•
25
abacusai/HellaSwag_DPO_FewShot
Viewer
•
Updated
Feb 26
•
150k
•
56
•
8
Upvote
-
Share collection
View history
Collection guide
Browse collections