Korean Reward Modeling - a heegyu Collection

Updated May 29

Korean datasets and reward models for RLHF

heegyu/KoSafeGuard-8b-0503
Text Generation • Updated 16 days ago • 111 • 5

heegyu/ko-reward-model-helpful-1.3b-v0.2
Text Classification • Updated Jan 10 • 15

heegyu/ko-reward-model-safety-1.3b-v0.2
Text Classification • Updated Jan 13 • 15 • 5

heegyu/ko-reward-model-helpful-roberta-large-v0.1
Text Classification • Updated Dec 31, 2023 • 8 • 1

heegyu/ko-reward-model-safety-roberta-large-v0.1
Text Classification • Updated Dec 31, 2023 • 5

heegyu/ko-reward-model-1.3b-v0.1
Text Classification • Updated Dec 7, 2023 • 7 • 1

heegyu/ko-reward-model-1.3b-v0
Text Classification • Updated Dec 1, 2023 • 39 • 1

heegyu/ko-ultrafeedback-binarized-1.3b
Text Classification • Updated Nov 27, 2023 • 5 • 2

maywell/ko_Ultrafeedback_binarized
Viewer • Updated Nov 9, 2023 • 62k • 56 • 28

maywell/ko_hh-rlhf-20k_filtered
Viewer • Updated Nov 4, 2023 • 19.4k • 41 • 4

heegyu/hh-rlhf-ko
Viewer • Updated Dec 24, 2023 • 169k • 85 • 3

heegyu/PKU-SafeRLHF-ko
Viewer • Updated Dec 31, 2023 • 320k • 48 • 4

heegyu/webgpt_comparisons_ko
Viewer • Updated Dec 5, 2023 • 19.6k • 17 • 2

SJ-Donald/orca-dpo-pairs-ko
Viewer • Updated Jan 24 • 36k • 81 • 7