jtatman
/

gpt2-open-instruct-v1-Anthropic-hh-rlhf

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

jtatman commited on Jul 22, 2023

Commit

fc55243

•

1 Parent(s): 114a265

Filled out card

Files changed (1) hide show

README.md +13 -5

README.md CHANGED Viewed

@@ -13,21 +13,29 @@ should probably proofread and complete it, then remove this comment. -->
 # hh-rlhf
-This model is a fine-tuned version of [vicgalle/gpt2-open-instruct-v1](https://huggingface.co/vicgalle/gpt2-open-instruct-v1) on an unknown dataset.
 It achieves the following results on the evaluation set:
 - Loss: 2.1534
 ## Model description
-More information needed
 ## Intended uses & limitations
-More information needed
 ## Training and evaluation data
-More information needed
 ## Training procedure

 # hh-rlhf
+This model is a fine-tuned version of [vicgalle/gpt2-open-instruct-v1](https://huggingface.co/vicgalle/gpt2-open-instruct-v1) on an subset (15k) of the Anthropic/hh-rlhf dataset.
 It achieves the following results on the evaluation set:
 - Loss: 2.1534
 ## Model description
+GPT2 open instruct was trained on the open-instruct dataset fully. The reimagines one LM head as a partial rhlf adapter, with subtle reinforcements.
 ## Intended uses & limitations
+Intended to study the intersection of instruct models and prompting that focuses on subtle exchanges of prompting. This probably needs to be refined substantially at this point.
 ## Training and evaluation data
+Train dataset size: 15000
+Test dataset size: 500
+Dataset({
+    features: ['chosen', 'rejected'],
+    num_rows: 15000
+})
+Dataset({
+    features: ['chosen', 'rejected'],
+    num_rows: 500
+})
 ## Training procedure