SurgeGlobal/OpenBezoar-HH-RLHF-SFT

chansurgeplus committed on Apr 18

Commit 3a676b9 • 1 Parent(s): 85e289b

Fixed a minor typo.

Files changed (1)

README.md +1 -1

@@ -20,7 +20,7 @@ The OpenBezoar-HH-RLHF-SFT is an LLM that has been further instruction fine tune
 ### Model Description
-OpenBezoar-SFT is an LLM that is built upon the OpenLLaMA 3B v2 architecture. Primary purpose of performing SFT on [OpenBezoar-SFT](https://huggingface.co/SurgeGlobal/OpenBezoar-SFT) is to minimize the distribution shift before applying Direct Preference Optimization (DPO) for human preferences alignment. For more information please refer to our paper.
+OpenBezoar-HH-RLHF-SFT is an LLM that is built upon the OpenLLaMA 3B v2 architecture. Primary purpose of performing SFT on [OpenBezoar-SFT](https://huggingface.co/SurgeGlobal/OpenBezoar-SFT) is to minimize the distribution shift before applying Direct Preference Optimization (DPO) for human preferences alignment. For more information please refer to our paper.
 ### Model Sources