HuggingFaceH4/zephyr-orpo-141b-A35b-v0.1


lewtun (HF staff) committed on Apr 11

Commit d0d4b03 • 1 Parent(s): 5351827

Update README.md

Files changed (1)

README.md +28 -0


@@ -117,3 +117,31 @@ The following hyperparameters were used during training:
 - Pytorch 2.1.2+cu121
 - Datasets 2.18.0
 - Tokenizers 0.15.1

+## Citation
+If you find Zephyr 141B-A35B useful in your work, please cite the ORPO paper:
+```
+@misc{hong2024orpo,
+      title={ORPO: Monolithic Preference Optimization without Reference Model},
+      author={Jiwoo Hong and Noah Lee and James Thorne},
+      year={2024},
+      eprint={2403.07691},
+      archivePrefix={arXiv},
+      primaryClass={cs.CL}
+}
+```
+You may also wish to cite the creators of this model:
+```
+@misc{zephyr_141b,
+  author = {Alvaro Bartolome and Jiwoo Hong and Noah Lee and Lewis Tunstall},
+  title = {Zephyr 141B A35B},
+  year = {2024},
+  publisher = {Hugging Face},
+  journal = {Hugging Face repository},
+  howpublished = {\url{https://huggingface.co/HuggingFaceH4/zephyr-orpo-141b-A35b-v0.1}}
+}
+```