qicao-apple committed on
Commit 5607895
1 Parent(s): 7f6a2a3

add OpenELM-3B

Files changed (1)
  1. README.md +26 -0
README.md CHANGED
@@ -161,3 +161,29 @@ lm_eval --model hf \
 ## Bias, Risks, and Limitations
 
 The release of OpenELM models aims to empower and enrich the open research community by providing access to state-of-the-art language models. Trained on publicly available datasets, these models are made available without any safety guarantees. Consequently, there exists the possibility of these models producing outputs that are inaccurate, harmful, biased, or objectionable in response to user prompts. Thus, it is imperative for users and developers to undertake thorough safety testing and implement appropriate filtering mechanisms tailored to their specific requirements.
+
+## Citation
+
+If you find our work useful, please cite:
+
+```BibTex
+@article{mehtaOpenELMEfficientLanguage2024,
+  title = {{OpenELM}: {An} {Efficient} {Language} {Model} {Family} with {Open}-source {Training} and {Inference} {Framework}},
+  shorttitle = {{OpenELM}},
+  url = {https://arxiv.org/abs/2404.14619v1},
+  language = {en},
+  urldate = {2024-04-24},
+  journal = {arXiv.org},
+  author = {Mehta, Sachin and Sekhavat, Mohammad Hossein and Cao, Qingqing and Horton, Maxwell and Jin, Yanzi and Sun, Chenfan and Mirzadeh, Iman and Najibi, Mahyar and Belenko, Dmitry and Zatloukal, Peter and Rastegari, Mohammad},
+  month = apr,
+  year = {2024},
+}
+
+@inproceedings{mehta2022cvnets,
+  author = {Mehta, Sachin and Abdolhosseini, Farzad and Rastegari, Mohammad},
+  title = {CVNets: High Performance Library for Computer Vision},
+  year = {2022},
+  booktitle = {Proceedings of the 30th ACM International Conference on Multimedia},
+  series = {MM '22}
+}
+```