Update README.md

README.md CHANGED

````diff
@@ -5,19 +5,19 @@ language:
 tags:
 - LLM
 - tensorRT
--
+- ChatGLM
 ---
 ## Model Card for lyraChatGLM
 
-lyraChatGLM is currently the **fastest
+lyraChatGLM is currently the **fastest ChatGLM-6B** available. To the best of our knowledge, it is the **first accelerated version of ChatGLM-6B**.
 
-The inference speed of lyraChatGLM
+The inference speed of lyraChatGLM has achieved **10x** acceleration upon the original version. We are still working hard to further improve the performance.
 
 Among its main features are:
 
 - weights: original ChatGLM-6B weights released by THUDM.
-- device: lyraChatGLM is mainly based on FasterTransformer compiled for SM=80 (A100, for example).
-- batch_size:
+- device: lyraChatGLM is mainly based on FasterTransformer compiled for SM=80 (A100, for example), but a lot faster.
+- batch_size: compiled with dynamic batch size, max batch_size = 8
 
 ## Speed
 
@@ -87,7 +87,7 @@ print(res)
 ``` bibtex
 @Misc{lyraChatGLM2023,
   author = {Kangjian Wu, Zhengtao Wang, Bin Wu},
-  title = {
+  title = {lyraChatGLM: Accelerating ChatGLM by 10x+},
   howpublished = {\url{https://huggingface.co/TMElyralab/lyraChatGLM}},
   year = {2023}
 }
 ```
````
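The updated card states the engine is compiled with a dynamic batch size capped at 8, so a caller with more prompts than that has to split them into chunks before each inference call. A minimal, generic sketch of that batching step (the `chunk_prompts` helper is illustrative only, not part of the lyraChatGLM API):

```python
def chunk_prompts(prompts, max_batch=8):
    """Split a prompt list into batches no larger than max_batch,
    matching the max batch_size = 8 the engine was compiled with."""
    return [prompts[i:i + max_batch] for i in range(0, len(prompts), max_batch)]

# 20 prompts are split into batches of 8, 8, and 4.
batches = chunk_prompts([f"prompt {i}" for i in range(20)])
print([len(b) for b in batches])  # [8, 8, 4]
```

Each resulting batch can then be passed to the model in turn; with a dynamic-batch build, the final partial batch (here, 4 prompts) runs without padding up to the maximum.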