facat committed on
Commit
d5ab356
1 Parent(s): 81b208a

Update README.md

  1. README.md +12 -13
README.md CHANGED
@@ -87,20 +87,20 @@ pipeline_tag: text-generation
  # Inrtoduction
 
  <img src="https://hackmd.io/_uploads/HJlDtzhBa.png" id="fig-sus"
- alt="Figure 1: DALL·E 2023-12-01 11.03.28 - An imposing, majestic wild boar combined with elements of a futuristic transformer robot. The boar itself should be intricately blended with these tra" />
+ alt="Figure 1: DALL·E 2023-12-01 11.03.28 - An imposing, majestic wild boar combined with elements of a futuristic transformer robot. The boar itself should be intricately blended with these tra" />
 
  **SUS-Chat** is a 34B bilingual Chinese-English dialogue model, jointly
  released by the **Southern University of Science and Technology** and
  **Cognitive Computing and Natural Language Center of International
- Digital Economy Academy (IDEA-CCNL)**. The SUS-Chat-34B model has been
- fine-tuned on millions of high-quality, multilingual instruction data.
- While maintaining the strong language capabilities of the base model,
- the SUS-Chat-34B model has improved the model’s response to human
- instructions through high-quality instruction fine-tuning and excels at
- imitating human thought processes through chains of thought. It
- introduces inter-instruction attention sharing in long texts, expanding
- the window size from 4K to 8K, significantly enhancing the usability of
- multi-round dialogues.
+ Digital Economy Academy (IDEA-CCNL)**. This model is based on
+ `01-ai/Yi-34B` and has been fine-tuned on millions of high-quality,
+ multilingual instruction data. While maintaining the strong language
+ capabilities of the base model, the SUS-Chat-34B model has improved the
+ model’s response to human instructions through high-quality instruction
+ fine-tuning and excels at imitating human thought processes through
+ chains of thought. It introduces inter-instruction attention sharing in
+ long texts, expanding the window size from 4K to 8K, significantly
+ enhancing the usability of multi-round dialogues.
 
  It has surpassed all models of the same size in almost all benchmark
  tests and is better suited to meet the practical needs of complex
@@ -147,7 +147,7 @@ similar scale and achieved the most advanced comprehensive performance.
 
  <img
  src="https://github.com/SUSTech-IDEA/SUS-Chat/raw/main/assets/radar.png"
- id="fig-bench" alt="Figure 2: Benchmark" />
+ id="fig-bench" alt="Figure 2: Benchmark" />
 
  # Usage
 
@@ -231,5 +231,4 @@ model.
  This model is developed entirely for academic research and free
  commercial use, but it must adhere to the
  [license](https://github.com/SUSTech-IDEA/SUS-Chat/blob/main/MODEL_LICENSE_AGREEMENT.txt)
- from 01-ai.
-
+ from 01-ai.