Update README.md
README.md
CHANGED
@@ -46,7 +46,9 @@ Using the Hermes-3-Llama-3.1-70B base model, through 8 H100-80Gs, C
 -Model strengthened for high-dimensional analysis of customer reviews and social posts, plus coding, writing, math, and logical reasoning<br>
 -128k-Context Window<br>
 -Supports Korean Function Call and Tool Calling<br>
--Uses Deepspeed Stage=3, rslora, and BAdam Layer Mode <br
+-Uses Deepspeed Stage=3, rslora, and BAdam Layer Mode <br>
+-ollama run benedict/linkbricks-hermes3-llama3.1-70b-korean-advanced-q4
+<br><br>
 
 Finetuned by Mr. Yunsung Ji (Saxo), a data scientist at Linkbricks, a company specializing in AI and big data analytics <br>
 CPT(Continue-Pretraining)->SFT->DPO training model based on Hermes-3-Llama-3.1-70B through 8 H100-80Gs as a Korean language model <br>
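
The `ollama run` line added in this change starts an interactive session. As a rough sketch only, the same model could also be queried programmatically, assuming an Ollama server is running locally on its default port (11434) and the model has already been pulled. The helper names `build_request` and `generate` are hypothetical and not part of the README.

```python
import json
import urllib.request

# Model tag taken from the `ollama run` line in the README change.
MODEL = "benedict/linkbricks-hermes3-llama3.1-70b-korean-advanced-q4"
# Assumption: local Ollama server on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_request(prompt: str) -> urllib.request.Request:
    """Build (but do not send) a non-streaming generate request."""
    payload = {"model": MODEL, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt: str) -> str:
    """Send the request and return the generated text."""
    with urllib.request.urlopen(build_request(prompt)) as resp:
        body = json.loads(resp.read().decode("utf-8"))
    # With "stream": False the server replies with one JSON object
    # whose "response" field holds the full completion.
    return body["response"]
```

Calling `generate("...")` with a Korean or English prompt would return the completion as a string. A 70B model, even quantized, needs substantial memory, so the first call may be slow while the model loads.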