nisten committed on
Commit
d17decc
1 Parent(s): 3cc0e62

Create README.md

  1. README.md +121 -0
README.md ADDED
@@ -0,0 +1,121 @@
+ ---
+ base_model: [deepseek-ai/DeepSeek-V2-Chat-0628]
+ ---
+
+ #### 🚀 Custom quantizations of DeepSeek-V2-Chat-0628 supercharged for CPU inference! 🖥️
+
+ ### 🧠 This IQ4XM version uses GGML type IQ4_XS (4-bit) in combination with Q8_0 (8-bit) for blazing fast performance with minimal loss, leveraging int8 optimizations on most newer server CPUs.
+ ### 🛠️ While it required some custom code wizardry, it's fully compatible with standard llama.cpp from GitHub, or just search for nisten in LM Studio.
+
+ >[!TIP]
+ >🔥 The following 4-bit version is my personal go-to, delivering jaw-dropping performance on ARM cores.
+ >
+ >📁 No need for file concatenation - just point llama-cli at the first file and watch the magic happen!
+ >
+ >💻 Ready to delve in, baby? Here's your command-line spell for interactive mode (prompt.txt is optional, but recommended for maximum sorcery):
+ >```bash
+ >./llama-cli --temp 0.4 -m deepseek_0628_cpu_optimized_iq4xm-00001-of-00004.gguf -c 32000 -co -cnv -i -f prompt.txt
+ >```
+
+ ```verilog
+ deepseek_0628_cpu_optimized_iq4xm-00001-of-00004.gguf
+ deepseek_0628_cpu_optimized_iq4xm-00002-of-00004.gguf
+ deepseek_0628_cpu_optimized_iq4xm-00003-of-00004.gguf
+ deepseek_0628_cpu_optimized_iq4xm-00004-of-00004.gguf
+ ```
+
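Because llama-cli is pointed only at the first file, the remaining shards presumably need to sit next to it in the same directory. The following is a minimal sketch for checking that before launching. `missing_shards` is a hypothetical helper written for this note, not part of llama.cpp, and it assumes the `<prefix>-NNNNN-of-NNNNN.gguf` naming shown in the file list.

```bash
# Hypothetical helper (not part of llama.cpp): print the name of every shard
# that is absent from a directory, assuming <prefix>-NNNNN-of-NNNNN.gguf naming.
# The body runs in a subshell so its variables do not leak into the caller.
missing_shards() (
  dir=$1 prefix=$2 total=$3
  i=1
  while [ "$i" -le "$total" ]; do
    name=$(printf '%s-%05d-of-%05d.gguf' "$prefix" "$i" "$total")
    [ -f "$dir/$name" ] || printf '%s\n' "$name"
    i=$((i + 1))
  done
)
```

Running `missing_shards . deepseek_0628_cpu_optimized_iq4xm 4` in the download directory should print nothing once all four IQ4XM shards are present.
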
+ >[!TIP]
+ >### 🚄 Want to download faster than a caffeinated thirsty llama? Here's how:
+ >
+ >🐧 On Linux: `sudo apt install -y aria2`
+ >🍎 On Mac: `brew install aria2`
+ >
+ ```bash
+ sudo apt install -y aria2
+ ```
+
+ ```bash
+ # 🚀 For the turbocharged 4-bit IQ4XM version
+ aria2c -x 8 -o deepseek_0628_cpu_optimized_iq4xm-00001-of-00004.gguf \
+ https://huggingface.co/nisten/deepseek-0628-gguf/resolve/main/deepseek_0628_cpu_optimized_iq4xm-00001-of-00004.gguf
+
+ aria2c -x 8 -o deepseek_0628_cpu_optimized_iq4xm-00002-of-00004.gguf \
+ https://huggingface.co/nisten/deepseek-0628-gguf/resolve/main/deepseek_0628_cpu_optimized_iq4xm-00002-of-00004.gguf
+
+ aria2c -x 8 -o deepseek_0628_cpu_optimized_iq4xm-00003-of-00004.gguf \
+ https://huggingface.co/nisten/deepseek-0628-gguf/resolve/main/deepseek_0628_cpu_optimized_iq4xm-00003-of-00004.gguf
+
+ aria2c -x 8 -o deepseek_0628_cpu_optimized_iq4xm-00004-of-00004.gguf \
+ https://huggingface.co/nisten/deepseek-0628-gguf/resolve/main/deepseek_0628_cpu_optimized_iq4xm-00004-of-00004.gguf
+ ```
+ ```bash
+ # 🏋️ For the nearly lossless Q8_0 version
+ aria2c -x 8 -o deepseek-0628-q8_0-00001-of-00006.gguf \
+ https://huggingface.co/nisten/deepseek-0628-gguf/resolve/main/deepseek-0628-q8_0-00001-of-00006.gguf
+
+ aria2c -x 8 -o deepseek-0628-q8_0-00002-of-00006.gguf \
+ https://huggingface.co/nisten/deepseek-0628-gguf/resolve/main/deepseek-0628-q8_0-00002-of-00006.gguf
+
+ aria2c -x 8 -o deepseek-0628-q8_0-00003-of-00006.gguf \
+ https://huggingface.co/nisten/deepseek-0628-gguf/resolve/main/deepseek-0628-q8_0-00003-of-00006.gguf
+
+ aria2c -x 8 -o deepseek-0628-q8_0-00004-of-00006.gguf \
+ https://huggingface.co/nisten/deepseek-0628-gguf/resolve/main/deepseek-0628-q8_0-00004-of-00006.gguf
+
+ aria2c -x 8 -o deepseek-0628-q8_0-00005-of-00006.gguf \
+ https://huggingface.co/nisten/deepseek-0628-gguf/resolve/main/deepseek-0628-q8_0-00005-of-00006.gguf
+
+ aria2c -x 8 -o deepseek-0628-q8_0-00006-of-00006.gguf \
+ https://huggingface.co/nisten/deepseek-0628-gguf/resolve/main/deepseek-0628-q8_0-00006-of-00006.gguf
+ ```
+ ```bash
+ # 🧠 For the full-brain BF16 version
+ aria2c -x 8 -o deepseek-0628-bf16-00001-of-00011.gguf \
+ https://huggingface.co/nisten/deepseek-0628-gguf/resolve/main/deepseek-0628-bf16-00001-of-00011.gguf
+
+ aria2c -x 8 -o deepseek-0628-bf16-00002-of-00011.gguf \
+ https://huggingface.co/nisten/deepseek-0628-gguf/resolve/main/deepseek-0628-bf16-00002-of-00011.gguf
+
+ aria2c -x 8 -o deepseek-0628-bf16-00003-of-00011.gguf \
+ https://huggingface.co/nisten/deepseek-0628-gguf/resolve/main/deepseek-0628-bf16-00003-of-00011.gguf
+
+ aria2c -x 8 -o deepseek-0628-bf16-00004-of-00011.gguf \
+ https://huggingface.co/nisten/deepseek-0628-gguf/resolve/main/deepseek-0628-bf16-00004-of-00011.gguf
+
+ aria2c -x 8 -o deepseek-0628-bf16-00005-of-00011.gguf \
+ https://huggingface.co/nisten/deepseek-0628-gguf/resolve/main/deepseek-0628-bf16-00005-of-00011.gguf
+
+ aria2c -x 8 -o deepseek-0628-bf16-00006-of-00011.gguf \
+ https://huggingface.co/nisten/deepseek-0628-gguf/resolve/main/deepseek-0628-bf16-00006-of-00011.gguf
+
+ aria2c -x 8 -o deepseek-0628-bf16-00007-of-00011.gguf \
+ https://huggingface.co/nisten/deepseek-0628-gguf/resolve/main/deepseek-0628-bf16-00007-of-00011.gguf
+
+ aria2c -x 8 -o deepseek-0628-bf16-00008-of-00011.gguf \
+ https://huggingface.co/nisten/deepseek-0628-gguf/resolve/main/deepseek-0628-bf16-00008-of-00011.gguf
+
+ aria2c -x 8 -o deepseek-0628-bf16-00009-of-00011.gguf \
+ https://huggingface.co/nisten/deepseek-0628-gguf/resolve/main/deepseek-0628-bf16-00009-of-00011.gguf
+
+ aria2c -x 8 -o deepseek-0628-bf16-00010-of-00011.gguf \
+ https://huggingface.co/nisten/deepseek-0628-gguf/resolve/main/deepseek-0628-bf16-00010-of-00011.gguf
+
+ aria2c -x 8 -o deepseek-0628-bf16-00011-of-00011.gguf \
+ https://huggingface.co/nisten/deepseek-0628-gguf/resolve/main/deepseek-0628-bf16-00011-of-00011.gguf
+ ```
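The download commands above differ only in the file prefix and the shard count, so the URL list can be generated instead of typed out. This is a sketch rather than part of the original README: `shard_urls` is a hypothetical helper, and it assumes every shard follows the `<prefix>-NNNNN-of-NNNNN.gguf` pattern under the same `resolve/main` path used above.

```bash
# Hypothetical helper: print one download URL per shard for a given file
# prefix and shard count, following the naming used in the commands above.
# The body runs in a subshell so its variables do not leak into the caller.
shard_urls() (
  prefix=$1 total=$2
  base="https://huggingface.co/nisten/deepseek-0628-gguf/resolve/main"
  i=1
  while [ "$i" -le "$total" ]; do
    printf '%s/%s-%05d-of-%05d.gguf\n' "$base" "$prefix" "$i" "$total"
    i=$((i + 1))
  done
)
```

For example, `shard_urls deepseek-0628-q8_0 6` prints the six Q8_0 URLs. Piping the output into `aria2c -x 8 -i -` should fetch them in one invocation, since aria2c can read a URI list from standard input, with each file saved under the name taken from its URL.
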
+
+ 📜 Use of the DeepSeek-V2-Chat-0628 model is subject to the [DeepSeek Model License](https://github.com/deepseek-ai/DeepSeek-V2/blob/main/LICENSE-MODEL). The DeepSeek-V2 series supports commercial use. It's a permissive license that only restricts use for military purposes, harming minors, or patent trolling.
+
+ ### 🌟 Model Information
+
+ DeepSeek-V2-Chat-0628 is the latest and greatest in the DeepSeek family. This AI powerhouse has climbed the LMSYS Chatbot Arena Leaderboard faster than a rocket on steroids:
+
+ - 🏆 Overall Arena Ranking: #11 global
+ - 💻 Coding Arena Ranking: #3 global
+ - 🧠 Hard Prompts Arena Ranking: #7 global, better than Claude Opus even on English-only hard prompts
+
+ Want to seek deeper into this model's ocean of awesomeness? Swim over to the [original model card](https://huggingface.co/deepseek-ai/DeepSeek-V2-Chat-0628) and prepare to have your mind blown! 🤯
+
+ Now go forth and accelerate 🚀💡
+
+ -Nisten