TheBloke commited on
Commit
e12f498
1 Parent(s): 25d2c4f

Updating model files

Browse files
Files changed (1) hide show
  1. README.md +24 -2
README.md CHANGED
@@ -2,6 +2,17 @@
2
  license: other
3
  inference: false
4
  ---
 
 
 
 
 
 
 
 
 
 
 
5
  # Vicuna 13B 1.1 GPTQ 4bit 128g
6
 
7
  This is a 4-bit GPTQ version of the [Vicuna 13B 1.1 model](https://huggingface.co/lmsys/vicuna-13b-delta-v1.1).
@@ -21,7 +32,7 @@ I have the following Vicuna 1.1 repositories available:
21
  **13B models:**
22
  * [Unquantized 13B 1.1 model for GPU - HF format](https://huggingface.co/TheBloke/vicuna-13B-1.1-HF)
23
  * [GPTQ quantized 4bit 13B 1.1 for GPU - `safetensors` and `pt` formats](https://huggingface.co/TheBloke/vicuna-13B-1.1-GPTQ-4bit-128g)
24
-
25
  **7B models:**
26
  * [Unquantized 7B 1.1 model for GPU - HF format](https://huggingface.co/TheBloke/vicuna-7B-1.1-HF)
27
  * [GPTQ quantized 4bit 7B 1.1 for GPU - `safetensors` and `pt` formats](https://huggingface.co/TheBloke/vicuna-7B-1.1-GPTQ-4bit-128g)
@@ -114,6 +125,17 @@ Then link that into `text-generation-webui/repositories` as described above.
114
 
115
  Or just use `vicuna-13B-1.1-GPTQ-4bit-128g.compat.no-act-order.pt` as mentioned above, which should work without any upgrades to text-generation-webui.
116
 
 
 
 
 
 
 
 
 
 
 
 
117
  # Vicuna Model Card
118
 
119
  ## Model details
@@ -152,4 +174,4 @@ A preliminary evaluation of the model quality is conducted by creating a set of
152
 
153
  ## Major updates of weights v1.1
154
  - Refactor the tokenization and separator. In Vicuna v1.1, the separator has been changed from `"###"` to the EOS token `"</s>"`. This change makes it easier to determine the generation stop criteria and enables better compatibility with other libraries.
155
- - Fix the supervised fine-tuning loss computation for better model quality.
 
2
  license: other
3
  inference: false
4
  ---
5
+ <div style="width: 100%;">
6
+ <img src="https://i.imgur.com/EBdldam.jpg" alt="TheBlokeAI" style="width: 100%; min-width: 400px; display: block; margin: auto;">
7
+ </div>
8
+ <div style="display: flex; justify-content: space-between; width: 100%;">
9
+ <div style="display: flex; flex-direction: column; align-items: flex-start;">
10
+ <p><a href="https://discord.gg/UBgz4VXf">Chat & support: my new Discord server</a></p>
11
+ </div>
12
+ <div style="display: flex; flex-direction: column; align-items: flex-end;">
13
+ <p><a href="https://www.patreon.com/TheBlokeAI">Want to contribute? Patreon coming soon!</a></p>
14
+ </div>
15
+ </div>
16
  # Vicuna 13B 1.1 GPTQ 4bit 128g
17
 
18
  This is a 4-bit GPTQ version of the [Vicuna 13B 1.1 model](https://huggingface.co/lmsys/vicuna-13b-delta-v1.1).
 
32
  **13B models:**
33
  * [Unquantized 13B 1.1 model for GPU - HF format](https://huggingface.co/TheBloke/vicuna-13B-1.1-HF)
34
  * [GPTQ quantized 4bit 13B 1.1 for GPU - `safetensors` and `pt` formats](https://huggingface.co/TheBloke/vicuna-13B-1.1-GPTQ-4bit-128g)
35
+
36
  **7B models:**
37
  * [Unquantized 7B 1.1 model for GPU - HF format](https://huggingface.co/TheBloke/vicuna-7B-1.1-HF)
38
  * [GPTQ quantized 4bit 7B 1.1 for GPU - `safetensors` and `pt` formats](https://huggingface.co/TheBloke/vicuna-7B-1.1-GPTQ-4bit-128g)
 
125
 
126
  Or just use `vicuna-13B-1.1-GPTQ-4bit-128g.compat.no-act-order.pt` as mentioned above, which should work without any upgrades to text-generation-webui.
127
 
128
+ ## Want to support my work?
129
+
130
+ I've had a lot of people ask if they can contribute. I love providing models and helping people, but it is starting to rack up pretty big cloud computing bills.
131
+
132
+ So if you're able and willing to contribute, it'd be most gratefully received and will help me to keep providing models, and work on various AI projects.
133
+
134
+ Donaters will get priority support on any and all AI/LLM/model questions, and I'll gladly quantise any model you'd like to try.
135
+
136
+ * Patreon: coming soon! (just awaiting approval)
137
+ * Ko-Fi: https://ko-fi.com/TheBlokeAI
138
+ * Discord: https://discord.gg/UBgz4VXf
139
  # Vicuna Model Card
140
 
141
  ## Model details
 
174
 
175
  ## Major updates of weights v1.1
176
  - Refactor the tokenization and separator. In Vicuna v1.1, the separator has been changed from `"###"` to the EOS token `"</s>"`. This change makes it easier to determine the generation stop criteria and enables better compatibility with other libraries.
177
+ - Fix the supervised fine-tuning loss computation for better model quality.