Initial GPTQ model commit
README.md CHANGED

@@ -17,9 +17,9 @@ license: other
 </div>
 <!-- header end -->
 
-# LmSys' Vicuna 13B 1.3.0
+# LmSys' Vicuna 13B 1.3.0 GPTQ
 
-These files are GPTQ 4bit model files for [LmSys' Vicuna 13B 1.3.0](https://huggingface.co/lmsys/vicuna-13b-v1.3).
+These files are GPTQ 4bit model files for [LmSys' Vicuna 13B 1.3.0](https://huggingface.co/lmsys/vicuna-13b-v1.3) merged with [Kaio Ken's SuperHOT 8K](https://huggingface.co/kaiokendev/superhot-30b-8k-no-rlhf-test).
 
 It is the result of quantising to 4bit using [GPTQ-for-LLaMa](https://github.com/qwopqwop200/GPTQ-for-LLaMa).
 
@@ -206,7 +206,7 @@ I trained the LoRA with the following configuration:
 - AdamW beta1 of 0.9 and beta2 0.99, epsilon of 1e-5
 - Trained on 4-bit base model
 
-# Original model card: LmSys' Vicuna 13B 1.3.0
+# Original model card: LmSys' Vicuna 13B 1.3.0
 
 # Vicuna Model Card
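The README body these hunks patch describes 4-bit GPTQ model files. As a hedged illustration only, a common way to load such files at the time was the AutoGPTQ library; the model directory, safetensors assumption, and prompt template below are placeholders and assumptions, not details taken from this commit.

```python
from transformers import AutoTokenizer
from auto_gptq import AutoGPTQForCausalLM

# Placeholder path: point this at the directory holding the GPTQ files.
model_dir = "path/to/vicuna-13b-1.3.0-superhot-8k-gptq"

tokenizer = AutoTokenizer.from_pretrained(model_dir, use_fast=True)
model = AutoGPTQForCausalLM.from_quantized(
    model_dir,
    use_safetensors=True,  # assumes the weights ship as .safetensors
    device="cuda:0",
)

# Vicuna v1.1+ style prompt (an assumption; check the model card's template).
prompt = "USER: Explain GPTQ quantisation in one sentence.\nASSISTANT:"
input_ids = tokenizer(prompt, return_tensors="pt").input_ids.to("cuda:0")
output = model.generate(input_ids=input_ids, max_new_tokens=64)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```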
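The context lines in the second hunk quote Kaio Ken's SuperHOT LoRA training notes. A minimal PyTorch sketch of just those optimiser settings follows; the module and learning rate are placeholders, since the quoted configuration does not state them.

```python
import torch
import torch.nn as nn

# Placeholder module standing in for the LoRA adapter parameters;
# per the notes above, the LoRA was trained on a 4-bit base model.
adapter = nn.Linear(16, 16)

# AdamW with beta1=0.9, beta2=0.99, and epsilon=1e-5, as listed in the hunk.
# lr is a placeholder: the quoted configuration does not include one.
optimizer = torch.optim.AdamW(
    adapter.parameters(), lr=1e-4, betas=(0.9, 0.99), eps=1e-5
)
```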