tmpupload
/

superhot-30b-8k-no-rlhf-test-128g-GPTQ

Text Generation

Inference Endpoints

Model card Files Files and versions Community

tmpupload commited on Jun 27, 2023

Commit

6c3a855

•

1 Parent(s): 7cc788c

Update README.md

Files changed (1) hide show

README.md +7 -2

README.md CHANGED Viewed

@@ -1,12 +1,17 @@
 # superhot-30b-8k-4bit-128g-safetensors
 Merged base LLaMA and LoRA with this:
 https://github.com/tloen/alpaca-lora
 Base LLaMA 30B:
 https://huggingface.co/huggyllama/llama-30b
-SuperCOT 30B 8k LoRA:
 https://huggingface.co/kaiokendev/superhot-30b-8k-no-rlhf-test
 ``` sh
@@ -56,4 +61,4 @@ CUDA_VISIBLE_DEVICES=0 python test_benchmark_inference.py \
  -- Loading dataset...
  -- Testing 40 chunks....
  ** Perplexity: 4.6612
-```

+---
+license: other
+---
 # superhot-30b-8k-4bit-128g-safetensors
+**Note: Maximum sequence length (max_seq_len) and compression factor (compress_pos_emb) need to be set to 8192 and 4.**
 Merged base LLaMA and LoRA with this:
 https://github.com/tloen/alpaca-lora
 Base LLaMA 30B:
 https://huggingface.co/huggyllama/llama-30b
+SuperHOT 30B 8k no-rlhf-test LoRA:
 https://huggingface.co/kaiokendev/superhot-30b-8k-no-rlhf-test
 ``` sh
  -- Loading dataset...
  -- Testing 40 chunks....
  ** Perplexity: 4.6612
+```