anon8231489123 committed
Commit f267949 • Parent: fadc38f
Update README.md
README.md CHANGED
@@ -5,12 +5,7 @@ Okay... Two different models now. One generated in the Triton branch, one genera
 Cuda info (use this one):
 Command:
 
-CUDA_VISIBLE_DEVICES=0 python llama.py ./models/chavinlo-gpt4-x-alpaca
-
---wbits 4
---true-sequential
---groupsize 128
---save gpt-x-alpaca-13b-native-4bit-128g-cuda.pt
+CUDA_VISIBLE_DEVICES=0 python llama.py ./models/chavinlo-gpt4-x-alpaca --wbits 4 --true-sequential --groupsize 128 --save gpt-x-alpaca-13b-native-4bit-128g-cuda.pt
 
 
 Prev. info
@@ -25,10 +20,4 @@ Because of this, it appears to be incompatible with Oobabooga at the moment. Sta
 
 Command:
 
-CUDA_VISIBLE_DEVICES=0 python llama.py ./models/chavinlo-gpt4-x-alpaca
-
---wbits 4
---true-sequential
---act-order
---groupsize 128
---save gpt-x-alpaca-13b-native-4bit-128g.pt
+CUDA_VISIBLE_DEVICES=0 python llama.py ./models/chavinlo-gpt4-x-alpaca --wbits 4 --true-sequential --act-order --groupsize 128 --save gpt-x-alpaca-13b-native-4bit-128g.pt
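For context, the commands in this README appear to be GPTQ 4-bit quantization invocations of llama.py against the chavinlo-gpt4-x-alpaca weights; the commit itself only joins the previously line-broken flags into single commands. The sketch below shows how the CUDA-variant command might be run end to end. The repository URL, branch name, and clone steps are assumptions for illustration and are not part of this commit.

```bash
# Assumed setup sketch -- only the llama.py invocation itself comes from the README;
# the repository URL and branch checkout below are guesses, not recorded in the commit.
git clone https://github.com/qwopqwop200/GPTQ-for-LLaMa.git
cd GPTQ-for-LLaMa
git checkout cuda   # assumption: "Cuda info (use this one)" refers to the CUDA branch

# Quantization command as it appears in the updated README (single line):
CUDA_VISIBLE_DEVICES=0 python llama.py ./models/chavinlo-gpt4-x-alpaca \
    --wbits 4 --true-sequential --groupsize 128 \
    --save gpt-x-alpaca-13b-native-4bit-128g-cuda.pt
```

The command under "Prev. info" differs only by adding --act-order, which the surrounding hunk context associates with the Oobabooga incompatibility noted at the time.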