Update README.md
anon8231489123 committed
Commit 8a7481d • 1 parent: eefa152

README.md CHANGED
@@ -1,10 +1,18 @@
-Update:
+Update:
+
+Okay... Two different models now. One generated in the Triton branch, one generated in Cuda. Use the Cuda one for now unless the Triton branch becomes widely used.
+
 Cuda info (use this one):
 Command:
+
 CUDA_VISIBLE_DEVICES=0 python llama.py ./models/chavinlo-gpt4-x-alpaca
+
 --wbits 4
+
 --true-sequential
+
 --groupsize 128
+
 --save gpt-x-alpaca-13b-native-4bit-128g-cuda.pt
 
 
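For reference, the flags listed in the README form a single GPTQ-for-LLaMa quantization invocation. A minimal sketch of the assembled command, assuming it is run from the root of a GPTQ-for-LLaMa checkout (CUDA branch) with the full-precision chavinlo/gpt4-x-alpaca weights downloaded to ./models/chavinlo-gpt4-x-alpaca:

# Quantize the model to 4-bit GPTQ weights (128-column groups, sequential
# quantization within each block) and save the packed checkpoint as a .pt file.
CUDA_VISIBLE_DEVICES=0 python llama.py ./models/chavinlo-gpt4-x-alpaca \
    --wbits 4 \
    --true-sequential \
    --groupsize 128 \
    --save gpt-x-alpaca-13b-native-4bit-128g-cuda.pt

Note that, depending on the GPTQ-for-LLaMa revision, llama.py may also expect a calibration-dataset argument (e.g. c4) after the model path; the command above only reproduces the flags given in this README.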