Tutorial
Please provide a tutorial or a link on how I can integrate or use these models.
These are to be used with llama.cpp
https://github.com/ggerganov/llama.cpp
All instructions will be in that repo.
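If it helps, here is a minimal, untested sketch of loading one of these ggml files from Python via the llama-cpp-python bindings (not part of this repo; the file name and prompt are just examples, use whichever file you downloaded):

```python
# Minimal sketch using the llama-cpp-python bindings (pip install llama-cpp-python).
# The model path is an example; point it at whichever ggml file you downloaded.
from llama_cpp import Llama

llm = Llama(model_path="./ggml-model-q4_0.bin", n_ctx=512)

output = llm(
    "### Instruction:\nSay hello.\n\n### Response:\n",
    max_tokens=128,
    stop=["### Instruction:"],
)
print(output["choices"][0]["text"])
```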
By the way, I came here because Dalai couldn't download Alpaca 13B from this link: https://huggingface.co/Pi3141/alpaca-13B-ggml/resolve/main/ggml-model-q4_0.bin
Dalai is no longer supported; it's outdated, and I won't keep q4_0 in the old format for it.
Okay, thank you. I want to ask: is each file in your repo independent of the others?
Yep, they are all independent.
Which one is the best in your opinion? I'm trying to build a chat bot for Discord ;p
Q4_0 and Q4_2
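If you're building the Discord bot in Python, something along these lines might work, a rough, untested sketch assuming discord.py 2.x and the llama-cpp-python bindings (the bot token, model file name, and prompt template below are placeholders):

```python
# Rough sketch: discord.py 2.x + llama-cpp-python. Token, model file name,
# and prompt template are placeholders, not values from this repo.
import discord
from llama_cpp import Llama

llm = Llama(model_path="./ggml-model-q4_0.bin", n_ctx=512)

intents = discord.Intents.default()
intents.message_content = True  # needed to read message text in discord.py 2.x
client = discord.Client(intents=intents)

@client.event
async def on_message(message):
    if message.author == client.user:
        return
    if message.content.startswith("!ask "):
        prompt = message.content[len("!ask "):]
        # llama-cpp-python is blocking; fine for a toy bot, move it to a
        # worker thread or queue for anything serious
        result = llm(
            f"### Instruction:\n{prompt}\n\n### Response:\n",
            max_tokens=200,
            stop=["### Instruction:"],
        )
        await message.channel.send(result["choices"][0]["text"].strip())

client.run("YOUR_DISCORD_BOT_TOKEN")
```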
I couldn't find out how I can use my GPU to make it faster. It gets stuck after running it in Docker.
You can't use a GPU with these files. If you want GPU inference, you'll need to run the model with PyTorch; this isn't the right repo for you in that case.
What's the difference between the two? I have a fairly strong CPU and GPU. What do you recommend?
The GPU is faster, but you need at least 12 GB of VRAM.
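For reference, a GPU run with PyTorch would look roughly like this, a sketch assuming the Hugging Face transformers LLaMA classes plus bitsandbytes/accelerate and a full-precision HF checkpoint, not these ggml files; the model id is a placeholder:

```python
# Sketch of GPU inference with PyTorch/transformers (not for these ggml files).
# "some-org/alpaca-13b-hf" is a placeholder model id. 13B weights are roughly
# 13 GB at 8-bit, which is where the multi-GB VRAM requirement comes from.
from transformers import LlamaForCausalLM, LlamaTokenizer

tokenizer = LlamaTokenizer.from_pretrained("some-org/alpaca-13b-hf")
model = LlamaForCausalLM.from_pretrained(
    "some-org/alpaca-13b-hf",
    load_in_8bit=True,   # requires bitsandbytes
    device_map="auto",   # requires accelerate
)

inputs = tokenizer("Say hello.", return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```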
Any idea why the hashes for q4_1 changed? IIRC that format has not been modified since ggjt dropped.
IIRC llama.cpp made some changes to the quantization.
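If you want to check which version of a file you actually have, comparing its SHA256 against the hash shown for that file on the repo's Files page is a quick way to tell; a small sketch (the file name is an example):

```python
# Compute a local file's SHA256 to compare with the hash listed on the model page.
# The file name is an example.
import hashlib

def sha256_of(path, chunk_size=1 << 20):
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()

print(sha256_of("ggml-model-q4_1.bin"))
```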