### Download the Llama 2 Model
Download the Llama 2 model file named `llama-2-7b-chat.ggmlv3.q4_0.bin` from the following link:
[Download Llama 2 Model](https://huggingface.co/TheBloke/Llama-2-7B-Chat-GGML/tree/main)
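
If you would rather script the download than click through the page, the following is a minimal sketch using only the Python standard library. It assumes the file is served at Hugging Face's usual `resolve/main` direct-download path, which this README does not state; `model_url` and `download_model` are hypothetical helper names introduced for illustration.

```python
import shutil
import urllib.request
from pathlib import Path

REPO_ID = "TheBloke/Llama-2-7B-Chat-GGML"
MODEL_FILE = "llama-2-7b-chat.ggmlv3.q4_0.bin"


def model_url(repo_id: str = REPO_ID, filename: str = MODEL_FILE) -> str:
    """Build a direct-download URL (assumed pattern: <repo>/resolve/main/<file>)."""
    return f"https://huggingface.co/{repo_id}/resolve/main/{filename}"


def download_model(models_dir: str = "models") -> Path:
    """Download the model into models_dir unless it is already there."""
    target = Path(models_dir) / MODEL_FILE
    if target.exists():
        # Skip the multi-gigabyte download when the file is already in place.
        return target
    target.parent.mkdir(parents=True, exist_ok=True)
    # Write to a temporary name first so an interrupted download
    # never leaves a truncated file under the final name.
    partial = target.with_name(target.name + ".part")
    with urllib.request.urlopen(model_url()) as response, open(partial, "wb") as out:
        shutil.copyfileobj(response, out)
    partial.replace(target)
    return target
```

Calling `download_model()` from the project root would fetch the file into `models`. The download is about 3.79 GB, so expect it to take a while.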
### Llama 2 Model Information
| Name | Quant method | Bits | Size | Max RAM required |
|--------------------------------|--------------|------|---------|------------------|
| llama-2-7b-chat.ggmlv3.q4_0.bin | q4_0 | 4 | 3.79 GB | 6.29 GB |
**Note:** After downloading the model, place the file in the `models` directory. To run the code, the file must be located at `models\llama-2-7b-chat.ggmlv3.q4_0.bin` (on Linux or macOS, `models/llama-2-7b-chat.ggmlv3.q4_0.bin`).
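
To confirm the file is where it needs to be before running anything, a small check like the one below may help. It is a sketch, not part of the project: `check_model` is a hypothetical helper, and the 3 GB size threshold is an assumption chosen to sit below the 3.79 GB listed in the table so that truncated downloads are caught.

```python
from pathlib import Path

MODEL_FILE = "llama-2-7b-chat.ggmlv3.q4_0.bin"
# Assumed threshold: comfortably below the 3.79 GB listed in the table.
MIN_SIZE_BYTES = 3 * 10**9


def check_model(models_dir: str = "models", min_size_bytes: int = MIN_SIZE_BYTES) -> Path:
    """Return the model path, or raise if the file is missing or too small."""
    # pathlib joins with the separator of the current OS,
    # so the same call works on Windows, Linux, and macOS.
    path = Path(models_dir) / MODEL_FILE
    if not path.is_file():
        raise FileNotFoundError(f"Model file not found: {path}")
    size = path.stat().st_size
    if size < min_size_bytes:
        raise ValueError(
            f"{path} is only {size} bytes; the download may be incomplete."
        )
    return path
```

Run `check_model()` from the project root. It returns the path on success and raises an error describing the problem otherwise.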