openthaigpt/openthaigpt-1.0.0-7b-chat

Update README.md Ollama section (#5)

by pacozaa - opened Apr 9

base: refs/heads/main ← from: refs/pr/5

README.md +14 -0

@@ -225,6 +225,20 @@ curl --location 'http://localhost:8000/completion' \
 }'
 ```
 ### GPU Memory Requirements
 | **Number of Parameters** | **FP 16 bits** | **8 bits (Quantized)** | **4 bits (Quantized)** | **Example Graphics Card for 4 bits** |
 |------------------|----------------|------------------------|------------------------|---------------------------------------------|

 }'
 ```
+### Ollama
+There are two ways to run the model with Ollama:
+1. Build it from this repo's Modelfile and the 4-bit quantized GGUF file:
+```bash
+# "openthaigpt" is a placeholder local model name; `ollama create` requires one
+ollama create openthaigpt -f ./Modelfile
+```
+2. Pull and run the published model from the Ollama CLI:
+```bash
+ollama run pacozaa/openthaigpt
+```
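Once the model is available locally, it can also be queried over Ollama's local HTTP API, in the same curl style the README uses elsewhere. The following is a minimal sketch and not part of this change: it assumes an Ollama server listening on the default port 11434 and the `pacozaa/openthaigpt` model from option 2. `build_payload` and `ask` are hypothetical helper names introduced only for illustration.

```shell
#!/usr/bin/env bash
# Sketch: query a locally running Ollama server over its HTTP API.
# Assumptions: the server listens on the default port 11434 and the
# model was already pulled with `ollama run pacozaa/openthaigpt`.

OLLAMA_URL="${OLLAMA_URL:-http://localhost:11434}"
MODEL="${MODEL:-pacozaa/openthaigpt}"

# build_payload (hypothetical helper) prints the JSON request body for
# /api/generate. No JSON escaping is done, so the prompt must not
# contain double quotes or backslashes.
build_payload() {
  printf '{"model":"%s","prompt":"%s","stream":false}' "$MODEL" "$1"
}

# ask (hypothetical helper) sends the prompt and prints the raw JSON response.
ask() {
  curl --silent --location "$OLLAMA_URL/api/generate" \
    --header 'Content-Type: application/json' \
    --data "$(build_payload "$1")"
}

# Example call (requires a running server):
# ask "สวัสดี"
```

Setting `"stream": false` asks the server for a single JSON object instead of a stream of partial responses, which is easier to inspect by hand.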
 ### GPU Memory Requirements
 | **Number of Parameters** | **FP 16 bits** | **8 bits (Quantized)** | **4 bits (Quantized)** | **Example Graphics Card for 4 bits** |
 |------------------|----------------|------------------------|------------------------|---------------------------------------------|