---
license: apache-2.0
base_model: Felladrin/Minueza-32M-UltraChat
---
GGUF version of [Felladrin/Minueza-32M-UltraChat](https://huggingface.co/Felladrin/Minueza-32M-UltraChat).
It was not possible to quantize the model, so only the F16 and F32 GGUF files are available.
## Try it with [llama.cpp](https://github.com/ggerganov/llama.cpp)
```bash
brew install ggerganov/ggerganov/llama.cpp
```
```bash
llama-cli \
--hf-repo Felladrin/gguf-Minueza-32M-UltraChat \
--model Minueza-32M-UltraChat.F32.gguf \
--random-prompt \
--temp 1.3 \
--dynatemp-range 1.2 \
--top-k 0 \
--top-p 1 \
--min-p 0.1 \
--typical 0.85 \
--mirostat 2 \
--mirostat-ent 3.5 \
--repeat-penalty 1.1 \
--repeat-last-n -1 \
-n 256
```
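If you would rather fetch a file once and run it from disk, the snippet below is a minimal sketch of how the direct download URL could be built. The repository name and the F32 file name are taken from the command above. The `gguf_url` helper is hypothetical, the `main` branch is assumed, and the F16 file name is assumed to follow the same naming pattern.

```bash
# Sketch: build the direct download URL for one of the GGUF files.
# REPO and the F32 file name come from the llama-cli command above.
# Assumptions: the files live on the "main" branch, and the F16 file
# follows the same naming pattern as the F32 one.
REPO="Felladrin/gguf-Minueza-32M-UltraChat"

# gguf_url is a hypothetical helper; $1 is the precision suffix (F32 or F16).
gguf_url() {
  printf 'https://huggingface.co/%s/resolve/main/Minueza-32M-UltraChat.%s.gguf\n' "$REPO" "$1"
}

# Print the URL for the F32 file.
gguf_url F32

# To download and run locally (requires network access), uncomment:
#   curl -L -o Minueza-32M-UltraChat.F32.gguf "$(gguf_url F32)"
#   llama-cli --model Minueza-32M-UltraChat.F32.gguf -n 256
```

Running from a local file skips the download step on later runs. The sampling flags from the command above can be appended unchanged.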