Felladrin
/

gguf-Minueza-32M-UltraChat

Inference Endpoints

Model card Files Files and versions Community

gguf-Minueza-32M-UltraChat / README.md

Felladrin's picture

Update README.md

446b990 verified 7 months ago

|

736 Bytes

	---
	license: apache-2.0
	base_model: Felladrin/Minueza-32M-UltraChat
	---

	GGUF version of [Felladrin/Minueza-32M-UltraChat](https://huggingface.co/Felladrin/Minueza-32M-UltraChat).

	It was not possible to quantize the model, so only the F16 and F32 GGUF files are available.

	## Try it with [llama.cpp](https://github.com/ggerganov/llama.cpp)

	```bash
	brew install ggerganov/ggerganov/llama.cpp
	```
	```bash
	llama-cli \
	--hf-repo Felladrin/gguf-Minueza-32M-UltraChat \
	--model Minueza-32M-UltraChat.F32.gguf \
	--random-prompt \
	--temp 1.3 \
	--dynatemp-range 1.2 \
	--top-k 0 \
	--top-p 1 \
	--min-p 0.1 \
	--typical 0.85 \
	--mirostat 2 \
	--mirostat-ent 3.5 \
	--repeat-penalty 1.1 \
	--repeat-last-n -1 \
	-n 256
	```