Monor
/

Llama-3-8B-Instruct-262k-gguf

Inference Endpoints

Model card Files Files and versions Community

Llama-3-8B-Instruct-262k-gguf / README.md

Monor's picture

Update README.md

903dce0 verified 6 months ago

|

206 Bytes

	---
	license: apache-2.0
	---

	## Introduce

	Quantizing the [gradientai/Llama-3-8B-Instruct-262k](https://huggingface.co/gradientai/Llama-3-8B-Instruct-262k) to f16, q2, q3, q4, q5, q6 and q8 with Llama.cpp.