Quantizing the shibing624/llama-3-8b-instruct-262k-chinese to f16, q2, q3, q4, q5, q6 and q8 with Llama.cpp.
2-bit
3-bit
4-bit
5-bit
6-bit
8-bit
16-bit