	---
	license: other
	---
	# superhot-30b-8k-no-rlhf-test-GGML

	Note: this model targets an 8k context, so `LLAMA_ROPE_SCALE` from llama.cpp PR [#1967](https://github.com/ggerganov/llama.cpp/pull/1967) needs to be set to 0.25.
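The 0.25 value is consistent with the ratio of the base LLaMA context length (2048 tokens) to the 8k context in the model name (8192 tokens). A minimal sketch of that arithmetic, assuming those two lengths and linear position scaling; `rope_scale` is a hypothetical helper for illustration, not part of llama.cpp:

```sh
#!/bin/sh
# Hypothetical helper: ratio of the original context length to the extended one.
# Assumes linear position scaling, where token positions are multiplied by this factor.
rope_scale() {
    awk -v base="$1" -v target="$2" 'BEGIN { print base / target }'
}

# Assumed values: 2048 (base LLaMA context) and 8192 (the "8k" in the model name).
rope_scale 2048 8192
```

With these inputs the helper prints 0.25, matching the value in the note above.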

	Merged base LLaMA and the LoRA with `export_hf_checkpoint.py` from alpaca-lora:
	https://github.com/tloen/alpaca-lora

	Base LLaMA 30B:
	https://huggingface.co/huggyllama/llama-30b

	SuperHOT 30B 8k no-rlhf-test LoRA:
	https://huggingface.co/kaiokendev/superhot-30b-8k-no-rlhf-test

	``` sh
	BASE_MODEL=huggyllama_llama-30b LORA=kaiokendev_superhot-30b-8k-no-rlhf-test python export_hf_checkpoint.py
	```

	Converted and quantized with llama.cpp commit `447ccbe`:

	``` sh
	python convert.py superhot-30b-8k-safetensors --outtype f32 --outfile superhot-30b-8k-no-rlhf-test.ggmlv3.f32.bin
	./bin/quantize superhot-30b-8k-no-rlhf-test.ggmlv3.f32.bin superhot-30b-8k-no-rlhf-test.ggmlv3.Q2_K.bin Q2_K
	```
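The output files above follow the pattern `<model>.ggmlv3.<type>.bin`. A small sketch that rebuilds those names and prints the matching `quantize` command without running it, which can help when checking paths before a long conversion. `ggml_name` and `print_quantize_cmds` are hypothetical helpers, and only `Q2_K` is taken from this README:

```sh
#!/bin/sh
# Hypothetical helper: build a file name in the pattern used above,
# <model>.ggmlv3.<type>.bin
ggml_name() {
    printf '%s.ggmlv3.%s.bin\n' "$1" "$2"
}

# Print (do not run) one quantize command per requested type.
# The f32 file is assumed to be the input, as in the commands above.
print_quantize_cmds() {
    model="$1"
    shift
    for qtype in "$@"; do
        echo "./bin/quantize $(ggml_name "$model" f32) $(ggml_name "$model" "$qtype") $qtype"
    done
}

print_quantize_cmds superhot-30b-8k-no-rlhf-test Q2_K
```

For the arguments shown, the printed line is the same `quantize` invocation used above.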