tmpupload
/

superhot-30b-8k-no-rlhf-test-128g-GPTQ

Text Generation

Inference Endpoints

Model card Files Files and versions Community

superhot-30b-8k-no-rlhf-test-128g-GPTQ / README.md

3v324v23's picture

Model upload.

2d09b37 over 1 year ago

|

668 Bytes

	# superhot-30b-8k-4bit-128g-safetensors

	Merged base LLaMA and LoRA with this: https://github.com/tloen/alpaca-lora
	Base LLaMA 30B: https://huggingface.co/huggyllama/llama-30b
	SuperCOT 30B 8k LoRA: https://huggingface.co/kaiokendev/superhot-30b-8k-no-rlhf-test

	``` sh
	BASE_MODEL=huggyllama_llama-30b LORA=kaiokendev_superhot-30b-8k-no-rlhf-test python export_hf_checkpoint.py
	```

	Quantized with AutoGPTQ: https://github.com/PanQiWei/AutoGPTQ

	``` sh
	python quant_with_alpaca.py --pretrained_model_dir superhot-30b-8k-safetensors --quantized_model_dir superhot-30b-8k-4bit-128g-safetensors --bits 4 --group_size 128 --desc_act --num_samples 256 --save_and_reload
	```