Original model: https://huggingface.co/Doctor-Shotgun/Nous-Capybara-limarpv3-34B
Using cleaned pippa.parquet as calibration dataset.
4.65bpw - Can run 21k context in ~23.3GB VRAM with 8bit-cache option.
Original model: https://huggingface.co/Doctor-Shotgun/Nous-Capybara-limarpv3-34B
Using cleaned pippa.parquet as calibration dataset.
4.65bpw - Can run 21k context in ~23.3GB VRAM with 8bit-cache option.