Uploaded model

Mistral 7B + SFT + 4-bit DPO training on unalignment/toxic-dpo-v0.2 == ToxicMist? ☣🌫

(Just the LoRA adapter)

Model: akaistormherald/ToxicMist-v0.2-7B-DPO-LORA