ReWiz
Collection
The ReWiz series is based on a subset of data from 3 different data sets, which has been used for fine tuning.
•
11 items
•
Updated
•
1
Half the data was geared towards better reasoning (EvolKit-20k and reasoning-base-20k), the other half will help to de-censor the model (WizardLM data set).
This llama model was trained 2x faster with Unsloth and Huggingface's TRL library.