---
base_model: mlabonne/NeuralMarcoro14-7B
license: apache-2.0
tags:
- mlabonne/NeuralMarcoro14-7B
- dpo
- 7B
- winograd
- mmlu_abstract_algebra
- mistral
datasets:
- hromi/winograd_dpo_basic
---
# udkai_Turdus | |
A less contaminated version of [udkai/Garrulus](https://huggingface.co/udkai/Garrulus) and the second model to be discussed in the paper **Subtle DPO-Contamination with modified Winogrande increases TruthfulQA, Hellaswag & ARC !**