metadata
base_model: mlabonne/NeuralMarcoro14-7B
license: apache-2.0
tags:
- mlabonne/NeuralMarcoro14-7B
- dpo
- 7B
- winograd
- mmlu_abstract_algebra
- mistral
datasets:
- hromi/winograd_dpo_basic
udkai_Turdus
A less contaminated version of udkai/Garrulus and the second model to be discussed in the paper Subtle DPO-Contamination with modified Winogrande increases TruthfulQA, Hellaswag & ARC !