Turdus / README.md
hromi's picture
Update README.md
de8a9fb verified
|
raw
history blame
446 Bytes
metadata
base_model: mlabonne/NeuralMarcoro14-7B
license: apache-2.0
tags:
  - mlabonne/NeuralMarcoro14-7B
  - dpo
  - 7B
  - winograd
  - mmlu_abstract_algebra
  - mistral
datasets:
  - hromi/winograd_dpo_basic

udkai_Turdus

A less contaminated version of udkai/Garrulus and the second model to be discussed in the paper Subtle DPO-Contamination with modified Winogrande increases TruthfulQA, Hellaswag & ARC !