Ray2333/GRM-Gemma-2B-rewardmodel-ft

Ray2333 committed 7 days ago

Commit 1d276fb (1 parent: 2d1c1bd)

Update README.md

Files changed (1)

README.md +1 -0

@@ -19,6 +19,7 @@ The Skywork preference dataset demonstrates that a small high-quality dataset ca
 ## Evaluation
 We evaluate GRM-Gemma-2B-rewardmodel-ft on the [reward model benchmark](https://huggingface.co/spaces/allenai/reward-bench), where it achieved SOTA performance among models smaller than 6B.
+**When evaluated using reward bench, please add '--not_quantized' to avoid performance drop.**
 |       Model               | Average       |  Chat     |     Chat Hard      |     Safety      |     Reasoning     |
 |:-------------------------:|:-------------:|:---------:|:---------:|:--------:|:-----------:|