princeton-nlp commited on
Commit
bd61c77
1 Parent(s): 10a6156

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -1
README.md CHANGED
@@ -22,4 +22,5 @@ Instead, compute the quality ratings for windows of up to 512 token and average
22
  In the paper, we document various types of bias that are present in the quality ratings from the QuRater model (biases related to domains, topics, social roles, regions and languages - see Section 6 of the paper).
23
  Hence, be aware that data selection with QuRating could have unintended and harmful effects on the language model that is being trained.
24
  We strongly recommend a comprehensive evaluation of the language model for these and other types of bias, particularly before real-world deployment.
25
- We hope that releasing the data/models can facilitate future research aimed at uncovering and mitigating such biases.
 
 
22
  In the paper, we document various types of bias that are present in the quality ratings from the QuRater model (biases related to domains, topics, social roles, regions and languages - see Section 6 of the paper).
23
  Hence, be aware that data selection with QuRating could have unintended and harmful effects on the language model that is being trained.
24
  We strongly recommend a comprehensive evaluation of the language model for these and other types of bias, particularly before real-world deployment.
25
+ We hope that releasing the data/models can facilitate future research aimed at uncovering and mitigating such biases.
26
+ Note that the quality ratings do not measure the social or literary value of a text and should *not* be used for textual or demographic studies.