princeton-nlp
/

QuRater-1.3B

Text Classification

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

princeton-nlp commited on Apr 16

Commit

bd61c77

•

1 Parent(s): 10a6156

Update README.md

Files changed (1) hide show

README.md +2 -1

README.md CHANGED Viewed

@@ -22,4 +22,5 @@ Instead, compute the quality ratings for windows of up to 512 token and average
 In the paper, we document various types of bias that are present in the quality ratings from the QuRater model (biases related to domains, topics, social roles, regions and languages - see Section 6 of the paper).
 Hence, be aware that data selection with QuRating could have unintended and harmful effects on the language model that is being trained.
 We strongly recommend a comprehensive evaluation of the language model for these and other types of bias, particularly before real-world deployment.
-We hope that releasing the data/models can facilitate future research aimed at uncovering and mitigating such biases.

 In the paper, we document various types of bias that are present in the quality ratings from the QuRater model (biases related to domains, topics, social roles, regions and languages - see Section 6 of the paper).
 Hence, be aware that data selection with QuRating could have unintended and harmful effects on the language model that is being trained.
 We strongly recommend a comprehensive evaluation of the language model for these and other types of bias, particularly before real-world deployment.
+We hope that releasing the data/models can facilitate future research aimed at uncovering and mitigating such biases.
+Note that the quality ratings do not measure the social or literary value of a text and should *not* be used for textual or demographic studies.