Unbabel/wmt20-comet-qe-da

arXiv:2010.15535

RicardoRei committed on Feb 14, 2023

Commit ca2a08f • 1 Parent(s): eb7385d

WMT20 model

Files changed (3)

README.md +1 -3
checkpoints/model.ckpt +2 -2
hparams.yaml +7 -7

README.md CHANGED

@@ -103,9 +103,7 @@ tags:
 This is a [COMET](https://github.com/Unbabel/COMET) quality estimation model: It receives a source sentence and the respective translation and returns a score that reflects the quality of the translation.
-**NOTE:**
-- This model was recently replaced by an improved version [wmt22-cometkiwi-da](https://huggingface.co/Unbabel/wmt22-cometkiwi-da)
-- This model is equivalent as `wmt20-comet-qe-da-v2` from previous [COMET](https://github.com/Unbabel/COMET) versions (<2.0).
+**NOTE:** This model was recently replaced by an improved version [wmt22-cometkiwi-da](https://huggingface.co/Unbabel/wmt22-cometkiwi-da)
 # Paper

checkpoints/model.ckpt CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:05d892bf4a3e34b9a4de239109387d43107b2a8c55ad34b73a929ca6c1ede24e
-size 2277497201
+oid sha256:0dc381dfa76e78607d95f3ff8245e1b7e7010252fda43e6163802f67eba95732
+size 2277430715
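
The checkpoint is stored as a Git LFS pointer, so the diff only shows the new SHA-256 digest and byte size. A minimal sketch of how a downloaded `model.ckpt` could be checked against that pointer, using only the Python standard library; the helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and not part of COMET or Git LFS:

```python
import hashlib
import os


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    return fields


def matches_pointer(path: str, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at `path` has the size and SHA-256 named by the pointer."""
    algo, _, expected_digest = pointer["oid"].partition(":")
    if algo != "sha256":
        raise ValueError(f"unsupported hash algorithm: {algo}")
    # Cheap check first: a size mismatch means the hash cannot match either.
    if os.path.getsize(path) != int(pointer["size"]):
        return False
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read in chunks so a multi-gigabyte checkpoint is never fully in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected_digest


# Pointer contents for the new checkpoint, copied from this commit's diff.
NEW_POINTER = parse_lfs_pointer(
    """
    version https://git-lfs.github.com/spec/v1
    oid sha256:0dc381dfa76e78607d95f3ff8245e1b7e7010252fda43e6163802f67eba95732
    size 2277430715
    """
)
```

Assuming the checkpoint was downloaded to a local `checkpoints/model.ckpt`, calling `matches_pointer("checkpoints/model.ckpt", NEW_POINTER)` would report whether the local file is the one introduced by this commit.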

hparams.yaml CHANGED

@@ -1,21 +1,21 @@
+# Training Seed 3
 activations: Tanh
-batch_size: 4
+batch_size: 2
 class_identifier: referenceless_regression_metric
 dropout: 0.1
 encoder_learning_rate: 1.0e-05
 encoder_model: XLM-RoBERTa
-final_activation: null
 hidden_sizes:
 - 2048
 - 1024
 keep_embeddings_frozen: true
 layer: mix
 layerwise_decay: 0.95
-learning_rate: 3.1e-05
+learning_rate: 3.0e-05
 load_weights_from_checkpoint: null
-nr_frozen_epochs: 0.3
-optimizer: AdamW
+optimizer: Adam
 pool: avg
 pretrained_model: xlm-roberta-large
-train_data: data/scores-1719.csv
-validation_data: data/2020-mqm.csv
+train_data: data/scores_1719.csv
+validation_data: data/scores_1719.csv
+final_activation: "Sigmoid"
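
To see the hyperparameter changes at a glance, the keys touched by this commit can be compared programmatically. The sketch below hard-codes only the changed keys, with values copied from the `hparams.yaml` diff; `diff_hparams` is a hypothetical helper, and loading the real file would need a YAML parser such as PyYAML, which is not shown here:

```python
# Values before this commit (only the keys that the diff touches).
OLD_HPARAMS = {
    "batch_size": 4,
    "final_activation": None,
    "learning_rate": 3.1e-05,
    "nr_frozen_epochs": 0.3,
    "optimizer": "AdamW",
    "train_data": "data/scores-1719.csv",
    "validation_data": "data/2020-mqm.csv",
}

# Values after this commit.
NEW_HPARAMS = {
    "batch_size": 2,
    "final_activation": "Sigmoid",
    "learning_rate": 3.0e-05,
    "optimizer": "Adam",
    "train_data": "data/scores_1719.csv",
    "validation_data": "data/scores_1719.csv",
}


def diff_hparams(old: dict, new: dict):
    """Return (added keys, removed keys, {key: (old value, new value)})."""
    added = sorted(new.keys() - old.keys())
    removed = sorted(old.keys() - new.keys())
    changed = {
        key: (old[key], new[key])
        for key in sorted(old.keys() & new.keys())
        if old[key] != new[key]
    }
    return added, removed, changed


if __name__ == "__main__":
    added, removed, changed = diff_hparams(OLD_HPARAMS, NEW_HPARAMS)
    print("removed:", removed)
    for key, (before, after) in changed.items():
        print(f"{key}: {before!r} -> {after!r}")
```

With these values, `nr_frozen_epochs` is the only key dropped outright, and the new file lists the same path for `train_data` and `validation_data`.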