Update README.md
Browse files
README.md
CHANGED
@@ -55,18 +55,18 @@ Refer to [ConvLab-3](https://github.com/ConvLab/ConvLab-3) for model description
|
|
55 |
|
56 |
The following hyperparameters were used during training:
|
57 |
- learning_rate: 0.001
|
58 |
-
- train_batch_size:
|
59 |
- eval_batch_size: 64
|
60 |
- seed: 42
|
61 |
- gradient_accumulation_steps: 2
|
62 |
-
- total_train_batch_size:
|
63 |
- optimizer: Adafactor
|
64 |
- lr_scheduler_type: linear
|
65 |
- num_epochs: 10.0
|
66 |
|
67 |
### Framework versions
|
68 |
|
69 |
-
- Transformers 4.
|
70 |
-
- Pytorch 1.
|
71 |
-
- Datasets
|
72 |
-
- Tokenizers 0.
|
|
|
55 |
|
56 |
The following hyperparameters were used during training:
|
57 |
- learning_rate: 0.001
|
58 |
+
- train_batch_size: 64
|
59 |
- eval_batch_size: 64
|
60 |
- seed: 42
|
61 |
- gradient_accumulation_steps: 2
|
62 |
+
- total_train_batch_size: 128
|
63 |
- optimizer: Adafactor
|
64 |
- lr_scheduler_type: linear
|
65 |
- num_epochs: 10.0
|
66 |
|
67 |
### Framework versions
|
68 |
|
69 |
+
- Transformers 4.20.1
|
70 |
+
- Pytorch 1.11.0+cu113
|
71 |
+
- Datasets 2.3.2
|
72 |
+
- Tokenizers 0.12.1
|