zhuqi committed on
Commit
4d11bc6
1 Parent(s): 634584b

Update README.md

Files changed (1)
  1. README.md +6 -6
README.md CHANGED
@@ -55,18 +55,18 @@ Refer to [ConvLab-3](https://github.com/ConvLab/ConvLab-3) for model description
 
 The following hyperparameters were used during training:
 - learning_rate: 0.001
-- train_batch_size: 128
+- train_batch_size: 64
 - eval_batch_size: 64
 - seed: 42
 - gradient_accumulation_steps: 2
-- total_train_batch_size: 256
+- total_train_batch_size: 128
 - optimizer: Adafactor
 - lr_scheduler_type: linear
 - num_epochs: 10.0
 
 ### Framework versions
 
-- Transformers 4.18.0
-- Pytorch 1.10.2+cu102
-- Datasets 1.18.3
-- Tokenizers 0.11.0
+- Transformers 4.20.1
+- Pytorch 1.11.0+cu113
+- Datasets 2.3.2
+- Tokenizers 0.12.1
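
The batch-size values in this diff are consistent with the usual relationship: total train batch size equals the per-step batch size times the gradient accumulation steps times the number of devices. The sketch below checks that arithmetic for the old and new values. The helper name `total_train_batch_size` and the single-device default are assumptions made for illustration; the README does not state the device count.

```python
def total_train_batch_size(train_batch_size: int,
                           gradient_accumulation_steps: int,
                           num_devices: int = 1) -> int:
    """Effective number of examples per optimizer update.

    num_devices=1 is an assumption; the README does not list it.
    """
    return train_batch_size * gradient_accumulation_steps * num_devices


# Values before this commit: train_batch_size 128, accumulation 2
before = total_train_batch_size(128, 2)
# Values after this commit: train_batch_size 64, accumulation 2
after = total_train_batch_size(64, 2)

print(before, after)
```

Under the single-device assumption, halving `train_batch_size` while keeping `gradient_accumulation_steps` at 2 halves the effective batch size, which matches the change to `total_train_batch_size` in the diff.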