ner-english-ontonotes / training.log
alanakbik's picture
Update model for torch 2.0
ffa7600
2023-04-05 22:30:15,038 ----------------------------------------------------------------------------------------------------
2023-04-05 22:30:15,038 Model: "SequenceTagger(
(embeddings): StackedEmbeddings(
(list_embedding_0): WordEmbeddings(
'en-crawl'
(embedding): Embedding(1000001, 300)
)
(list_embedding_1): FlairEmbeddings(
(lm): LanguageModel(
(drop): Dropout(p=0.05, inplace=False)
(encoder): Embedding(300, 100)
(rnn): LSTM(100, 2048)
)
)
(list_embedding_2): FlairEmbeddings(
(lm): LanguageModel(
(drop): Dropout(p=0.05, inplace=False)
(encoder): Embedding(300, 100)
(rnn): LSTM(100, 2048)
)
)
)
(word_dropout): WordDropout(p=0.05)
(locked_dropout): LockedDropout(p=0.5)
(embedding2nn): Linear(in_features=4396, out_features=4396, bias=True)
(rnn): LSTM(4396, 256, batch_first=True, bidirectional=True)
(linear): Linear(in_features=512, out_features=75, bias=True)
(loss_function): ViterbiLoss()
(crf): CRF()
)"
2023-04-05 22:30:15,038 ----------------------------------------------------------------------------------------------------
2023-04-05 22:30:15,039 Corpus: "Corpus: 75187 train + 9603 dev + 9479 test sentences"
2023-04-05 22:30:15,039 ----------------------------------------------------------------------------------------------------
2023-04-05 22:30:15,039 Parameters:
2023-04-05 22:30:15,039 - learning_rate: "0.100000"
2023-04-05 22:30:15,039 - mini_batch_size: "32"
2023-04-05 22:30:15,039 - patience: "3"
2023-04-05 22:30:15,039 - anneal_factor: "0.5"
2023-04-05 22:30:15,039 - max_epochs: "150"
2023-04-05 22:30:15,039 - shuffle: "True"
2023-04-05 22:30:15,039 - train_with_dev: "True"
2023-04-05 22:30:15,039 - batch_growth_annealing: "False"
2023-04-05 22:30:15,039 ----------------------------------------------------------------------------------------------------
2023-04-05 22:30:15,039 Model training base path: "resources/taggers/release-ner-ontonotes-0"
2023-04-05 22:30:15,039 ----------------------------------------------------------------------------------------------------
2023-04-05 22:30:15,039 Device: cuda:3
2023-04-05 22:30:15,039 ----------------------------------------------------------------------------------------------------
2023-04-05 22:30:15,039 Embeddings storage mode: cpu
2023-04-05 22:30:15,039 ----------------------------------------------------------------------------------------------------
2023-04-05 22:30:32,814 epoch 1 - iter 265/2650 - loss 0.26753384 - time (sec): 17.77 - samples/sec: 3410.92 - lr: 0.100000
2023-04-05 22:31:08,483 epoch 1 - iter 530/2650 - loss 0.27294258 - time (sec): 53.44 - samples/sec: 4058.67 - lr: 0.100000
2023-04-05 22:31:51,964 epoch 1 - iter 795/2650 - loss 0.24279099 - time (sec): 96.93 - samples/sec: 4054.39 - lr: 0.100000
2023-04-05 22:32:29,163 epoch 1 - iter 1060/2650 - loss 0.22574983 - time (sec): 134.12 - samples/sec: 4104.69 - lr: 0.100000
2023-04-05 22:32:51,931 epoch 1 - iter 1325/2650 - loss 0.18673350 - time (sec): 156.89 - samples/sec: 4273.29 - lr: 0.100000
2023-04-05 22:33:21,764 epoch 1 - iter 1590/2650 - loss 0.16883297 - time (sec): 186.72 - samples/sec: 4340.27 - lr: 0.100000
2023-04-05 22:34:18,793 epoch 1 - iter 1855/2650 - loss 0.16963428 - time (sec): 243.75 - samples/sec: 4205.97 - lr: 0.100000
2023-04-05 22:34:54,838 epoch 1 - iter 2120/2650 - loss 0.16383173 - time (sec): 279.80 - samples/sec: 4277.99 - lr: 0.100000
2023-04-05 22:35:24,373 epoch 1 - iter 2385/2650 - loss 0.15943840 - time (sec): 309.33 - samples/sec: 4224.09 - lr: 0.100000
2023-04-05 22:36:00,651 epoch 1 - iter 2650/2650 - loss 0.15617985 - time (sec): 345.61 - samples/sec: 4231.23 - lr: 0.100000
2023-04-05 22:36:00,651 ----------------------------------------------------------------------------------------------------
2023-04-05 22:36:00,651 EPOCH 1 done: loss 0.1562 - lr 0.100000
2023-04-05 22:36:00,651 BAD EPOCHS (no improvement): 0
2023-04-05 22:36:00,654 ----------------------------------------------------------------------------------------------------
2023-04-05 22:36:20,175 epoch 2 - iter 265/2650 - loss 0.11266887 - time (sec): 19.52 - samples/sec: 7514.22 - lr: 0.100000
2023-04-05 22:36:40,387 epoch 2 - iter 530/2650 - loss 0.10817789 - time (sec): 39.73 - samples/sec: 7381.47 - lr: 0.100000
2023-04-05 22:37:00,484 epoch 2 - iter 795/2650 - loss 0.10710700 - time (sec): 59.83 - samples/sec: 7365.19 - lr: 0.100000
2023-04-05 22:37:20,242 epoch 2 - iter 1060/2650 - loss 0.10320501 - time (sec): 79.59 - samples/sec: 7384.58 - lr: 0.100000
2023-04-05 22:37:39,929 epoch 2 - iter 1325/2650 - loss 0.10145208 - time (sec): 99.27 - samples/sec: 7397.57 - lr: 0.100000
2023-04-05 22:37:59,130 epoch 2 - iter 1590/2650 - loss 0.09967449 - time (sec): 118.48 - samples/sec: 7419.91 - lr: 0.100000
2023-04-05 22:38:18,213 epoch 2 - iter 1855/2650 - loss 0.09813847 - time (sec): 137.56 - samples/sec: 7457.22 - lr: 0.100000
2023-04-05 22:38:37,939 epoch 2 - iter 2120/2650 - loss 0.09664446 - time (sec): 157.28 - samples/sec: 7446.97 - lr: 0.100000
2023-04-05 22:38:57,208 epoch 2 - iter 2385/2650 - loss 0.09547294 - time (sec): 176.55 - samples/sec: 7453.94 - lr: 0.100000
2023-04-05 22:39:17,307 epoch 2 - iter 2650/2650 - loss 0.09408016 - time (sec): 196.65 - samples/sec: 7436.28 - lr: 0.100000
2023-04-05 22:39:17,307 ----------------------------------------------------------------------------------------------------
2023-04-05 22:39:17,307 EPOCH 2 done: loss 0.0941 - lr 0.100000
2023-04-05 22:39:17,307 BAD EPOCHS (no improvement): 0
2023-04-05 22:39:17,310 ----------------------------------------------------------------------------------------------------
2023-04-05 22:39:37,267 epoch 3 - iter 265/2650 - loss 0.07783923 - time (sec): 19.96 - samples/sec: 7299.54 - lr: 0.100000
2023-04-05 22:39:57,216 epoch 3 - iter 530/2650 - loss 0.07650488 - time (sec): 39.91 - samples/sec: 7265.92 - lr: 0.100000
2023-04-05 22:40:16,963 epoch 3 - iter 795/2650 - loss 0.07719409 - time (sec): 59.65 - samples/sec: 7344.12 - lr: 0.100000
2023-04-05 22:40:36,712 epoch 3 - iter 1060/2650 - loss 0.07616210 - time (sec): 79.40 - samples/sec: 7340.71 - lr: 0.100000
2023-04-05 22:40:55,833 epoch 3 - iter 1325/2650 - loss 0.07542488 - time (sec): 98.52 - samples/sec: 7400.24 - lr: 0.100000
2023-04-05 22:41:15,217 epoch 3 - iter 1590/2650 - loss 0.07535956 - time (sec): 117.91 - samples/sec: 7422.77 - lr: 0.100000
2023-04-05 22:41:35,620 epoch 3 - iter 1855/2650 - loss 0.07497975 - time (sec): 138.31 - samples/sec: 7393.97 - lr: 0.100000
2023-04-05 22:41:55,694 epoch 3 - iter 2120/2650 - loss 0.07488640 - time (sec): 158.38 - samples/sec: 7387.59 - lr: 0.100000
2023-04-05 22:42:14,676 epoch 3 - iter 2385/2650 - loss 0.07452064 - time (sec): 177.37 - samples/sec: 7409.65 - lr: 0.100000
2023-04-05 22:42:34,632 epoch 3 - iter 2650/2650 - loss 0.07399061 - time (sec): 197.32 - samples/sec: 7411.06 - lr: 0.100000
2023-04-05 22:42:34,633 ----------------------------------------------------------------------------------------------------
2023-04-05 22:42:34,633 EPOCH 3 done: loss 0.0740 - lr 0.100000
2023-04-05 22:42:34,633 BAD EPOCHS (no improvement): 0
2023-04-05 22:42:34,637 ----------------------------------------------------------------------------------------------------
2023-04-05 22:42:54,041 epoch 4 - iter 265/2650 - loss 0.06719111 - time (sec): 19.40 - samples/sec: 7442.48 - lr: 0.100000
2023-04-05 22:43:13,462 epoch 4 - iter 530/2650 - loss 0.06563034 - time (sec): 38.83 - samples/sec: 7441.37 - lr: 0.100000
2023-04-05 22:43:33,226 epoch 4 - iter 795/2650 - loss 0.06592207 - time (sec): 58.59 - samples/sec: 7433.62 - lr: 0.100000
2023-04-05 22:43:53,627 epoch 4 - iter 1060/2650 - loss 0.06555259 - time (sec): 78.99 - samples/sec: 7398.78 - lr: 0.100000
2023-04-05 22:44:12,939 epoch 4 - iter 1325/2650 - loss 0.06575556 - time (sec): 98.30 - samples/sec: 7430.50 - lr: 0.100000
2023-04-05 22:44:32,656 epoch 4 - iter 1590/2650 - loss 0.06516319 - time (sec): 118.02 - samples/sec: 7431.80 - lr: 0.100000
2023-04-05 22:44:52,404 epoch 4 - iter 1855/2650 - loss 0.06536955 - time (sec): 137.77 - samples/sec: 7422.69 - lr: 0.100000
2023-04-05 22:45:12,221 epoch 4 - iter 2120/2650 - loss 0.06486299 - time (sec): 157.58 - samples/sec: 7425.58 - lr: 0.100000
2023-04-05 22:45:31,977 epoch 4 - iter 2385/2650 - loss 0.06478095 - time (sec): 177.34 - samples/sec: 7418.20 - lr: 0.100000
2023-04-05 22:45:52,125 epoch 4 - iter 2650/2650 - loss 0.06459631 - time (sec): 197.49 - samples/sec: 7404.83 - lr: 0.100000
2023-04-05 22:45:52,125 ----------------------------------------------------------------------------------------------------
2023-04-05 22:45:52,125 EPOCH 4 done: loss 0.0646 - lr 0.100000
2023-04-05 22:45:52,125 BAD EPOCHS (no improvement): 0
2023-04-05 22:45:52,129 ----------------------------------------------------------------------------------------------------
2023-04-05 22:46:11,939 epoch 5 - iter 265/2650 - loss 0.05973687 - time (sec): 19.81 - samples/sec: 7357.18 - lr: 0.100000
2023-04-05 22:46:31,940 epoch 5 - iter 530/2650 - loss 0.05917111 - time (sec): 39.81 - samples/sec: 7354.57 - lr: 0.100000
2023-04-05 22:46:50,759 epoch 5 - iter 795/2650 - loss 0.05935966 - time (sec): 58.63 - samples/sec: 7461.84 - lr: 0.100000
2023-04-05 22:47:10,249 epoch 5 - iter 1060/2650 - loss 0.05946565 - time (sec): 78.12 - samples/sec: 7461.49 - lr: 0.100000
2023-04-05 22:47:29,535 epoch 5 - iter 1325/2650 - loss 0.05923236 - time (sec): 97.41 - samples/sec: 7475.72 - lr: 0.100000
2023-04-05 22:47:49,495 epoch 5 - iter 1590/2650 - loss 0.05918268 - time (sec): 117.37 - samples/sec: 7445.03 - lr: 0.100000
2023-04-05 22:48:09,579 epoch 5 - iter 1855/2650 - loss 0.05875262 - time (sec): 137.45 - samples/sec: 7424.07 - lr: 0.100000
2023-04-05 22:48:30,187 epoch 5 - iter 2120/2650 - loss 0.05927092 - time (sec): 158.06 - samples/sec: 7393.11 - lr: 0.100000
2023-04-05 22:48:50,165 epoch 5 - iter 2385/2650 - loss 0.05920544 - time (sec): 178.04 - samples/sec: 7394.84 - lr: 0.100000
2023-04-05 22:49:09,896 epoch 5 - iter 2650/2650 - loss 0.05907764 - time (sec): 197.77 - samples/sec: 7394.37 - lr: 0.100000
2023-04-05 22:49:09,897 ----------------------------------------------------------------------------------------------------
2023-04-05 22:49:09,897 EPOCH 5 done: loss 0.0591 - lr 0.100000
2023-04-05 22:49:09,897 BAD EPOCHS (no improvement): 0
2023-04-05 22:49:09,900 ----------------------------------------------------------------------------------------------------
2023-04-05 22:49:29,334 epoch 6 - iter 265/2650 - loss 0.05432288 - time (sec): 19.43 - samples/sec: 7516.43 - lr: 0.100000
2023-04-05 22:49:49,597 epoch 6 - iter 530/2650 - loss 0.05452878 - time (sec): 39.70 - samples/sec: 7356.38 - lr: 0.100000
2023-04-05 22:50:09,358 epoch 6 - iter 795/2650 - loss 0.05516482 - time (sec): 59.46 - samples/sec: 7374.86 - lr: 0.100000
2023-04-05 22:50:29,035 epoch 6 - iter 1060/2650 - loss 0.05541375 - time (sec): 79.14 - samples/sec: 7376.45 - lr: 0.100000
2023-04-05 22:50:49,221 epoch 6 - iter 1325/2650 - loss 0.05551200 - time (sec): 99.32 - samples/sec: 7349.54 - lr: 0.100000
2023-04-05 22:51:08,701 epoch 6 - iter 1590/2650 - loss 0.05539049 - time (sec): 118.80 - samples/sec: 7362.18 - lr: 0.100000
2023-04-05 22:51:28,697 epoch 6 - iter 1855/2650 - loss 0.05542507 - time (sec): 138.80 - samples/sec: 7373.95 - lr: 0.100000
2023-04-05 22:51:48,548 epoch 6 - iter 2120/2650 - loss 0.05541067 - time (sec): 158.65 - samples/sec: 7371.34 - lr: 0.100000
2023-04-05 22:52:08,340 epoch 6 - iter 2385/2650 - loss 0.05545153 - time (sec): 178.44 - samples/sec: 7376.38 - lr: 0.100000
2023-04-05 22:52:27,814 epoch 6 - iter 2650/2650 - loss 0.05526669 - time (sec): 197.91 - samples/sec: 7388.86 - lr: 0.100000
2023-04-05 22:52:27,815 ----------------------------------------------------------------------------------------------------
2023-04-05 22:52:27,815 EPOCH 6 done: loss 0.0553 - lr 0.100000
2023-04-05 22:52:27,815 BAD EPOCHS (no improvement): 0
2023-04-05 22:52:27,818 ----------------------------------------------------------------------------------------------------
2023-04-05 22:52:47,338 epoch 7 - iter 265/2650 - loss 0.04891490 - time (sec): 19.52 - samples/sec: 7429.35 - lr: 0.100000
2023-04-05 22:53:06,924 epoch 7 - iter 530/2650 - loss 0.05120080 - time (sec): 39.11 - samples/sec: 7420.36 - lr: 0.100000
2023-04-05 22:53:27,114 epoch 7 - iter 795/2650 - loss 0.05129684 - time (sec): 59.30 - samples/sec: 7359.66 - lr: 0.100000
2023-04-05 22:53:46,768 epoch 7 - iter 1060/2650 - loss 0.05161041 - time (sec): 78.95 - samples/sec: 7380.19 - lr: 0.100000
2023-04-05 22:54:07,000 epoch 7 - iter 1325/2650 - loss 0.05165356 - time (sec): 99.18 - samples/sec: 7363.82 - lr: 0.100000
2023-04-05 22:54:27,113 epoch 7 - iter 1590/2650 - loss 0.05159275 - time (sec): 119.29 - samples/sec: 7357.10 - lr: 0.100000
2023-04-05 22:54:47,334 epoch 7 - iter 1855/2650 - loss 0.05192562 - time (sec): 139.52 - samples/sec: 7341.50 - lr: 0.100000
2023-04-05 22:55:07,101 epoch 7 - iter 2120/2650 - loss 0.05203989 - time (sec): 159.28 - samples/sec: 7353.59 - lr: 0.100000
2023-04-05 22:55:26,223 epoch 7 - iter 2385/2650 - loss 0.05204132 - time (sec): 178.40 - samples/sec: 7377.60 - lr: 0.100000
2023-04-05 22:55:45,962 epoch 7 - iter 2650/2650 - loss 0.05201672 - time (sec): 198.14 - samples/sec: 7380.33 - lr: 0.100000
2023-04-05 22:55:45,962 ----------------------------------------------------------------------------------------------------
2023-04-05 22:55:45,962 EPOCH 7 done: loss 0.0520 - lr 0.100000
2023-04-05 22:55:45,962 BAD EPOCHS (no improvement): 0
2023-04-05 22:55:45,966 ----------------------------------------------------------------------------------------------------
2023-04-05 22:56:05,316 epoch 8 - iter 265/2650 - loss 0.04853580 - time (sec): 19.35 - samples/sec: 7511.67 - lr: 0.100000
2023-04-05 22:56:25,019 epoch 8 - iter 530/2650 - loss 0.04745612 - time (sec): 39.05 - samples/sec: 7443.61 - lr: 0.100000
2023-04-05 22:56:44,649 epoch 8 - iter 795/2650 - loss 0.04766666 - time (sec): 58.68 - samples/sec: 7445.38 - lr: 0.100000
2023-04-05 22:57:03,853 epoch 8 - iter 1060/2650 - loss 0.04851233 - time (sec): 77.89 - samples/sec: 7495.21 - lr: 0.100000
2023-04-05 22:57:24,061 epoch 8 - iter 1325/2650 - loss 0.04905184 - time (sec): 98.09 - samples/sec: 7448.51 - lr: 0.100000
2023-04-05 22:57:44,701 epoch 8 - iter 1590/2650 - loss 0.04925669 - time (sec): 118.73 - samples/sec: 7396.58 - lr: 0.100000
2023-04-05 22:58:04,165 epoch 8 - iter 1855/2650 - loss 0.04955004 - time (sec): 138.20 - samples/sec: 7410.96 - lr: 0.100000
2023-04-05 22:58:24,256 epoch 8 - iter 2120/2650 - loss 0.04949264 - time (sec): 158.29 - samples/sec: 7399.93 - lr: 0.100000
2023-04-05 22:58:44,417 epoch 8 - iter 2385/2650 - loss 0.04950536 - time (sec): 178.45 - samples/sec: 7383.48 - lr: 0.100000
2023-04-05 22:59:03,464 epoch 8 - iter 2650/2650 - loss 0.04943978 - time (sec): 197.50 - samples/sec: 7404.46 - lr: 0.100000
2023-04-05 22:59:03,464 ----------------------------------------------------------------------------------------------------
2023-04-05 22:59:03,464 EPOCH 8 done: loss 0.0494 - lr 0.100000
2023-04-05 22:59:03,464 BAD EPOCHS (no improvement): 0
2023-04-05 22:59:03,467 ----------------------------------------------------------------------------------------------------
2023-04-05 22:59:23,247 epoch 9 - iter 265/2650 - loss 0.04629974 - time (sec): 19.78 - samples/sec: 7400.70 - lr: 0.100000
2023-04-05 22:59:42,953 epoch 9 - iter 530/2650 - loss 0.04662181 - time (sec): 39.49 - samples/sec: 7356.32 - lr: 0.100000
2023-04-05 23:00:02,695 epoch 9 - iter 795/2650 - loss 0.04730929 - time (sec): 59.23 - samples/sec: 7378.40 - lr: 0.100000
2023-04-05 23:00:22,037 epoch 9 - iter 1060/2650 - loss 0.04736771 - time (sec): 78.57 - samples/sec: 7395.22 - lr: 0.100000
2023-04-05 23:00:42,649 epoch 9 - iter 1325/2650 - loss 0.04760472 - time (sec): 99.18 - samples/sec: 7359.11 - lr: 0.100000
2023-04-05 23:01:02,282 epoch 9 - iter 1590/2650 - loss 0.04773742 - time (sec): 118.82 - samples/sec: 7380.24 - lr: 0.100000
2023-04-05 23:01:21,265 epoch 9 - iter 1855/2650 - loss 0.04813461 - time (sec): 137.80 - samples/sec: 7415.81 - lr: 0.100000
2023-04-05 23:01:41,073 epoch 9 - iter 2120/2650 - loss 0.04796157 - time (sec): 157.61 - samples/sec: 7404.29 - lr: 0.100000
2023-04-05 23:02:01,310 epoch 9 - iter 2385/2650 - loss 0.04766473 - time (sec): 177.84 - samples/sec: 7394.19 - lr: 0.100000
2023-04-05 23:02:21,121 epoch 9 - iter 2650/2650 - loss 0.04761529 - time (sec): 197.65 - samples/sec: 7398.59 - lr: 0.100000
2023-04-05 23:02:21,122 ----------------------------------------------------------------------------------------------------
2023-04-05 23:02:21,122 EPOCH 9 done: loss 0.0476 - lr 0.100000
2023-04-05 23:02:21,122 BAD EPOCHS (no improvement): 0
2023-04-05 23:02:21,126 ----------------------------------------------------------------------------------------------------
2023-04-05 23:02:40,052 epoch 10 - iter 265/2650 - loss 0.04401852 - time (sec): 18.93 - samples/sec: 7641.22 - lr: 0.100000
2023-04-05 23:03:00,049 epoch 10 - iter 530/2650 - loss 0.04465764 - time (sec): 38.92 - samples/sec: 7511.87 - lr: 0.100000
2023-04-05 23:03:20,403 epoch 10 - iter 795/2650 - loss 0.04494720 - time (sec): 59.28 - samples/sec: 7419.15 - lr: 0.100000
2023-04-05 23:03:40,295 epoch 10 - iter 1060/2650 - loss 0.04523321 - time (sec): 79.17 - samples/sec: 7386.01 - lr: 0.100000
2023-04-05 23:04:00,351 epoch 10 - iter 1325/2650 - loss 0.04516569 - time (sec): 99.22 - samples/sec: 7370.28 - lr: 0.100000
2023-04-05 23:04:20,180 epoch 10 - iter 1590/2650 - loss 0.04527092 - time (sec): 119.05 - samples/sec: 7384.36 - lr: 0.100000
2023-04-05 23:04:39,938 epoch 10 - iter 1855/2650 - loss 0.04589615 - time (sec): 138.81 - samples/sec: 7389.48 - lr: 0.100000
2023-04-05 23:04:59,296 epoch 10 - iter 2120/2650 - loss 0.04617516 - time (sec): 158.17 - samples/sec: 7404.27 - lr: 0.100000
2023-04-05 23:05:18,613 epoch 10 - iter 2385/2650 - loss 0.04630414 - time (sec): 177.49 - samples/sec: 7417.67 - lr: 0.100000
2023-04-05 23:05:38,785 epoch 10 - iter 2650/2650 - loss 0.04598758 - time (sec): 197.66 - samples/sec: 7398.42 - lr: 0.100000
2023-04-05 23:05:38,785 ----------------------------------------------------------------------------------------------------
2023-04-05 23:05:38,785 EPOCH 10 done: loss 0.0460 - lr 0.100000
2023-04-05 23:05:38,786 BAD EPOCHS (no improvement): 0
2023-04-05 23:05:38,790 ----------------------------------------------------------------------------------------------------
2023-04-05 23:05:59,245 epoch 11 - iter 265/2650 - loss 0.04308864 - time (sec): 20.46 - samples/sec: 7184.88 - lr: 0.100000
2023-04-05 23:06:18,165 epoch 11 - iter 530/2650 - loss 0.04341556 - time (sec): 39.38 - samples/sec: 7384.04 - lr: 0.100000
2023-04-05 23:06:38,033 epoch 11 - iter 795/2650 - loss 0.04383832 - time (sec): 59.24 - samples/sec: 7351.10 - lr: 0.100000
2023-04-05 23:06:58,051 epoch 11 - iter 1060/2650 - loss 0.04354113 - time (sec): 79.26 - samples/sec: 7357.07 - lr: 0.100000
2023-04-05 23:07:17,533 epoch 11 - iter 1325/2650 - loss 0.04422198 - time (sec): 98.74 - samples/sec: 7389.02 - lr: 0.100000
2023-04-05 23:07:37,431 epoch 11 - iter 1590/2650 - loss 0.04468420 - time (sec): 118.64 - samples/sec: 7399.86 - lr: 0.100000
2023-04-05 23:07:56,955 epoch 11 - iter 1855/2650 - loss 0.04430505 - time (sec): 138.16 - samples/sec: 7403.42 - lr: 0.100000
2023-04-05 23:08:16,911 epoch 11 - iter 2120/2650 - loss 0.04443916 - time (sec): 158.12 - samples/sec: 7405.10 - lr: 0.100000
2023-04-05 23:08:36,476 epoch 11 - iter 2385/2650 - loss 0.04487273 - time (sec): 177.69 - samples/sec: 7410.68 - lr: 0.100000
2023-04-05 23:08:56,565 epoch 11 - iter 2650/2650 - loss 0.04488036 - time (sec): 197.78 - samples/sec: 7394.05 - lr: 0.100000
2023-04-05 23:08:56,566 ----------------------------------------------------------------------------------------------------
2023-04-05 23:08:56,566 EPOCH 11 done: loss 0.0449 - lr 0.100000
2023-04-05 23:08:56,566 BAD EPOCHS (no improvement): 0
2023-04-05 23:08:56,568 ----------------------------------------------------------------------------------------------------
2023-04-05 23:09:15,735 epoch 12 - iter 265/2650 - loss 0.04334881 - time (sec): 19.17 - samples/sec: 7616.70 - lr: 0.100000
2023-04-05 23:09:35,985 epoch 12 - iter 530/2650 - loss 0.04285008 - time (sec): 39.42 - samples/sec: 7405.04 - lr: 0.100000
2023-04-05 23:09:56,340 epoch 12 - iter 795/2650 - loss 0.04256310 - time (sec): 59.77 - samples/sec: 7346.53 - lr: 0.100000
2023-04-05 23:10:25,880 epoch 12 - iter 1060/2650 - loss 0.04275866 - time (sec): 89.31 - samples/sec: 6540.63 - lr: 0.100000
2023-04-05 23:10:45,682 epoch 12 - iter 1325/2650 - loss 0.04238591 - time (sec): 109.11 - samples/sec: 6697.27 - lr: 0.100000
2023-04-05 23:11:05,621 epoch 12 - iter 1590/2650 - loss 0.04256063 - time (sec): 129.05 - samples/sec: 6790.77 - lr: 0.100000
2023-04-05 23:11:25,100 epoch 12 - iter 1855/2650 - loss 0.04275237 - time (sec): 148.53 - samples/sec: 6881.81 - lr: 0.100000
2023-04-05 23:11:45,477 epoch 12 - iter 2120/2650 - loss 0.04285750 - time (sec): 168.91 - samples/sec: 6922.19 - lr: 0.100000
2023-04-05 23:12:05,261 epoch 12 - iter 2385/2650 - loss 0.04281505 - time (sec): 188.69 - samples/sec: 6971.98 - lr: 0.100000
2023-04-05 23:12:25,108 epoch 12 - iter 2650/2650 - loss 0.04306715 - time (sec): 208.54 - samples/sec: 7012.39 - lr: 0.100000
2023-04-05 23:12:25,108 ----------------------------------------------------------------------------------------------------
2023-04-05 23:12:25,109 EPOCH 12 done: loss 0.0431 - lr 0.100000
2023-04-05 23:12:25,109 BAD EPOCHS (no improvement): 0
2023-04-05 23:12:25,111 ----------------------------------------------------------------------------------------------------
2023-04-05 23:12:44,587 epoch 13 - iter 265/2650 - loss 0.04142773 - time (sec): 19.48 - samples/sec: 7504.99 - lr: 0.100000
2023-04-05 23:13:04,053 epoch 13 - iter 530/2650 - loss 0.04156061 - time (sec): 38.94 - samples/sec: 7455.74 - lr: 0.100000
2023-04-05 23:13:24,198 epoch 13 - iter 795/2650 - loss 0.04179977 - time (sec): 59.09 - samples/sec: 7417.48 - lr: 0.100000
2023-04-05 23:13:44,205 epoch 13 - iter 1060/2650 - loss 0.04148937 - time (sec): 79.09 - samples/sec: 7364.55 - lr: 0.100000
2023-04-05 23:14:04,428 epoch 13 - iter 1325/2650 - loss 0.04152584 - time (sec): 99.32 - samples/sec: 7355.39 - lr: 0.100000
2023-04-05 23:14:24,564 epoch 13 - iter 1590/2650 - loss 0.04201696 - time (sec): 119.45 - samples/sec: 7357.53 - lr: 0.100000
2023-04-05 23:14:44,529 epoch 13 - iter 1855/2650 - loss 0.04233746 - time (sec): 139.42 - samples/sec: 7348.26 - lr: 0.100000
2023-04-05 23:15:04,002 epoch 13 - iter 2120/2650 - loss 0.04244611 - time (sec): 158.89 - samples/sec: 7359.73 - lr: 0.100000
2023-04-05 23:15:23,465 epoch 13 - iter 2385/2650 - loss 0.04262677 - time (sec): 178.35 - samples/sec: 7376.37 - lr: 0.100000
2023-04-05 23:15:43,269 epoch 13 - iter 2650/2650 - loss 0.04272274 - time (sec): 198.16 - samples/sec: 7379.81 - lr: 0.100000
2023-04-05 23:15:43,269 ----------------------------------------------------------------------------------------------------
2023-04-05 23:15:43,269 EPOCH 13 done: loss 0.0427 - lr 0.100000
2023-04-05 23:15:43,269 BAD EPOCHS (no improvement): 0
2023-04-05 23:15:43,273 ----------------------------------------------------------------------------------------------------
2023-04-05 23:16:02,785 epoch 14 - iter 265/2650 - loss 0.04040378 - time (sec): 19.51 - samples/sec: 7496.29 - lr: 0.100000
2023-04-05 23:16:22,612 epoch 14 - iter 530/2650 - loss 0.04051854 - time (sec): 39.34 - samples/sec: 7432.54 - lr: 0.100000
2023-04-05 23:16:42,152 epoch 14 - iter 795/2650 - loss 0.04045287 - time (sec): 58.88 - samples/sec: 7463.45 - lr: 0.100000
2023-04-05 23:17:01,779 epoch 14 - iter 1060/2650 - loss 0.04071775 - time (sec): 78.51 - samples/sec: 7444.60 - lr: 0.100000
2023-04-05 23:17:21,518 epoch 14 - iter 1325/2650 - loss 0.04123238 - time (sec): 98.24 - samples/sec: 7426.64 - lr: 0.100000
2023-04-05 23:17:41,438 epoch 14 - iter 1590/2650 - loss 0.04129789 - time (sec): 118.16 - samples/sec: 7410.28 - lr: 0.100000
2023-04-05 23:18:01,440 epoch 14 - iter 1855/2650 - loss 0.04154665 - time (sec): 138.17 - samples/sec: 7389.66 - lr: 0.100000
2023-04-05 23:18:21,712 epoch 14 - iter 2120/2650 - loss 0.04146642 - time (sec): 158.44 - samples/sec: 7370.57 - lr: 0.100000
2023-04-05 23:18:41,916 epoch 14 - iter 2385/2650 - loss 0.04167842 - time (sec): 178.64 - samples/sec: 7371.37 - lr: 0.100000
2023-04-05 23:19:01,361 epoch 14 - iter 2650/2650 - loss 0.04158336 - time (sec): 198.09 - samples/sec: 7382.40 - lr: 0.100000
2023-04-05 23:19:01,361 ----------------------------------------------------------------------------------------------------
2023-04-05 23:19:01,362 EPOCH 14 done: loss 0.0416 - lr 0.100000
2023-04-05 23:19:01,362 BAD EPOCHS (no improvement): 0
2023-04-05 23:19:01,364 ----------------------------------------------------------------------------------------------------
2023-04-05 23:19:21,964 epoch 15 - iter 265/2650 - loss 0.04057739 - time (sec): 20.60 - samples/sec: 7215.38 - lr: 0.100000
2023-04-05 23:19:41,561 epoch 15 - iter 530/2650 - loss 0.04016299 - time (sec): 40.20 - samples/sec: 7292.50 - lr: 0.100000
2023-04-05 23:20:01,728 epoch 15 - iter 795/2650 - loss 0.04025444 - time (sec): 60.36 - samples/sec: 7290.78 - lr: 0.100000
2023-04-05 23:20:21,120 epoch 15 - iter 1060/2650 - loss 0.04048143 - time (sec): 79.76 - samples/sec: 7343.57 - lr: 0.100000
2023-04-05 23:20:40,469 epoch 15 - iter 1325/2650 - loss 0.04087959 - time (sec): 99.10 - samples/sec: 7393.69 - lr: 0.100000
2023-04-05 23:21:00,293 epoch 15 - iter 1590/2650 - loss 0.04062910 - time (sec): 118.93 - samples/sec: 7378.63 - lr: 0.100000
2023-04-05 23:21:19,930 epoch 15 - iter 1855/2650 - loss 0.04104456 - time (sec): 138.57 - samples/sec: 7383.22 - lr: 0.100000
2023-04-05 23:21:39,700 epoch 15 - iter 2120/2650 - loss 0.04109267 - time (sec): 158.34 - samples/sec: 7400.18 - lr: 0.100000
2023-04-05 23:21:59,117 epoch 15 - iter 2385/2650 - loss 0.04104225 - time (sec): 177.75 - samples/sec: 7413.57 - lr: 0.100000
2023-04-05 23:22:18,565 epoch 15 - iter 2650/2650 - loss 0.04103241 - time (sec): 197.20 - samples/sec: 7415.62 - lr: 0.100000
2023-04-05 23:22:18,565 ----------------------------------------------------------------------------------------------------
2023-04-05 23:22:18,565 EPOCH 15 done: loss 0.0410 - lr 0.100000
2023-04-05 23:22:18,565 BAD EPOCHS (no improvement): 0
2023-04-05 23:22:18,568 ----------------------------------------------------------------------------------------------------
2023-04-05 23:22:38,201 epoch 16 - iter 265/2650 - loss 0.03886402 - time (sec): 19.63 - samples/sec: 7398.21 - lr: 0.100000
2023-04-05 23:22:58,514 epoch 16 - iter 530/2650 - loss 0.03856798 - time (sec): 39.95 - samples/sec: 7335.67 - lr: 0.100000
2023-04-05 23:23:17,939 epoch 16 - iter 795/2650 - loss 0.03875860 - time (sec): 59.37 - samples/sec: 7415.29 - lr: 0.100000
2023-04-05 23:23:37,572 epoch 16 - iter 1060/2650 - loss 0.03930614 - time (sec): 79.00 - samples/sec: 7428.67 - lr: 0.100000
2023-04-05 23:23:57,815 epoch 16 - iter 1325/2650 - loss 0.03920468 - time (sec): 99.25 - samples/sec: 7404.11 - lr: 0.100000
2023-04-05 23:24:17,587 epoch 16 - iter 1590/2650 - loss 0.03972817 - time (sec): 119.02 - samples/sec: 7396.41 - lr: 0.100000
2023-04-05 23:24:37,231 epoch 16 - iter 1855/2650 - loss 0.03969746 - time (sec): 138.66 - samples/sec: 7404.57 - lr: 0.100000
2023-04-05 23:24:56,968 epoch 16 - iter 2120/2650 - loss 0.03985046 - time (sec): 158.40 - samples/sec: 7401.81 - lr: 0.100000
2023-04-05 23:25:16,312 epoch 16 - iter 2385/2650 - loss 0.03977147 - time (sec): 177.74 - samples/sec: 7408.41 - lr: 0.100000
2023-04-05 23:25:36,004 epoch 16 - iter 2650/2650 - loss 0.03978966 - time (sec): 197.44 - samples/sec: 7406.76 - lr: 0.100000
2023-04-05 23:25:36,005 ----------------------------------------------------------------------------------------------------
2023-04-05 23:25:36,005 EPOCH 16 done: loss 0.0398 - lr 0.100000
2023-04-05 23:25:36,005 BAD EPOCHS (no improvement): 0
2023-04-05 23:25:36,008 ----------------------------------------------------------------------------------------------------
2023-04-05 23:25:55,680 epoch 17 - iter 265/2650 - loss 0.03893019 - time (sec): 19.67 - samples/sec: 7540.89 - lr: 0.100000
2023-04-05 23:26:15,159 epoch 17 - iter 530/2650 - loss 0.03896850 - time (sec): 39.15 - samples/sec: 7538.03 - lr: 0.100000
2023-04-05 23:26:34,992 epoch 17 - iter 795/2650 - loss 0.03868454 - time (sec): 58.98 - samples/sec: 7438.92 - lr: 0.100000
2023-04-05 23:26:54,357 epoch 17 - iter 1060/2650 - loss 0.03900134 - time (sec): 78.35 - samples/sec: 7468.06 - lr: 0.100000
2023-04-05 23:27:14,422 epoch 17 - iter 1325/2650 - loss 0.03886878 - time (sec): 98.41 - samples/sec: 7438.71 - lr: 0.100000
2023-04-05 23:27:34,567 epoch 17 - iter 1590/2650 - loss 0.03908285 - time (sec): 118.56 - samples/sec: 7422.44 - lr: 0.100000
2023-04-05 23:27:54,046 epoch 17 - iter 1855/2650 - loss 0.03899825 - time (sec): 138.04 - samples/sec: 7426.49 - lr: 0.100000
2023-04-05 23:28:13,796 epoch 17 - iter 2120/2650 - loss 0.03916759 - time (sec): 157.79 - samples/sec: 7429.28 - lr: 0.100000
2023-04-05 23:28:33,115 epoch 17 - iter 2385/2650 - loss 0.03939620 - time (sec): 177.11 - samples/sec: 7432.65 - lr: 0.100000
2023-04-05 23:28:52,933 epoch 17 - iter 2650/2650 - loss 0.03953680 - time (sec): 196.93 - samples/sec: 7425.98 - lr: 0.100000
2023-04-05 23:28:52,933 ----------------------------------------------------------------------------------------------------
2023-04-05 23:28:52,933 EPOCH 17 done: loss 0.0395 - lr 0.100000
2023-04-05 23:28:52,934 BAD EPOCHS (no improvement): 0
2023-04-05 23:28:52,938 ----------------------------------------------------------------------------------------------------
2023-04-05 23:29:12,185 epoch 18 - iter 265/2650 - loss 0.03687451 - time (sec): 19.25 - samples/sec: 7619.91 - lr: 0.100000
2023-04-05 23:29:31,517 epoch 18 - iter 530/2650 - loss 0.03767030 - time (sec): 38.58 - samples/sec: 7536.49 - lr: 0.100000
2023-04-05 23:29:50,801 epoch 18 - iter 795/2650 - loss 0.03800426 - time (sec): 57.86 - samples/sec: 7543.23 - lr: 0.100000
2023-04-05 23:30:10,920 epoch 18 - iter 1060/2650 - loss 0.03805599 - time (sec): 77.98 - samples/sec: 7499.18 - lr: 0.100000
2023-04-05 23:30:30,337 epoch 18 - iter 1325/2650 - loss 0.03840052 - time (sec): 97.40 - samples/sec: 7513.44 - lr: 0.100000
2023-04-05 23:30:51,052 epoch 18 - iter 1590/2650 - loss 0.03833296 - time (sec): 118.11 - samples/sec: 7421.03 - lr: 0.100000
2023-04-05 23:31:10,108 epoch 18 - iter 1855/2650 - loss 0.03819256 - time (sec): 137.17 - samples/sec: 7441.23 - lr: 0.100000
2023-04-05 23:31:29,855 epoch 18 - iter 2120/2650 - loss 0.03839978 - time (sec): 156.92 - samples/sec: 7444.39 - lr: 0.100000
2023-04-05 23:31:50,005 epoch 18 - iter 2385/2650 - loss 0.03851343 - time (sec): 177.07 - samples/sec: 7429.91 - lr: 0.100000
2023-04-05 23:32:09,711 epoch 18 - iter 2650/2650 - loss 0.03872445 - time (sec): 196.77 - samples/sec: 7431.71 - lr: 0.100000
2023-04-05 23:32:09,712 ----------------------------------------------------------------------------------------------------
2023-04-05 23:32:09,712 EPOCH 18 done: loss 0.0387 - lr 0.100000
2023-04-05 23:32:09,712 BAD EPOCHS (no improvement): 0
2023-04-05 23:32:09,715 ----------------------------------------------------------------------------------------------------
2023-04-05 23:32:30,213 epoch 19 - iter 265/2650 - loss 0.03884484 - time (sec): 20.50 - samples/sec: 7228.96 - lr: 0.100000
2023-04-05 23:32:50,229 epoch 19 - iter 530/2650 - loss 0.03841242 - time (sec): 40.51 - samples/sec: 7293.71 - lr: 0.100000
2023-04-05 23:33:10,451 epoch 19 - iter 795/2650 - loss 0.03815781 - time (sec): 60.74 - samples/sec: 7303.40 - lr: 0.100000
2023-04-05 23:33:30,064 epoch 19 - iter 1060/2650 - loss 0.03750413 - time (sec): 80.35 - samples/sec: 7348.77 - lr: 0.100000
2023-04-05 23:33:49,691 epoch 19 - iter 1325/2650 - loss 0.03731076 - time (sec): 99.98 - samples/sec: 7362.88 - lr: 0.100000
2023-04-05 23:34:09,257 epoch 19 - iter 1590/2650 - loss 0.03771377 - time (sec): 119.54 - samples/sec: 7370.16 - lr: 0.100000
2023-04-05 23:34:28,844 epoch 19 - iter 1855/2650 - loss 0.03810818 - time (sec): 139.13 - samples/sec: 7376.18 - lr: 0.100000
2023-04-05 23:34:48,041 epoch 19 - iter 2120/2650 - loss 0.03821494 - time (sec): 158.33 - samples/sec: 7397.75 - lr: 0.100000
2023-04-05 23:35:07,460 epoch 19 - iter 2385/2650 - loss 0.03813717 - time (sec): 177.74 - samples/sec: 7400.15 - lr: 0.100000
2023-04-05 23:35:26,939 epoch 19 - iter 2650/2650 - loss 0.03823628 - time (sec): 197.22 - samples/sec: 7414.72 - lr: 0.100000
2023-04-05 23:35:26,940 ----------------------------------------------------------------------------------------------------
2023-04-05 23:35:26,940 EPOCH 19 done: loss 0.0382 - lr 0.100000
2023-04-05 23:35:26,940 BAD EPOCHS (no improvement): 0
2023-04-05 23:35:26,943 ----------------------------------------------------------------------------------------------------
2023-04-05 23:35:46,625 epoch 20 - iter 265/2650 - loss 0.03559866 - time (sec): 19.68 - samples/sec: 7427.06 - lr: 0.100000
2023-04-05 23:36:06,472 epoch 20 - iter 530/2650 - loss 0.03592580 - time (sec): 39.53 - samples/sec: 7373.49 - lr: 0.100000
2023-04-05 23:36:25,896 epoch 20 - iter 795/2650 - loss 0.03698562 - time (sec): 58.95 - samples/sec: 7401.61 - lr: 0.100000
2023-04-05 23:36:45,925 epoch 20 - iter 1060/2650 - loss 0.03742835 - time (sec): 78.98 - samples/sec: 7369.34 - lr: 0.100000
2023-04-05 23:37:05,971 epoch 20 - iter 1325/2650 - loss 0.03731818 - time (sec): 99.03 - samples/sec: 7367.01 - lr: 0.100000
2023-04-05 23:37:25,748 epoch 20 - iter 1590/2650 - loss 0.03746228 - time (sec): 118.80 - samples/sec: 7391.55 - lr: 0.100000
2023-04-05 23:37:45,220 epoch 20 - iter 1855/2650 - loss 0.03786424 - time (sec): 138.28 - samples/sec: 7404.92 - lr: 0.100000
2023-04-05 23:38:05,059 epoch 20 - iter 2120/2650 - loss 0.03780190 - time (sec): 158.12 - samples/sec: 7403.73 - lr: 0.100000
2023-04-05 23:38:24,656 epoch 20 - iter 2385/2650 - loss 0.03783936 - time (sec): 177.71 - samples/sec: 7407.09 - lr: 0.100000
2023-04-05 23:38:44,500 epoch 20 - iter 2650/2650 - loss 0.03785266 - time (sec): 197.56 - samples/sec: 7402.23 - lr: 0.100000
2023-04-05 23:38:44,500 ----------------------------------------------------------------------------------------------------
2023-04-05 23:38:44,500 EPOCH 20 done: loss 0.0379 - lr 0.100000
2023-04-05 23:38:44,501 BAD EPOCHS (no improvement): 0
2023-04-05 23:38:44,504 ----------------------------------------------------------------------------------------------------
2023-04-05 23:39:04,676 epoch 21 - iter 265/2650 - loss 0.03780550 - time (sec): 20.17 - samples/sec: 7322.75 - lr: 0.100000
2023-04-05 23:39:24,449 epoch 21 - iter 530/2650 - loss 0.03773684 - time (sec): 39.95 - samples/sec: 7384.12 - lr: 0.100000
2023-04-05 23:39:43,861 epoch 21 - iter 795/2650 - loss 0.03737256 - time (sec): 59.36 - samples/sec: 7420.77 - lr: 0.100000
2023-04-05 23:40:03,746 epoch 21 - iter 1060/2650 - loss 0.03715455 - time (sec): 79.24 - samples/sec: 7376.67 - lr: 0.100000
2023-04-05 23:40:23,344 epoch 21 - iter 1325/2650 - loss 0.03741216 - time (sec): 98.84 - samples/sec: 7389.94 - lr: 0.100000
2023-04-05 23:40:43,239 epoch 21 - iter 1590/2650 - loss 0.03731411 - time (sec): 118.74 - samples/sec: 7394.94 - lr: 0.100000
2023-04-05 23:41:03,529 epoch 21 - iter 1855/2650 - loss 0.03728695 - time (sec): 139.03 - samples/sec: 7374.52 - lr: 0.100000
2023-04-05 23:41:22,750 epoch 21 - iter 2120/2650 - loss 0.03712129 - time (sec): 158.25 - samples/sec: 7393.62 - lr: 0.100000
2023-04-05 23:41:42,544 epoch 21 - iter 2385/2650 - loss 0.03749829 - time (sec): 178.04 - samples/sec: 7395.76 - lr: 0.100000
2023-04-05 23:42:02,045 epoch 21 - iter 2650/2650 - loss 0.03747642 - time (sec): 197.54 - samples/sec: 7402.82 - lr: 0.100000
2023-04-05 23:42:02,045 ----------------------------------------------------------------------------------------------------
2023-04-05 23:42:02,046 EPOCH 21 done: loss 0.0375 - lr 0.100000
2023-04-05 23:42:02,046 BAD EPOCHS (no improvement): 0
2023-04-05 23:42:02,049 ----------------------------------------------------------------------------------------------------
2023-04-05 23:42:22,281 epoch 22 - iter 265/2650 - loss 0.03542972 - time (sec): 20.23 - samples/sec: 7319.59 - lr: 0.100000
2023-04-05 23:42:41,503 epoch 22 - iter 530/2650 - loss 0.03599258 - time (sec): 39.45 - samples/sec: 7424.71 - lr: 0.100000
2023-04-05 23:43:01,282 epoch 22 - iter 795/2650 - loss 0.03656613 - time (sec): 59.23 - samples/sec: 7434.62 - lr: 0.100000
2023-04-05 23:43:20,772 epoch 22 - iter 1060/2650 - loss 0.03682889 - time (sec): 78.72 - samples/sec: 7457.28 - lr: 0.100000
2023-04-05 23:43:40,627 epoch 22 - iter 1325/2650 - loss 0.03685723 - time (sec): 98.58 - samples/sec: 7419.84 - lr: 0.100000
2023-04-05 23:43:59,838 epoch 22 - iter 1590/2650 - loss 0.03704254 - time (sec): 117.79 - samples/sec: 7440.78 - lr: 0.100000
2023-04-05 23:44:19,305 epoch 22 - iter 1855/2650 - loss 0.03718383 - time (sec): 137.26 - samples/sec: 7442.92 - lr: 0.100000
2023-04-05 23:44:40,018 epoch 22 - iter 2120/2650 - loss 0.03723118 - time (sec): 157.97 - samples/sec: 7418.12 - lr: 0.100000
2023-04-05 23:44:59,794 epoch 22 - iter 2385/2650 - loss 0.03714435 - time (sec): 177.75 - samples/sec: 7410.37 - lr: 0.100000
2023-04-05 23:45:19,182 epoch 22 - iter 2650/2650 - loss 0.03707664 - time (sec): 197.13 - samples/sec: 7418.17 - lr: 0.100000
2023-04-05 23:45:19,182 ----------------------------------------------------------------------------------------------------
2023-04-05 23:45:19,182 EPOCH 22 done: loss 0.0371 - lr 0.100000
2023-04-05 23:45:19,182 BAD EPOCHS (no improvement): 0
2023-04-05 23:45:19,185 ----------------------------------------------------------------------------------------------------
2023-04-05 23:45:38,701 epoch 23 - iter 265/2650 - loss 0.03629860 - time (sec): 19.52 - samples/sec: 7507.41 - lr: 0.100000
2023-04-05 23:45:58,284 epoch 23 - iter 530/2650 - loss 0.03570194 - time (sec): 39.10 - samples/sec: 7512.79 - lr: 0.100000
2023-04-05 23:46:18,368 epoch 23 - iter 795/2650 - loss 0.03589799 - time (sec): 59.18 - samples/sec: 7438.25 - lr: 0.100000
2023-04-05 23:46:37,983 epoch 23 - iter 1060/2650 - loss 0.03606682 - time (sec): 78.80 - samples/sec: 7447.06 - lr: 0.100000
2023-04-05 23:46:57,667 epoch 23 - iter 1325/2650 - loss 0.03658331 - time (sec): 98.48 - samples/sec: 7453.84 - lr: 0.100000
2023-04-05 23:47:17,796 epoch 23 - iter 1590/2650 - loss 0.03661517 - time (sec): 118.61 - samples/sec: 7429.45 - lr: 0.100000
2023-04-05 23:47:37,456 epoch 23 - iter 1855/2650 - loss 0.03669495 - time (sec): 138.27 - samples/sec: 7419.77 - lr: 0.100000
2023-04-05 23:47:57,066 epoch 23 - iter 2120/2650 - loss 0.03664858 - time (sec): 157.88 - samples/sec: 7419.55 - lr: 0.100000
2023-04-05 23:48:17,484 epoch 23 - iter 2385/2650 - loss 0.03688217 - time (sec): 178.30 - samples/sec: 7390.99 - lr: 0.100000
2023-04-05 23:48:36,859 epoch 23 - iter 2650/2650 - loss 0.03685770 - time (sec): 197.67 - samples/sec: 7397.86 - lr: 0.100000
2023-04-05 23:48:36,859 ----------------------------------------------------------------------------------------------------
2023-04-05 23:48:36,859 EPOCH 23 done: loss 0.0369 - lr 0.100000
2023-04-05 23:48:36,859 BAD EPOCHS (no improvement): 0
2023-04-05 23:48:36,862 ----------------------------------------------------------------------------------------------------
2023-04-05 23:48:56,710 epoch 24 - iter 265/2650 - loss 0.03582443 - time (sec): 19.85 - samples/sec: 7340.07 - lr: 0.100000
2023-04-05 23:49:16,397 epoch 24 - iter 530/2650 - loss 0.03522920 - time (sec): 39.54 - samples/sec: 7350.19 - lr: 0.100000
2023-04-05 23:49:36,260 epoch 24 - iter 795/2650 - loss 0.03538745 - time (sec): 59.40 - samples/sec: 7383.65 - lr: 0.100000
2023-04-05 23:49:55,638 epoch 24 - iter 1060/2650 - loss 0.03577330 - time (sec): 78.78 - samples/sec: 7421.88 - lr: 0.100000
2023-04-05 23:50:15,480 epoch 24 - iter 1325/2650 - loss 0.03569175 - time (sec): 98.62 - samples/sec: 7421.43 - lr: 0.100000
2023-04-05 23:50:35,563 epoch 24 - iter 1590/2650 - loss 0.03568417 - time (sec): 118.70 - samples/sec: 7399.17 - lr: 0.100000
2023-04-05 23:50:55,454 epoch 24 - iter 1855/2650 - loss 0.03568288 - time (sec): 138.59 - samples/sec: 7399.89 - lr: 0.100000
2023-04-05 23:51:15,381 epoch 24 - iter 2120/2650 - loss 0.03589099 - time (sec): 158.52 - samples/sec: 7390.31 - lr: 0.100000
2023-04-05 23:51:35,169 epoch 24 - iter 2385/2650 - loss 0.03586016 - time (sec): 178.31 - samples/sec: 7392.10 - lr: 0.100000
2023-04-05 23:51:54,542 epoch 24 - iter 2650/2650 - loss 0.03589277 - time (sec): 197.68 - samples/sec: 7397.63 - lr: 0.100000
2023-04-05 23:51:54,542 ----------------------------------------------------------------------------------------------------
2023-04-05 23:51:54,542 EPOCH 24 done: loss 0.0359 - lr 0.100000
2023-04-05 23:51:54,542 BAD EPOCHS (no improvement): 0
2023-04-05 23:51:54,545 ----------------------------------------------------------------------------------------------------
2023-04-05 23:52:13,903 epoch 25 - iter 265/2650 - loss 0.03532779 - time (sec): 19.36 - samples/sec: 7481.79 - lr: 0.100000
2023-04-05 23:52:33,772 epoch 25 - iter 530/2650 - loss 0.03500281 - time (sec): 39.23 - samples/sec: 7403.86 - lr: 0.100000
2023-04-05 23:52:54,198 epoch 25 - iter 795/2650 - loss 0.03495293 - time (sec): 59.65 - samples/sec: 7327.61 - lr: 0.100000
2023-04-05 23:53:13,691 epoch 25 - iter 1060/2650 - loss 0.03498747 - time (sec): 79.15 - samples/sec: 7358.93 - lr: 0.100000
2023-04-05 23:53:33,806 epoch 25 - iter 1325/2650 - loss 0.03532740 - time (sec): 99.26 - samples/sec: 7345.66 - lr: 0.100000
2023-04-05 23:53:53,645 epoch 25 - iter 1590/2650 - loss 0.03542241 - time (sec): 119.10 - samples/sec: 7362.68 - lr: 0.100000
2023-04-05 23:54:13,488 epoch 25 - iter 1855/2650 - loss 0.03538326 - time (sec): 138.94 - samples/sec: 7371.29 - lr: 0.100000
2023-04-05 23:54:33,354 epoch 25 - iter 2120/2650 - loss 0.03549279 - time (sec): 158.81 - samples/sec: 7367.08 - lr: 0.100000
2023-04-05 23:54:53,339 epoch 25 - iter 2385/2650 - loss 0.03574900 - time (sec): 178.79 - samples/sec: 7362.93 - lr: 0.100000
2023-04-05 23:55:12,630 epoch 25 - iter 2650/2650 - loss 0.03580619 - time (sec): 198.08 - samples/sec: 7382.53 - lr: 0.100000
2023-04-05 23:55:12,630 ----------------------------------------------------------------------------------------------------
2023-04-05 23:55:12,630 EPOCH 25 done: loss 0.0358 - lr 0.100000
2023-04-05 23:55:12,630 BAD EPOCHS (no improvement): 0
2023-04-05 23:55:12,637 ----------------------------------------------------------------------------------------------------
2023-04-05 23:55:31,770 epoch 26 - iter 265/2650 - loss 0.03350377 - time (sec): 19.13 - samples/sec: 7516.35 - lr: 0.100000
2023-04-05 23:55:52,193 epoch 26 - iter 530/2650 - loss 0.03394008 - time (sec): 39.56 - samples/sec: 7342.59 - lr: 0.100000
2023-04-05 23:56:11,865 epoch 26 - iter 795/2650 - loss 0.03414596 - time (sec): 59.23 - samples/sec: 7388.13 - lr: 0.100000
2023-04-05 23:56:31,649 epoch 26 - iter 1060/2650 - loss 0.03446851 - time (sec): 79.01 - samples/sec: 7393.17 - lr: 0.100000
2023-04-05 23:56:51,450 epoch 26 - iter 1325/2650 - loss 0.03429756 - time (sec): 98.81 - samples/sec: 7400.81 - lr: 0.100000
2023-04-05 23:57:11,456 epoch 26 - iter 1590/2650 - loss 0.03468016 - time (sec): 118.82 - samples/sec: 7395.33 - lr: 0.100000
2023-04-05 23:57:30,771 epoch 26 - iter 1855/2650 - loss 0.03473978 - time (sec): 138.13 - samples/sec: 7410.69 - lr: 0.100000
2023-04-05 23:57:50,563 epoch 26 - iter 2120/2650 - loss 0.03505711 - time (sec): 157.93 - samples/sec: 7404.82 - lr: 0.100000
2023-04-05 23:58:10,762 epoch 26 - iter 2385/2650 - loss 0.03506223 - time (sec): 178.12 - samples/sec: 7392.84 - lr: 0.100000
2023-04-05 23:58:30,277 epoch 26 - iter 2650/2650 - loss 0.03529119 - time (sec): 197.64 - samples/sec: 7399.12 - lr: 0.100000
2023-04-05 23:58:30,278 ----------------------------------------------------------------------------------------------------
2023-04-05 23:58:30,278 EPOCH 26 done: loss 0.0353 - lr 0.100000
2023-04-05 23:58:30,278 BAD EPOCHS (no improvement): 0
2023-04-05 23:58:30,281 ----------------------------------------------------------------------------------------------------
2023-04-05 23:58:50,314 epoch 27 - iter 265/2650 - loss 0.03362257 - time (sec): 20.03 - samples/sec: 7335.13 - lr: 0.100000
2023-04-05 23:59:09,741 epoch 27 - iter 530/2650 - loss 0.03399168 - time (sec): 39.46 - samples/sec: 7402.14 - lr: 0.100000
2023-04-05 23:59:28,948 epoch 27 - iter 795/2650 - loss 0.03460693 - time (sec): 58.67 - samples/sec: 7446.46 - lr: 0.100000
2023-04-05 23:59:48,994 epoch 27 - iter 1060/2650 - loss 0.03505005 - time (sec): 78.71 - samples/sec: 7412.61 - lr: 0.100000
2023-04-06 00:00:08,569 epoch 27 - iter 1325/2650 - loss 0.03510471 - time (sec): 98.29 - samples/sec: 7420.29 - lr: 0.100000
2023-04-06 00:00:28,359 epoch 27 - iter 1590/2650 - loss 0.03527188 - time (sec): 118.08 - samples/sec: 7422.11 - lr: 0.100000
2023-04-06 00:00:47,938 epoch 27 - iter 1855/2650 - loss 0.03541833 - time (sec): 137.66 - samples/sec: 7426.88 - lr: 0.100000
2023-04-06 00:01:08,114 epoch 27 - iter 2120/2650 - loss 0.03580167 - time (sec): 157.83 - samples/sec: 7414.12 - lr: 0.100000
2023-04-06 00:01:28,076 epoch 27 - iter 2385/2650 - loss 0.03578883 - time (sec): 177.79 - samples/sec: 7404.56 - lr: 0.100000
2023-04-06 00:01:57,476 epoch 27 - iter 2650/2650 - loss 0.03560065 - time (sec): 207.19 - samples/sec: 7057.91 - lr: 0.100000
2023-04-06 00:01:57,476 ----------------------------------------------------------------------------------------------------
2023-04-06 00:01:57,476 EPOCH 27 done: loss 0.0356 - lr 0.100000
2023-04-06 00:01:57,476 BAD EPOCHS (no improvement): 1
2023-04-06 00:01:57,479 ----------------------------------------------------------------------------------------------------
2023-04-06 00:02:16,947 epoch 28 - iter 265/2650 - loss 0.03376720 - time (sec): 19.47 - samples/sec: 7429.25 - lr: 0.100000
2023-04-06 00:02:36,633 epoch 28 - iter 530/2650 - loss 0.03468018 - time (sec): 39.15 - samples/sec: 7448.92 - lr: 0.100000
2023-04-06 00:02:55,960 epoch 28 - iter 795/2650 - loss 0.03445018 - time (sec): 58.48 - samples/sec: 7474.34 - lr: 0.100000
2023-04-06 00:03:16,013 epoch 28 - iter 1060/2650 - loss 0.03433266 - time (sec): 78.53 - samples/sec: 7424.46 - lr: 0.100000
2023-04-06 00:03:35,844 epoch 28 - iter 1325/2650 - loss 0.03452301 - time (sec): 98.36 - samples/sec: 7411.61 - lr: 0.100000
2023-04-06 00:03:55,088 epoch 28 - iter 1590/2650 - loss 0.03439093 - time (sec): 117.61 - samples/sec: 7437.60 - lr: 0.100000
2023-04-06 00:04:15,301 epoch 28 - iter 1855/2650 - loss 0.03466075 - time (sec): 137.82 - samples/sec: 7412.01 - lr: 0.100000
2023-04-06 00:04:34,590 epoch 28 - iter 2120/2650 - loss 0.03459712 - time (sec): 157.11 - samples/sec: 7427.12 - lr: 0.100000
2023-04-06 00:04:54,572 epoch 28 - iter 2385/2650 - loss 0.03479347 - time (sec): 177.09 - samples/sec: 7422.76 - lr: 0.100000
2023-04-06 00:05:14,650 epoch 28 - iter 2650/2650 - loss 0.03491241 - time (sec): 197.17 - samples/sec: 7416.73 - lr: 0.100000
2023-04-06 00:05:14,650 ----------------------------------------------------------------------------------------------------
2023-04-06 00:05:14,650 EPOCH 28 done: loss 0.0349 - lr 0.100000
2023-04-06 00:05:14,650 BAD EPOCHS (no improvement): 0
2023-04-06 00:05:14,657 ----------------------------------------------------------------------------------------------------
2023-04-06 00:05:34,405 epoch 29 - iter 265/2650 - loss 0.03520691 - time (sec): 19.75 - samples/sec: 7415.80 - lr: 0.100000
2023-04-06 00:05:54,041 epoch 29 - iter 530/2650 - loss 0.03489647 - time (sec): 39.38 - samples/sec: 7421.36 - lr: 0.100000
2023-04-06 00:06:13,849 epoch 29 - iter 795/2650 - loss 0.03457302 - time (sec): 59.19 - samples/sec: 7402.64 - lr: 0.100000
2023-04-06 00:06:33,926 epoch 29 - iter 1060/2650 - loss 0.03435361 - time (sec): 79.27 - samples/sec: 7342.94 - lr: 0.100000
2023-04-06 00:06:53,579 epoch 29 - iter 1325/2650 - loss 0.03411601 - time (sec): 98.92 - samples/sec: 7378.13 - lr: 0.100000
2023-04-06 00:07:13,193 epoch 29 - iter 1590/2650 - loss 0.03390079 - time (sec): 118.54 - samples/sec: 7392.69 - lr: 0.100000
2023-04-06 00:07:33,217 epoch 29 - iter 1855/2650 - loss 0.03380149 - time (sec): 138.56 - samples/sec: 7387.28 - lr: 0.100000
2023-04-06 00:07:52,877 epoch 29 - iter 2120/2650 - loss 0.03393107 - time (sec): 158.22 - samples/sec: 7394.75 - lr: 0.100000
2023-04-06 00:08:12,973 epoch 29 - iter 2385/2650 - loss 0.03424095 - time (sec): 178.32 - samples/sec: 7382.56 - lr: 0.100000
2023-04-06 00:08:32,548 epoch 29 - iter 2650/2650 - loss 0.03451294 - time (sec): 197.89 - samples/sec: 7389.77 - lr: 0.100000
2023-04-06 00:08:32,548 ----------------------------------------------------------------------------------------------------
2023-04-06 00:08:32,548 EPOCH 29 done: loss 0.0345 - lr 0.100000
2023-04-06 00:08:32,548 BAD EPOCHS (no improvement): 0
2023-04-06 00:08:32,552 ----------------------------------------------------------------------------------------------------
2023-04-06 00:08:52,996 epoch 30 - iter 265/2650 - loss 0.03395257 - time (sec): 20.44 - samples/sec: 7169.77 - lr: 0.100000
2023-04-06 00:09:12,646 epoch 30 - iter 530/2650 - loss 0.03399807 - time (sec): 40.09 - samples/sec: 7309.66 - lr: 0.100000
2023-04-06 00:09:32,149 epoch 30 - iter 795/2650 - loss 0.03387434 - time (sec): 59.60 - samples/sec: 7379.49 - lr: 0.100000
2023-04-06 00:09:51,642 epoch 30 - iter 1060/2650 - loss 0.03376718 - time (sec): 79.09 - samples/sec: 7394.78 - lr: 0.100000
2023-04-06 00:10:10,841 epoch 30 - iter 1325/2650 - loss 0.03378417 - time (sec): 98.29 - samples/sec: 7418.95 - lr: 0.100000
2023-04-06 00:10:30,969 epoch 30 - iter 1590/2650 - loss 0.03420691 - time (sec): 118.42 - samples/sec: 7412.24 - lr: 0.100000
2023-04-06 00:10:51,481 epoch 30 - iter 1855/2650 - loss 0.03415484 - time (sec): 138.93 - samples/sec: 7380.61 - lr: 0.100000
2023-04-06 00:11:11,685 epoch 30 - iter 2120/2650 - loss 0.03428628 - time (sec): 159.13 - samples/sec: 7362.62 - lr: 0.100000
2023-04-06 00:11:30,571 epoch 30 - iter 2385/2650 - loss 0.03438469 - time (sec): 178.02 - samples/sec: 7396.79 - lr: 0.100000
2023-04-06 00:11:49,934 epoch 30 - iter 2650/2650 - loss 0.03446056 - time (sec): 197.38 - samples/sec: 7408.81 - lr: 0.100000
2023-04-06 00:11:49,934 ----------------------------------------------------------------------------------------------------
2023-04-06 00:11:49,934 EPOCH 30 done: loss 0.0345 - lr 0.100000
2023-04-06 00:11:49,934 BAD EPOCHS (no improvement): 0
2023-04-06 00:11:49,937 ----------------------------------------------------------------------------------------------------
2023-04-06 00:12:09,552 epoch 31 - iter 265/2650 - loss 0.03284975 - time (sec): 19.62 - samples/sec: 7446.77 - lr: 0.100000
2023-04-06 00:12:28,505 epoch 31 - iter 530/2650 - loss 0.03379530 - time (sec): 38.57 - samples/sec: 7525.61 - lr: 0.100000
2023-04-06 00:12:48,101 epoch 31 - iter 795/2650 - loss 0.03398737 - time (sec): 58.16 - samples/sec: 7500.71 - lr: 0.100000
2023-04-06 00:13:07,682 epoch 31 - iter 1060/2650 - loss 0.03362497 - time (sec): 77.75 - samples/sec: 7482.14 - lr: 0.100000
2023-04-06 00:13:27,001 epoch 31 - iter 1325/2650 - loss 0.03324185 - time (sec): 97.06 - samples/sec: 7503.81 - lr: 0.100000
2023-04-06 00:13:46,356 epoch 31 - iter 1590/2650 - loss 0.03352791 - time (sec): 116.42 - samples/sec: 7511.85 - lr: 0.100000
2023-04-06 00:14:06,553 epoch 31 - iter 1855/2650 - loss 0.03357960 - time (sec): 136.62 - samples/sec: 7488.90 - lr: 0.100000
2023-04-06 00:14:26,798 epoch 31 - iter 2120/2650 - loss 0.03383146 - time (sec): 156.86 - samples/sec: 7456.87 - lr: 0.100000
2023-04-06 00:14:46,232 epoch 31 - iter 2385/2650 - loss 0.03383897 - time (sec): 176.29 - samples/sec: 7458.15 - lr: 0.100000
2023-04-06 00:15:06,225 epoch 31 - iter 2650/2650 - loss 0.03414900 - time (sec): 196.29 - samples/sec: 7450.11 - lr: 0.100000
2023-04-06 00:15:06,225 ----------------------------------------------------------------------------------------------------
2023-04-06 00:15:06,225 EPOCH 31 done: loss 0.0341 - lr 0.100000
2023-04-06 00:15:06,225 BAD EPOCHS (no improvement): 0
2023-04-06 00:15:06,228 ----------------------------------------------------------------------------------------------------
2023-04-06 00:15:26,099 epoch 32 - iter 265/2650 - loss 0.03147212 - time (sec): 19.87 - samples/sec: 7409.54 - lr: 0.100000
2023-04-06 00:15:45,471 epoch 32 - iter 530/2650 - loss 0.03254038 - time (sec): 39.24 - samples/sec: 7419.60 - lr: 0.100000
2023-04-06 00:16:05,580 epoch 32 - iter 795/2650 - loss 0.03283877 - time (sec): 59.35 - samples/sec: 7386.18 - lr: 0.100000
2023-04-06 00:16:25,099 epoch 32 - iter 1060/2650 - loss 0.03341404 - time (sec): 78.87 - samples/sec: 7401.66 - lr: 0.100000
2023-04-06 00:16:44,253 epoch 32 - iter 1325/2650 - loss 0.03331500 - time (sec): 98.02 - samples/sec: 7437.30 - lr: 0.100000
2023-04-06 00:17:04,537 epoch 32 - iter 1590/2650 - loss 0.03321410 - time (sec): 118.31 - samples/sec: 7414.28 - lr: 0.100000
2023-04-06 00:17:24,099 epoch 32 - iter 1855/2650 - loss 0.03327462 - time (sec): 137.87 - samples/sec: 7426.39 - lr: 0.100000
2023-04-06 00:17:43,875 epoch 32 - iter 2120/2650 - loss 0.03345272 - time (sec): 157.65 - samples/sec: 7427.04 - lr: 0.100000
2023-04-06 00:18:04,089 epoch 32 - iter 2385/2650 - loss 0.03349502 - time (sec): 177.86 - samples/sec: 7405.32 - lr: 0.100000
2023-04-06 00:18:23,777 epoch 32 - iter 2650/2650 - loss 0.03352724 - time (sec): 197.55 - samples/sec: 7402.55 - lr: 0.100000
2023-04-06 00:18:23,777 ----------------------------------------------------------------------------------------------------
2023-04-06 00:18:23,777 EPOCH 32 done: loss 0.0335 - lr 0.100000
2023-04-06 00:18:23,777 BAD EPOCHS (no improvement): 0
2023-04-06 00:18:23,781 ----------------------------------------------------------------------------------------------------
2023-04-06 00:18:43,620 epoch 33 - iter 265/2650 - loss 0.03293303 - time (sec): 19.84 - samples/sec: 7451.97 - lr: 0.100000
2023-04-06 00:19:04,013 epoch 33 - iter 530/2650 - loss 0.03251799 - time (sec): 40.23 - samples/sec: 7300.16 - lr: 0.100000
2023-04-06 00:19:23,642 epoch 33 - iter 795/2650 - loss 0.03294772 - time (sec): 59.86 - samples/sec: 7345.09 - lr: 0.100000
2023-04-06 00:19:42,895 epoch 33 - iter 1060/2650 - loss 0.03348348 - time (sec): 79.11 - samples/sec: 7409.27 - lr: 0.100000
2023-04-06 00:20:02,713 epoch 33 - iter 1325/2650 - loss 0.03377057 - time (sec): 98.93 - samples/sec: 7403.47 - lr: 0.100000
2023-04-06 00:20:21,899 epoch 33 - iter 1590/2650 - loss 0.03371583 - time (sec): 118.12 - samples/sec: 7436.49 - lr: 0.100000
2023-04-06 00:20:41,457 epoch 33 - iter 1855/2650 - loss 0.03416892 - time (sec): 137.68 - samples/sec: 7427.54 - lr: 0.100000
2023-04-06 00:21:01,231 epoch 33 - iter 2120/2650 - loss 0.03417072 - time (sec): 157.45 - samples/sec: 7428.38 - lr: 0.100000
2023-04-06 00:21:21,101 epoch 33 - iter 2385/2650 - loss 0.03407835 - time (sec): 177.32 - samples/sec: 7423.52 - lr: 0.100000
2023-04-06 00:21:40,710 epoch 33 - iter 2650/2650 - loss 0.03383043 - time (sec): 196.93 - samples/sec: 7425.85 - lr: 0.100000
2023-04-06 00:21:40,711 ----------------------------------------------------------------------------------------------------
2023-04-06 00:21:40,711 EPOCH 33 done: loss 0.0338 - lr 0.100000
2023-04-06 00:21:40,711 BAD EPOCHS (no improvement): 1
2023-04-06 00:21:40,714 ----------------------------------------------------------------------------------------------------
2023-04-06 00:22:00,161 epoch 34 - iter 265/2650 - loss 0.03278319 - time (sec): 19.45 - samples/sec: 7497.91 - lr: 0.100000
2023-04-06 00:22:19,630 epoch 34 - iter 530/2650 - loss 0.03157491 - time (sec): 38.92 - samples/sec: 7545.00 - lr: 0.100000
2023-04-06 00:22:39,592 epoch 34 - iter 795/2650 - loss 0.03197803 - time (sec): 58.88 - samples/sec: 7480.62 - lr: 0.100000
2023-04-06 00:22:59,215 epoch 34 - iter 1060/2650 - loss 0.03199495 - time (sec): 78.50 - samples/sec: 7443.87 - lr: 0.100000
2023-04-06 00:23:19,204 epoch 34 - iter 1325/2650 - loss 0.03200690 - time (sec): 98.49 - samples/sec: 7428.28 - lr: 0.100000
2023-04-06 00:23:39,422 epoch 34 - iter 1590/2650 - loss 0.03210826 - time (sec): 118.71 - samples/sec: 7404.68 - lr: 0.100000
2023-04-06 00:23:58,385 epoch 34 - iter 1855/2650 - loss 0.03260764 - time (sec): 137.67 - samples/sec: 7425.00 - lr: 0.100000
2023-04-06 00:24:17,848 epoch 34 - iter 2120/2650 - loss 0.03303662 - time (sec): 157.13 - samples/sec: 7435.83 - lr: 0.100000
2023-04-06 00:24:37,227 epoch 34 - iter 2385/2650 - loss 0.03309589 - time (sec): 176.51 - samples/sec: 7454.00 - lr: 0.100000
2023-04-06 00:24:57,498 epoch 34 - iter 2650/2650 - loss 0.03331709 - time (sec): 196.78 - samples/sec: 7431.32 - lr: 0.100000
2023-04-06 00:24:57,498 ----------------------------------------------------------------------------------------------------
2023-04-06 00:24:57,498 EPOCH 34 done: loss 0.0333 - lr 0.100000
2023-04-06 00:24:57,498 BAD EPOCHS (no improvement): 0
2023-04-06 00:24:57,502 ----------------------------------------------------------------------------------------------------
2023-04-06 00:25:16,904 epoch 35 - iter 265/2650 - loss 0.03371517 - time (sec): 19.40 - samples/sec: 7495.97 - lr: 0.100000
2023-04-06 00:25:36,441 epoch 35 - iter 530/2650 - loss 0.03327525 - time (sec): 38.94 - samples/sec: 7496.87 - lr: 0.100000
2023-04-06 00:25:56,859 epoch 35 - iter 795/2650 - loss 0.03318869 - time (sec): 59.36 - samples/sec: 7395.84 - lr: 0.100000
2023-04-06 00:26:15,956 epoch 35 - iter 1060/2650 - loss 0.03318483 - time (sec): 78.45 - samples/sec: 7456.93 - lr: 0.100000
2023-04-06 00:26:35,646 epoch 35 - iter 1325/2650 - loss 0.03278300 - time (sec): 98.14 - samples/sec: 7448.75 - lr: 0.100000
2023-04-06 00:26:55,552 epoch 35 - iter 1590/2650 - loss 0.03281379 - time (sec): 118.05 - samples/sec: 7430.99 - lr: 0.100000
2023-04-06 00:27:15,148 epoch 35 - iter 1855/2650 - loss 0.03312041 - time (sec): 137.65 - samples/sec: 7441.62 - lr: 0.100000
2023-04-06 00:27:34,639 epoch 35 - iter 2120/2650 - loss 0.03325428 - time (sec): 157.14 - samples/sec: 7452.69 - lr: 0.100000
2023-04-06 00:27:54,506 epoch 35 - iter 2385/2650 - loss 0.03341533 - time (sec): 177.00 - samples/sec: 7436.04 - lr: 0.100000
2023-04-06 00:28:14,146 epoch 35 - iter 2650/2650 - loss 0.03334312 - time (sec): 196.64 - samples/sec: 7436.61 - lr: 0.100000
2023-04-06 00:28:14,146 ----------------------------------------------------------------------------------------------------
2023-04-06 00:28:14,146 EPOCH 35 done: loss 0.0333 - lr 0.100000
2023-04-06 00:28:14,146 BAD EPOCHS (no improvement): 1
2023-04-06 00:28:14,150 ----------------------------------------------------------------------------------------------------
2023-04-06 00:28:33,897 epoch 36 - iter 265/2650 - loss 0.03222855 - time (sec): 19.75 - samples/sec: 7394.01 - lr: 0.100000
2023-04-06 00:28:53,926 epoch 36 - iter 530/2650 - loss 0.03294347 - time (sec): 39.78 - samples/sec: 7367.28 - lr: 0.100000
2023-04-06 00:29:13,800 epoch 36 - iter 795/2650 - loss 0.03294840 - time (sec): 59.65 - samples/sec: 7354.79 - lr: 0.100000
2023-04-06 00:29:33,708 epoch 36 - iter 1060/2650 - loss 0.03310192 - time (sec): 79.56 - samples/sec: 7363.33 - lr: 0.100000
2023-04-06 00:29:53,294 epoch 36 - iter 1325/2650 - loss 0.03333328 - time (sec): 99.14 - samples/sec: 7384.62 - lr: 0.100000
2023-04-06 00:30:13,575 epoch 36 - iter 1590/2650 - loss 0.03332831 - time (sec): 119.42 - samples/sec: 7355.71 - lr: 0.100000
2023-04-06 00:30:33,209 epoch 36 - iter 1855/2650 - loss 0.03327820 - time (sec): 139.06 - samples/sec: 7379.34 - lr: 0.100000
2023-04-06 00:30:52,698 epoch 36 - iter 2120/2650 - loss 0.03324384 - time (sec): 158.55 - samples/sec: 7384.65 - lr: 0.100000
2023-04-06 00:31:12,092 epoch 36 - iter 2385/2650 - loss 0.03315761 - time (sec): 177.94 - samples/sec: 7399.17 - lr: 0.100000
2023-04-06 00:31:31,774 epoch 36 - iter 2650/2650 - loss 0.03329781 - time (sec): 197.62 - samples/sec: 7399.72 - lr: 0.100000
2023-04-06 00:31:31,775 ----------------------------------------------------------------------------------------------------
2023-04-06 00:31:31,775 EPOCH 36 done: loss 0.0333 - lr 0.100000
2023-04-06 00:31:31,775 BAD EPOCHS (no improvement): 0
2023-04-06 00:31:31,779 ----------------------------------------------------------------------------------------------------
2023-04-06 00:31:51,300 epoch 37 - iter 265/2650 - loss 0.03098230 - time (sec): 19.52 - samples/sec: 7518.36 - lr: 0.100000
2023-04-06 00:32:11,094 epoch 37 - iter 530/2650 - loss 0.03164831 - time (sec): 39.32 - samples/sec: 7465.97 - lr: 0.100000
2023-04-06 00:32:31,379 epoch 37 - iter 795/2650 - loss 0.03213314 - time (sec): 59.60 - samples/sec: 7383.20 - lr: 0.100000
2023-04-06 00:32:51,399 epoch 37 - iter 1060/2650 - loss 0.03273908 - time (sec): 79.62 - samples/sec: 7355.85 - lr: 0.100000
2023-04-06 00:33:10,812 epoch 37 - iter 1325/2650 - loss 0.03233362 - time (sec): 99.03 - samples/sec: 7380.96 - lr: 0.100000
2023-04-06 00:33:30,381 epoch 37 - iter 1590/2650 - loss 0.03244750 - time (sec): 118.60 - samples/sec: 7388.45 - lr: 0.100000
2023-04-06 00:33:50,354 epoch 37 - iter 1855/2650 - loss 0.03242242 - time (sec): 138.58 - samples/sec: 7385.99 - lr: 0.100000
2023-04-06 00:34:10,547 epoch 37 - iter 2120/2650 - loss 0.03220163 - time (sec): 158.77 - samples/sec: 7379.11 - lr: 0.100000
2023-04-06 00:34:29,902 epoch 37 - iter 2385/2650 - loss 0.03234543 - time (sec): 178.12 - samples/sec: 7393.21 - lr: 0.100000
2023-04-06 00:34:49,528 epoch 37 - iter 2650/2650 - loss 0.03254981 - time (sec): 197.75 - samples/sec: 7395.03 - lr: 0.100000
2023-04-06 00:34:49,529 ----------------------------------------------------------------------------------------------------
2023-04-06 00:34:49,529 EPOCH 37 done: loss 0.0325 - lr 0.100000
2023-04-06 00:34:49,529 BAD EPOCHS (no improvement): 0
2023-04-06 00:34:49,532 ----------------------------------------------------------------------------------------------------
2023-04-06 00:35:09,720 epoch 38 - iter 265/2650 - loss 0.03299786 - time (sec): 20.19 - samples/sec: 7313.39 - lr: 0.100000
2023-04-06 00:35:29,227 epoch 38 - iter 530/2650 - loss 0.03246811 - time (sec): 39.70 - samples/sec: 7404.89 - lr: 0.100000
2023-04-06 00:35:49,251 epoch 38 - iter 795/2650 - loss 0.03282325 - time (sec): 59.72 - samples/sec: 7357.62 - lr: 0.100000
2023-04-06 00:36:08,965 epoch 38 - iter 1060/2650 - loss 0.03258656 - time (sec): 79.43 - samples/sec: 7367.71 - lr: 0.100000
2023-04-06 00:36:28,845 epoch 38 - iter 1325/2650 - loss 0.03257285 - time (sec): 99.31 - samples/sec: 7374.48 - lr: 0.100000
2023-04-06 00:36:48,700 epoch 38 - iter 1590/2650 - loss 0.03260684 - time (sec): 119.17 - samples/sec: 7375.71 - lr: 0.100000
2023-04-06 00:37:08,370 epoch 38 - iter 1855/2650 - loss 0.03263233 - time (sec): 138.84 - samples/sec: 7379.81 - lr: 0.100000
2023-04-06 00:37:27,968 epoch 38 - iter 2120/2650 - loss 0.03250047 - time (sec): 158.44 - samples/sec: 7385.57 - lr: 0.100000
2023-04-06 00:37:47,872 epoch 38 - iter 2385/2650 - loss 0.03254518 - time (sec): 178.34 - samples/sec: 7383.94 - lr: 0.100000
2023-04-06 00:38:07,359 epoch 38 - iter 2650/2650 - loss 0.03256946 - time (sec): 197.83 - samples/sec: 7392.10 - lr: 0.100000
2023-04-06 00:38:07,360 ----------------------------------------------------------------------------------------------------
2023-04-06 00:38:07,360 EPOCH 38 done: loss 0.0326 - lr 0.100000
2023-04-06 00:38:07,360 BAD EPOCHS (no improvement): 1
2023-04-06 00:38:07,363 ----------------------------------------------------------------------------------------------------
2023-04-06 00:38:27,051 epoch 39 - iter 265/2650 - loss 0.03232815 - time (sec): 19.69 - samples/sec: 7345.86 - lr: 0.100000
2023-04-06 00:38:46,782 epoch 39 - iter 530/2650 - loss 0.03211567 - time (sec): 39.42 - samples/sec: 7403.54 - lr: 0.100000
2023-04-06 00:39:06,599 epoch 39 - iter 795/2650 - loss 0.03195153 - time (sec): 59.24 - samples/sec: 7409.57 - lr: 0.100000
2023-04-06 00:39:26,775 epoch 39 - iter 1060/2650 - loss 0.03162712 - time (sec): 79.41 - samples/sec: 7379.61 - lr: 0.100000
2023-04-06 00:39:46,523 epoch 39 - iter 1325/2650 - loss 0.03230874 - time (sec): 99.16 - samples/sec: 7379.12 - lr: 0.100000
2023-04-06 00:40:06,130 epoch 39 - iter 1590/2650 - loss 0.03233712 - time (sec): 118.77 - samples/sec: 7393.02 - lr: 0.100000
2023-04-06 00:40:25,864 epoch 39 - iter 1855/2650 - loss 0.03245259 - time (sec): 138.50 - samples/sec: 7394.45 - lr: 0.100000
2023-04-06 00:40:45,769 epoch 39 - iter 2120/2650 - loss 0.03263103 - time (sec): 158.41 - samples/sec: 7388.52 - lr: 0.100000
2023-04-06 00:41:05,497 epoch 39 - iter 2385/2650 - loss 0.03265201 - time (sec): 178.13 - samples/sec: 7398.60 - lr: 0.100000
2023-04-06 00:41:24,989 epoch 39 - iter 2650/2650 - loss 0.03264164 - time (sec): 197.63 - samples/sec: 7399.66 - lr: 0.100000
2023-04-06 00:41:24,989 ----------------------------------------------------------------------------------------------------
2023-04-06 00:41:24,989 EPOCH 39 done: loss 0.0326 - lr 0.100000
2023-04-06 00:41:24,990 BAD EPOCHS (no improvement): 2
2023-04-06 00:41:24,993 ----------------------------------------------------------------------------------------------------
2023-04-06 00:41:45,177 epoch 40 - iter 265/2650 - loss 0.03287872 - time (sec): 20.18 - samples/sec: 7383.20 - lr: 0.100000
2023-04-06 00:42:04,779 epoch 40 - iter 530/2650 - loss 0.03217006 - time (sec): 39.79 - samples/sec: 7402.31 - lr: 0.100000
2023-04-06 00:42:24,709 epoch 40 - iter 795/2650 - loss 0.03147888 - time (sec): 59.72 - samples/sec: 7373.96 - lr: 0.100000
2023-04-06 00:42:44,139 epoch 40 - iter 1060/2650 - loss 0.03181766 - time (sec): 79.15 - samples/sec: 7401.65 - lr: 0.100000
2023-04-06 00:43:03,362 epoch 40 - iter 1325/2650 - loss 0.03176037 - time (sec): 98.37 - samples/sec: 7429.62 - lr: 0.100000
2023-04-06 00:43:22,956 epoch 40 - iter 1590/2650 - loss 0.03192259 - time (sec): 117.96 - samples/sec: 7431.03 - lr: 0.100000
2023-04-06 00:43:42,668 epoch 40 - iter 1855/2650 - loss 0.03229649 - time (sec): 137.68 - samples/sec: 7434.31 - lr: 0.100000
2023-04-06 00:44:01,895 epoch 40 - iter 2120/2650 - loss 0.03235237 - time (sec): 156.90 - samples/sec: 7453.33 - lr: 0.100000
2023-04-06 00:44:21,789 epoch 40 - iter 2385/2650 - loss 0.03268056 - time (sec): 176.80 - samples/sec: 7449.19 - lr: 0.100000
2023-04-06 00:44:41,159 epoch 40 - iter 2650/2650 - loss 0.03280627 - time (sec): 196.17 - samples/sec: 7454.72 - lr: 0.100000
2023-04-06 00:44:41,160 ----------------------------------------------------------------------------------------------------
2023-04-06 00:44:41,160 EPOCH 40 done: loss 0.0328 - lr 0.100000
2023-04-06 00:44:41,160 BAD EPOCHS (no improvement): 3
2023-04-06 00:44:41,163 ----------------------------------------------------------------------------------------------------
2023-04-06 00:45:00,702 epoch 41 - iter 265/2650 - loss 0.03174940 - time (sec): 19.54 - samples/sec: 7506.79 - lr: 0.100000
2023-04-06 00:45:19,887 epoch 41 - iter 530/2650 - loss 0.03177356 - time (sec): 38.72 - samples/sec: 7538.45 - lr: 0.100000
2023-04-06 00:45:39,151 epoch 41 - iter 795/2650 - loss 0.03247711 - time (sec): 57.99 - samples/sec: 7542.75 - lr: 0.100000
2023-04-06 00:45:58,892 epoch 41 - iter 1060/2650 - loss 0.03228117 - time (sec): 77.73 - samples/sec: 7519.18 - lr: 0.100000
2023-04-06 00:46:18,378 epoch 41 - iter 1325/2650 - loss 0.03205756 - time (sec): 97.21 - samples/sec: 7512.14 - lr: 0.100000
2023-04-06 00:46:37,887 epoch 41 - iter 1590/2650 - loss 0.03237677 - time (sec): 116.72 - samples/sec: 7510.67 - lr: 0.100000
2023-04-06 00:46:57,176 epoch 41 - iter 1855/2650 - loss 0.03213207 - time (sec): 136.01 - samples/sec: 7512.22 - lr: 0.100000
2023-04-06 00:47:17,212 epoch 41 - iter 2120/2650 - loss 0.03251258 - time (sec): 156.05 - samples/sec: 7503.62 - lr: 0.100000
2023-04-06 00:47:37,218 epoch 41 - iter 2385/2650 - loss 0.03257332 - time (sec): 176.05 - samples/sec: 7483.69 - lr: 0.100000
2023-04-06 00:47:56,229 epoch 41 - iter 2650/2650 - loss 0.03272240 - time (sec): 195.07 - samples/sec: 7496.76 - lr: 0.100000
2023-04-06 00:47:56,229 ----------------------------------------------------------------------------------------------------
2023-04-06 00:47:56,229 EPOCH 41 done: loss 0.0327 - lr 0.100000
2023-04-06 00:47:56,229 Epoch 41: reducing learning rate of group 0 to 5.0000e-02.
2023-04-06 00:47:56,229 BAD EPOCHS (no improvement): 4
2023-04-06 00:47:56,232 ----------------------------------------------------------------------------------------------------
2023-04-06 00:48:15,885 epoch 42 - iter 265/2650 - loss 0.02955627 - time (sec): 19.65 - samples/sec: 7414.30 - lr: 0.050000
2023-04-06 00:48:35,659 epoch 42 - iter 530/2650 - loss 0.03067219 - time (sec): 39.43 - samples/sec: 7497.40 - lr: 0.050000
2023-04-06 00:48:54,990 epoch 42 - iter 795/2650 - loss 0.03056294 - time (sec): 58.76 - samples/sec: 7471.31 - lr: 0.050000
2023-04-06 00:49:14,285 epoch 42 - iter 1060/2650 - loss 0.03019372 - time (sec): 78.05 - samples/sec: 7505.97 - lr: 0.050000
2023-04-06 00:49:33,798 epoch 42 - iter 1325/2650 - loss 0.03022652 - time (sec): 97.57 - samples/sec: 7504.12 - lr: 0.050000
2023-04-06 00:49:53,249 epoch 42 - iter 1590/2650 - loss 0.02997870 - time (sec): 117.02 - samples/sec: 7509.38 - lr: 0.050000
2023-04-06 00:50:12,663 epoch 42 - iter 1855/2650 - loss 0.02969903 - time (sec): 136.43 - samples/sec: 7508.86 - lr: 0.050000
2023-04-06 00:50:32,607 epoch 42 - iter 2120/2650 - loss 0.02975692 - time (sec): 156.37 - samples/sec: 7490.84 - lr: 0.050000
2023-04-06 00:50:52,034 epoch 42 - iter 2385/2650 - loss 0.02977858 - time (sec): 175.80 - samples/sec: 7493.21 - lr: 0.050000
2023-04-06 00:51:10,965 epoch 42 - iter 2650/2650 - loss 0.02985402 - time (sec): 194.73 - samples/sec: 7509.59 - lr: 0.050000
2023-04-06 00:51:10,965 ----------------------------------------------------------------------------------------------------
2023-04-06 00:51:10,965 EPOCH 42 done: loss 0.0299 - lr 0.050000
2023-04-06 00:51:10,965 BAD EPOCHS (no improvement): 0
2023-04-06 00:51:10,968 ----------------------------------------------------------------------------------------------------
2023-04-06 00:51:31,043 epoch 43 - iter 265/2650 - loss 0.02799119 - time (sec): 20.07 - samples/sec: 7386.23 - lr: 0.050000
2023-04-06 00:51:50,322 epoch 43 - iter 530/2650 - loss 0.02850278 - time (sec): 39.35 - samples/sec: 7450.60 - lr: 0.050000
2023-04-06 00:52:09,860 epoch 43 - iter 795/2650 - loss 0.02804902 - time (sec): 58.89 - samples/sec: 7485.08 - lr: 0.050000
2023-04-06 00:52:28,808 epoch 43 - iter 1060/2650 - loss 0.02836221 - time (sec): 77.84 - samples/sec: 7556.55 - lr: 0.050000
2023-04-06 00:52:48,170 epoch 43 - iter 1325/2650 - loss 0.02851280 - time (sec): 97.20 - samples/sec: 7541.96 - lr: 0.050000
2023-04-06 00:53:17,979 epoch 43 - iter 1590/2650 - loss 0.02869553 - time (sec): 127.01 - samples/sec: 6931.99 - lr: 0.050000
2023-04-06 00:53:37,000 epoch 43 - iter 1855/2650 - loss 0.02883675 - time (sec): 146.03 - samples/sec: 7008.85 - lr: 0.050000
2023-04-06 00:53:56,381 epoch 43 - iter 2120/2650 - loss 0.02869713 - time (sec): 165.41 - samples/sec: 7080.72 - lr: 0.050000
2023-04-06 00:54:15,961 epoch 43 - iter 2385/2650 - loss 0.02862479 - time (sec): 184.99 - samples/sec: 7117.62 - lr: 0.050000
2023-04-06 00:54:35,447 epoch 43 - iter 2650/2650 - loss 0.02866556 - time (sec): 204.48 - samples/sec: 7151.67 - lr: 0.050000
2023-04-06 00:54:35,447 ----------------------------------------------------------------------------------------------------
2023-04-06 00:54:35,447 EPOCH 43 done: loss 0.0287 - lr 0.050000
2023-04-06 00:54:35,447 BAD EPOCHS (no improvement): 0
2023-04-06 00:54:35,451 ----------------------------------------------------------------------------------------------------
2023-04-06 00:54:55,598 epoch 44 - iter 265/2650 - loss 0.02710923 - time (sec): 20.15 - samples/sec: 7285.25 - lr: 0.050000
2023-04-06 00:55:14,932 epoch 44 - iter 530/2650 - loss 0.02736684 - time (sec): 39.48 - samples/sec: 7439.46 - lr: 0.050000
2023-04-06 00:55:34,411 epoch 44 - iter 795/2650 - loss 0.02717180 - time (sec): 58.96 - samples/sec: 7460.04 - lr: 0.050000
2023-04-06 00:55:53,601 epoch 44 - iter 1060/2650 - loss 0.02729847 - time (sec): 78.15 - samples/sec: 7485.47 - lr: 0.050000
2023-04-06 00:56:12,925 epoch 44 - iter 1325/2650 - loss 0.02746751 - time (sec): 97.47 - samples/sec: 7509.87 - lr: 0.050000
2023-04-06 00:56:32,029 epoch 44 - iter 1590/2650 - loss 0.02758190 - time (sec): 116.58 - samples/sec: 7531.09 - lr: 0.050000
2023-04-06 00:56:51,588 epoch 44 - iter 1855/2650 - loss 0.02744225 - time (sec): 136.14 - samples/sec: 7534.10 - lr: 0.050000
2023-04-06 00:57:11,142 epoch 44 - iter 2120/2650 - loss 0.02742070 - time (sec): 155.69 - samples/sec: 7536.47 - lr: 0.050000
2023-04-06 00:57:30,320 epoch 44 - iter 2385/2650 - loss 0.02746320 - time (sec): 174.87 - samples/sec: 7537.48 - lr: 0.050000
2023-04-06 00:57:49,294 epoch 44 - iter 2650/2650 - loss 0.02770175 - time (sec): 193.84 - samples/sec: 7544.07 - lr: 0.050000
2023-04-06 00:57:49,294 ----------------------------------------------------------------------------------------------------
2023-04-06 00:57:49,294 EPOCH 44 done: loss 0.0277 - lr 0.050000
2023-04-06 00:57:49,294 BAD EPOCHS (no improvement): 0
2023-04-06 00:57:49,298 ----------------------------------------------------------------------------------------------------
2023-04-06 00:58:08,191 epoch 45 - iter 265/2650 - loss 0.02509799 - time (sec): 18.89 - samples/sec: 7671.46 - lr: 0.050000
2023-04-06 00:58:27,340 epoch 45 - iter 530/2650 - loss 0.02702074 - time (sec): 38.04 - samples/sec: 7632.51 - lr: 0.050000
2023-04-06 00:58:46,386 epoch 45 - iter 795/2650 - loss 0.02733058 - time (sec): 57.09 - samples/sec: 7633.16 - lr: 0.050000
2023-04-06 00:59:05,973 epoch 45 - iter 1060/2650 - loss 0.02741913 - time (sec): 76.68 - samples/sec: 7591.64 - lr: 0.050000
2023-04-06 00:59:25,313 epoch 45 - iter 1325/2650 - loss 0.02708325 - time (sec): 96.02 - samples/sec: 7580.69 - lr: 0.050000
2023-04-06 00:59:44,651 epoch 45 - iter 1590/2650 - loss 0.02708799 - time (sec): 115.35 - samples/sec: 7581.25 - lr: 0.050000
2023-04-06 01:00:04,840 epoch 45 - iter 1855/2650 - loss 0.02726109 - time (sec): 135.54 - samples/sec: 7542.93 - lr: 0.050000
2023-04-06 01:00:24,277 epoch 45 - iter 2120/2650 - loss 0.02740109 - time (sec): 154.98 - samples/sec: 7540.46 - lr: 0.050000
2023-04-06 01:00:43,609 epoch 45 - iter 2385/2650 - loss 0.02735535 - time (sec): 174.31 - samples/sec: 7545.51 - lr: 0.050000
2023-04-06 01:01:03,274 epoch 45 - iter 2650/2650 - loss 0.02734510 - time (sec): 193.98 - samples/sec: 7538.87 - lr: 0.050000
2023-04-06 01:01:03,274 ----------------------------------------------------------------------------------------------------
2023-04-06 01:01:03,275 EPOCH 45 done: loss 0.0273 - lr 0.050000
2023-04-06 01:01:03,275 BAD EPOCHS (no improvement): 0
2023-04-06 01:01:03,290 ----------------------------------------------------------------------------------------------------
2023-04-06 01:01:22,877 epoch 46 - iter 265/2650 - loss 0.02775340 - time (sec): 19.59 - samples/sec: 7494.71 - lr: 0.050000
2023-04-06 01:01:42,020 epoch 46 - iter 530/2650 - loss 0.02753420 - time (sec): 38.73 - samples/sec: 7580.22 - lr: 0.050000
2023-04-06 01:02:01,173 epoch 46 - iter 795/2650 - loss 0.02683500 - time (sec): 57.88 - samples/sec: 7583.53 - lr: 0.050000
2023-04-06 01:02:21,033 epoch 46 - iter 1060/2650 - loss 0.02685330 - time (sec): 77.74 - samples/sec: 7518.09 - lr: 0.050000
2023-04-06 01:02:40,432 epoch 46 - iter 1325/2650 - loss 0.02686319 - time (sec): 97.14 - samples/sec: 7516.76 - lr: 0.050000
2023-04-06 01:03:00,209 epoch 46 - iter 1590/2650 - loss 0.02686952 - time (sec): 116.92 - samples/sec: 7510.60 - lr: 0.050000
2023-04-06 01:03:20,298 epoch 46 - iter 1855/2650 - loss 0.02696233 - time (sec): 137.01 - samples/sec: 7487.66 - lr: 0.050000
2023-04-06 01:03:39,539 epoch 46 - iter 2120/2650 - loss 0.02709771 - time (sec): 156.25 - samples/sec: 7503.20 - lr: 0.050000
2023-04-06 01:03:58,387 epoch 46 - iter 2385/2650 - loss 0.02721047 - time (sec): 175.10 - samples/sec: 7516.19 - lr: 0.050000
2023-04-06 01:04:17,792 epoch 46 - iter 2650/2650 - loss 0.02726884 - time (sec): 194.50 - samples/sec: 7518.52 - lr: 0.050000
2023-04-06 01:04:17,792 ----------------------------------------------------------------------------------------------------
2023-04-06 01:04:17,792 EPOCH 46 done: loss 0.0273 - lr 0.050000
2023-04-06 01:04:17,792 BAD EPOCHS (no improvement): 0
2023-04-06 01:04:17,796 ----------------------------------------------------------------------------------------------------
2023-04-06 01:04:37,761 epoch 47 - iter 265/2650 - loss 0.02579254 - time (sec): 19.96 - samples/sec: 7396.54 - lr: 0.050000
2023-04-06 01:04:56,809 epoch 47 - iter 530/2650 - loss 0.02520486 - time (sec): 39.01 - samples/sec: 7504.27 - lr: 0.050000
2023-04-06 01:05:16,166 epoch 47 - iter 795/2650 - loss 0.02535124 - time (sec): 58.37 - samples/sec: 7510.62 - lr: 0.050000
2023-04-06 01:05:35,787 epoch 47 - iter 1060/2650 - loss 0.02555169 - time (sec): 77.99 - samples/sec: 7503.14 - lr: 0.050000
2023-04-06 01:05:55,087 epoch 47 - iter 1325/2650 - loss 0.02568268 - time (sec): 97.29 - samples/sec: 7504.14 - lr: 0.050000
2023-04-06 01:06:14,578 epoch 47 - iter 1590/2650 - loss 0.02589613 - time (sec): 116.78 - samples/sec: 7517.83 - lr: 0.050000
2023-04-06 01:06:33,942 epoch 47 - iter 1855/2650 - loss 0.02605782 - time (sec): 136.15 - samples/sec: 7524.67 - lr: 0.050000
2023-04-06 01:06:53,258 epoch 47 - iter 2120/2650 - loss 0.02630314 - time (sec): 155.46 - samples/sec: 7519.38 - lr: 0.050000
2023-04-06 01:07:12,382 epoch 47 - iter 2385/2650 - loss 0.02634488 - time (sec): 174.59 - samples/sec: 7530.39 - lr: 0.050000
2023-04-06 01:07:32,130 epoch 47 - iter 2650/2650 - loss 0.02638097 - time (sec): 194.33 - samples/sec: 7525.04 - lr: 0.050000
2023-04-06 01:07:32,130 ----------------------------------------------------------------------------------------------------
2023-04-06 01:07:32,130 EPOCH 47 done: loss 0.0264 - lr 0.050000
2023-04-06 01:07:32,130 BAD EPOCHS (no improvement): 0
2023-04-06 01:07:32,134 ----------------------------------------------------------------------------------------------------
2023-04-06 01:07:51,165 epoch 48 - iter 265/2650 - loss 0.02497006 - time (sec): 19.03 - samples/sec: 7708.24 - lr: 0.050000
2023-04-06 01:08:10,521 epoch 48 - iter 530/2650 - loss 0.02537864 - time (sec): 38.39 - samples/sec: 7652.81 - lr: 0.050000
2023-04-06 01:08:30,134 epoch 48 - iter 795/2650 - loss 0.02586341 - time (sec): 58.00 - samples/sec: 7601.50 - lr: 0.050000
2023-04-06 01:08:49,049 epoch 48 - iter 1060/2650 - loss 0.02610405 - time (sec): 76.91 - samples/sec: 7624.61 - lr: 0.050000
2023-04-06 01:09:08,782 epoch 48 - iter 1325/2650 - loss 0.02613025 - time (sec): 96.65 - samples/sec: 7585.54 - lr: 0.050000
2023-04-06 01:09:27,998 epoch 48 - iter 1590/2650 - loss 0.02639868 - time (sec): 115.86 - samples/sec: 7587.82 - lr: 0.050000
2023-04-06 01:09:47,645 epoch 48 - iter 1855/2650 - loss 0.02665686 - time (sec): 135.51 - samples/sec: 7557.65 - lr: 0.050000
2023-04-06 01:10:07,062 epoch 48 - iter 2120/2650 - loss 0.02648754 - time (sec): 154.93 - samples/sec: 7560.05 - lr: 0.050000
2023-04-06 01:10:26,239 epoch 48 - iter 2385/2650 - loss 0.02640973 - time (sec): 174.10 - samples/sec: 7562.91 - lr: 0.050000
2023-04-06 01:10:45,825 epoch 48 - iter 2650/2650 - loss 0.02644546 - time (sec): 193.69 - samples/sec: 7549.98 - lr: 0.050000
2023-04-06 01:10:45,825 ----------------------------------------------------------------------------------------------------
2023-04-06 01:10:45,825 EPOCH 48 done: loss 0.0264 - lr 0.050000
2023-04-06 01:10:45,825 BAD EPOCHS (no improvement): 1
2023-04-06 01:10:45,828 ----------------------------------------------------------------------------------------------------
2023-04-06 01:11:04,470 epoch 49 - iter 265/2650 - loss 0.02572595 - time (sec): 18.64 - samples/sec: 7716.66 - lr: 0.050000
2023-04-06 01:11:24,007 epoch 49 - iter 530/2650 - loss 0.02552120 - time (sec): 38.18 - samples/sec: 7602.09 - lr: 0.050000
2023-04-06 01:11:43,480 epoch 49 - iter 795/2650 - loss 0.02633131 - time (sec): 57.65 - samples/sec: 7576.99 - lr: 0.050000
2023-04-06 01:12:02,339 epoch 49 - iter 1060/2650 - loss 0.02598872 - time (sec): 76.51 - samples/sec: 7590.27 - lr: 0.050000
2023-04-06 01:12:22,289 epoch 49 - iter 1325/2650 - loss 0.02601439 - time (sec): 96.46 - samples/sec: 7541.19 - lr: 0.050000
2023-04-06 01:12:42,730 epoch 49 - iter 1590/2650 - loss 0.02620471 - time (sec): 116.90 - samples/sec: 7492.66 - lr: 0.050000
2023-04-06 01:13:01,883 epoch 49 - iter 1855/2650 - loss 0.02615302 - time (sec): 136.05 - samples/sec: 7516.21 - lr: 0.050000
2023-04-06 01:13:20,819 epoch 49 - iter 2120/2650 - loss 0.02615157 - time (sec): 154.99 - samples/sec: 7522.12 - lr: 0.050000
2023-04-06 01:13:40,282 epoch 49 - iter 2385/2650 - loss 0.02609305 - time (sec): 174.45 - samples/sec: 7535.30 - lr: 0.050000
2023-04-06 01:13:59,805 epoch 49 - iter 2650/2650 - loss 0.02602727 - time (sec): 193.98 - samples/sec: 7538.88 - lr: 0.050000
2023-04-06 01:13:59,805 ----------------------------------------------------------------------------------------------------
2023-04-06 01:13:59,805 EPOCH 49 done: loss 0.0260 - lr 0.050000
2023-04-06 01:13:59,805 BAD EPOCHS (no improvement): 0
2023-04-06 01:13:59,808 ----------------------------------------------------------------------------------------------------
2023-04-06 01:14:19,314 epoch 50 - iter 265/2650 - loss 0.02565035 - time (sec): 19.51 - samples/sec: 7508.88 - lr: 0.050000
2023-04-06 01:14:38,162 epoch 50 - iter 530/2650 - loss 0.02543204 - time (sec): 38.35 - samples/sec: 7571.96 - lr: 0.050000
2023-04-06 01:14:57,197 epoch 50 - iter 795/2650 - loss 0.02555477 - time (sec): 57.39 - samples/sec: 7615.21 - lr: 0.050000
2023-04-06 01:15:16,424 epoch 50 - iter 1060/2650 - loss 0.02503155 - time (sec): 76.62 - samples/sec: 7592.32 - lr: 0.050000
2023-04-06 01:15:35,860 epoch 50 - iter 1325/2650 - loss 0.02525104 - time (sec): 96.05 - samples/sec: 7584.93 - lr: 0.050000
2023-04-06 01:15:55,505 epoch 50 - iter 1590/2650 - loss 0.02528522 - time (sec): 115.70 - samples/sec: 7579.38 - lr: 0.050000
2023-04-06 01:16:14,798 epoch 50 - iter 1855/2650 - loss 0.02549004 - time (sec): 134.99 - samples/sec: 7577.29 - lr: 0.050000
2023-04-06 01:16:34,439 epoch 50 - iter 2120/2650 - loss 0.02573239 - time (sec): 154.63 - samples/sec: 7560.74 - lr: 0.050000
2023-04-06 01:16:54,084 epoch 50 - iter 2385/2650 - loss 0.02561036 - time (sec): 174.28 - samples/sec: 7552.73 - lr: 0.050000
2023-04-06 01:17:13,721 epoch 50 - iter 2650/2650 - loss 0.02588396 - time (sec): 193.91 - samples/sec: 7541.35 - lr: 0.050000
2023-04-06 01:17:13,722 ----------------------------------------------------------------------------------------------------
2023-04-06 01:17:13,722 EPOCH 50 done: loss 0.0259 - lr 0.050000
2023-04-06 01:17:13,722 BAD EPOCHS (no improvement): 0
2023-04-06 01:17:13,725 ----------------------------------------------------------------------------------------------------
2023-04-06 01:17:33,203 epoch 51 - iter 265/2650 - loss 0.02476935 - time (sec): 19.48 - samples/sec: 7500.10 - lr: 0.050000
2023-04-06 01:17:52,079 epoch 51 - iter 530/2650 - loss 0.02463422 - time (sec): 38.35 - samples/sec: 7586.37 - lr: 0.050000
2023-04-06 01:18:11,377 epoch 51 - iter 795/2650 - loss 0.02502662 - time (sec): 57.65 - samples/sec: 7590.55 - lr: 0.050000
2023-04-06 01:18:30,412 epoch 51 - iter 1060/2650 - loss 0.02513068 - time (sec): 76.69 - samples/sec: 7601.60 - lr: 0.050000
2023-04-06 01:18:49,768 epoch 51 - iter 1325/2650 - loss 0.02536481 - time (sec): 96.04 - samples/sec: 7604.77 - lr: 0.050000
2023-04-06 01:19:08,895 epoch 51 - iter 1590/2650 - loss 0.02530856 - time (sec): 115.17 - samples/sec: 7607.85 - lr: 0.050000
2023-04-06 01:19:28,314 epoch 51 - iter 1855/2650 - loss 0.02515549 - time (sec): 134.59 - samples/sec: 7599.49 - lr: 0.050000
2023-04-06 01:19:48,204 epoch 51 - iter 2120/2650 - loss 0.02537275 - time (sec): 154.48 - samples/sec: 7572.89 - lr: 0.050000
2023-04-06 01:20:07,558 epoch 51 - iter 2385/2650 - loss 0.02531388 - time (sec): 173.83 - samples/sec: 7571.59 - lr: 0.050000
2023-04-06 01:20:26,705 epoch 51 - iter 2650/2650 - loss 0.02558449 - time (sec): 192.98 - samples/sec: 7577.82 - lr: 0.050000
2023-04-06 01:20:26,705 ----------------------------------------------------------------------------------------------------
2023-04-06 01:20:26,705 EPOCH 51 done: loss 0.0256 - lr 0.050000
2023-04-06 01:20:26,705 BAD EPOCHS (no improvement): 0
2023-04-06 01:20:26,708 ----------------------------------------------------------------------------------------------------
2023-04-06 01:20:46,533 epoch 52 - iter 265/2650 - loss 0.02428667 - time (sec): 19.82 - samples/sec: 7498.55 - lr: 0.050000
2023-04-06 01:21:06,088 epoch 52 - iter 530/2650 - loss 0.02445563 - time (sec): 39.38 - samples/sec: 7479.28 - lr: 0.050000
2023-04-06 01:21:25,191 epoch 52 - iter 795/2650 - loss 0.02421567 - time (sec): 58.48 - samples/sec: 7523.45 - lr: 0.050000
2023-04-06 01:21:44,634 epoch 52 - iter 1060/2650 - loss 0.02474226 - time (sec): 77.93 - samples/sec: 7506.65 - lr: 0.050000
2023-04-06 01:22:03,898 epoch 52 - iter 1325/2650 - loss 0.02506947 - time (sec): 97.19 - samples/sec: 7544.89 - lr: 0.050000
2023-04-06 01:22:23,454 epoch 52 - iter 1590/2650 - loss 0.02522818 - time (sec): 116.75 - samples/sec: 7548.89 - lr: 0.050000
2023-04-06 01:22:43,024 epoch 52 - iter 1855/2650 - loss 0.02536071 - time (sec): 136.32 - samples/sec: 7535.44 - lr: 0.050000
2023-04-06 01:23:02,131 epoch 52 - iter 2120/2650 - loss 0.02545367 - time (sec): 155.42 - samples/sec: 7557.21 - lr: 0.050000
2023-04-06 01:23:20,797 epoch 52 - iter 2385/2650 - loss 0.02536263 - time (sec): 174.09 - samples/sec: 7577.26 - lr: 0.050000
2023-04-06 01:23:39,861 epoch 52 - iter 2650/2650 - loss 0.02535702 - time (sec): 193.15 - samples/sec: 7571.02 - lr: 0.050000
2023-04-06 01:23:39,861 ----------------------------------------------------------------------------------------------------
2023-04-06 01:23:39,861 EPOCH 52 done: loss 0.0254 - lr 0.050000
2023-04-06 01:23:39,861 BAD EPOCHS (no improvement): 0
2023-04-06 01:23:39,864 ----------------------------------------------------------------------------------------------------
2023-04-06 01:23:59,168 epoch 53 - iter 265/2650 - loss 0.02384415 - time (sec): 19.30 - samples/sec: 7587.55 - lr: 0.050000
2023-04-06 01:24:18,424 epoch 53 - iter 530/2650 - loss 0.02477392 - time (sec): 38.56 - samples/sec: 7568.35 - lr: 0.050000
2023-04-06 01:24:37,042 epoch 53 - iter 795/2650 - loss 0.02478839 - time (sec): 57.18 - samples/sec: 7644.22 - lr: 0.050000
2023-04-06 01:24:56,776 epoch 53 - iter 1060/2650 - loss 0.02494889 - time (sec): 76.91 - samples/sec: 7597.85 - lr: 0.050000
2023-04-06 01:25:16,271 epoch 53 - iter 1325/2650 - loss 0.02480454 - time (sec): 96.41 - samples/sec: 7601.60 - lr: 0.050000
2023-04-06 01:25:35,661 epoch 53 - iter 1590/2650 - loss 0.02468777 - time (sec): 115.80 - samples/sec: 7580.97 - lr: 0.050000
2023-04-06 01:25:54,928 epoch 53 - iter 1855/2650 - loss 0.02499293 - time (sec): 135.06 - samples/sec: 7572.97 - lr: 0.050000
2023-04-06 01:26:13,712 epoch 53 - iter 2120/2650 - loss 0.02510099 - time (sec): 153.85 - samples/sec: 7593.36 - lr: 0.050000
2023-04-06 01:26:33,163 epoch 53 - iter 2385/2650 - loss 0.02489719 - time (sec): 173.30 - samples/sec: 7587.98 - lr: 0.050000
2023-04-06 01:26:53,182 epoch 53 - iter 2650/2650 - loss 0.02507869 - time (sec): 193.32 - samples/sec: 7564.54 - lr: 0.050000
2023-04-06 01:26:53,183 ----------------------------------------------------------------------------------------------------
2023-04-06 01:26:53,183 EPOCH 53 done: loss 0.0251 - lr 0.050000
2023-04-06 01:26:53,183 BAD EPOCHS (no improvement): 0
2023-04-06 01:26:53,186 ----------------------------------------------------------------------------------------------------
2023-04-06 01:27:12,263 epoch 54 - iter 265/2650 - loss 0.02404935 - time (sec): 19.08 - samples/sec: 7668.36 - lr: 0.050000
2023-04-06 01:27:31,619 epoch 54 - iter 530/2650 - loss 0.02523240 - time (sec): 38.43 - samples/sec: 7617.33 - lr: 0.050000
2023-04-06 01:27:51,094 epoch 54 - iter 795/2650 - loss 0.02507307 - time (sec): 57.91 - samples/sec: 7607.61 - lr: 0.050000
2023-04-06 01:28:11,262 epoch 54 - iter 1060/2650 - loss 0.02508579 - time (sec): 78.08 - samples/sec: 7537.86 - lr: 0.050000
2023-04-06 01:28:30,719 epoch 54 - iter 1325/2650 - loss 0.02516602 - time (sec): 97.53 - samples/sec: 7549.15 - lr: 0.050000
2023-04-06 01:28:50,363 epoch 54 - iter 1590/2650 - loss 0.02526335 - time (sec): 117.18 - samples/sec: 7526.76 - lr: 0.050000
2023-04-06 01:29:08,929 epoch 54 - iter 1855/2650 - loss 0.02520349 - time (sec): 135.74 - samples/sec: 7550.96 - lr: 0.050000
2023-04-06 01:29:28,607 epoch 54 - iter 2120/2650 - loss 0.02535713 - time (sec): 155.42 - samples/sec: 7542.01 - lr: 0.050000
2023-04-06 01:29:47,436 epoch 54 - iter 2385/2650 - loss 0.02533769 - time (sec): 174.25 - samples/sec: 7550.47 - lr: 0.050000
2023-04-06 01:30:06,963 epoch 54 - iter 2650/2650 - loss 0.02533762 - time (sec): 193.78 - samples/sec: 7546.64 - lr: 0.050000
2023-04-06 01:30:06,964 ----------------------------------------------------------------------------------------------------
2023-04-06 01:30:06,964 EPOCH 54 done: loss 0.0253 - lr 0.050000
2023-04-06 01:30:06,964 BAD EPOCHS (no improvement): 1
2023-04-06 01:30:06,968 ----------------------------------------------------------------------------------------------------
2023-04-06 01:30:25,678 epoch 55 - iter 265/2650 - loss 0.02445691 - time (sec): 18.71 - samples/sec: 7748.94 - lr: 0.050000
2023-04-06 01:30:44,548 epoch 55 - iter 530/2650 - loss 0.02469012 - time (sec): 37.58 - samples/sec: 7732.49 - lr: 0.050000
2023-04-06 01:31:04,063 epoch 55 - iter 795/2650 - loss 0.02431063 - time (sec): 57.10 - samples/sec: 7675.71 - lr: 0.050000
2023-04-06 01:31:24,143 epoch 55 - iter 1060/2650 - loss 0.02403691 - time (sec): 77.18 - samples/sec: 7587.05 - lr: 0.050000
2023-04-06 01:31:43,149 epoch 55 - iter 1325/2650 - loss 0.02393999 - time (sec): 96.18 - samples/sec: 7591.61 - lr: 0.050000
2023-04-06 01:32:02,452 epoch 55 - iter 1590/2650 - loss 0.02432887 - time (sec): 115.48 - samples/sec: 7597.23 - lr: 0.050000
2023-04-06 01:32:22,099 epoch 55 - iter 1855/2650 - loss 0.02429997 - time (sec): 135.13 - samples/sec: 7571.78 - lr: 0.050000
2023-04-06 01:32:40,968 epoch 55 - iter 2120/2650 - loss 0.02442207 - time (sec): 154.00 - samples/sec: 7583.90 - lr: 0.050000
2023-04-06 01:33:00,509 epoch 55 - iter 2385/2650 - loss 0.02432777 - time (sec): 173.54 - samples/sec: 7574.46 - lr: 0.050000
2023-04-06 01:33:20,420 epoch 55 - iter 2650/2650 - loss 0.02436127 - time (sec): 193.45 - samples/sec: 7559.28 - lr: 0.050000
2023-04-06 01:33:20,421 ----------------------------------------------------------------------------------------------------
2023-04-06 01:33:20,421 EPOCH 55 done: loss 0.0244 - lr 0.050000
2023-04-06 01:33:20,421 BAD EPOCHS (no improvement): 0
2023-04-06 01:33:20,425 ----------------------------------------------------------------------------------------------------
2023-04-06 01:33:39,910 epoch 56 - iter 265/2650 - loss 0.02377505 - time (sec): 19.49 - samples/sec: 7489.99 - lr: 0.050000
2023-04-06 01:33:59,324 epoch 56 - iter 530/2650 - loss 0.02438565 - time (sec): 38.90 - samples/sec: 7519.72 - lr: 0.050000
2023-04-06 01:34:18,821 epoch 56 - iter 795/2650 - loss 0.02383171 - time (sec): 58.40 - samples/sec: 7519.05 - lr: 0.050000
2023-04-06 01:34:38,703 epoch 56 - iter 1060/2650 - loss 0.02403599 - time (sec): 78.28 - samples/sec: 7485.78 - lr: 0.050000
2023-04-06 01:34:57,585 epoch 56 - iter 1325/2650 - loss 0.02387954 - time (sec): 97.16 - samples/sec: 7515.97 - lr: 0.050000
2023-04-06 01:35:17,090 epoch 56 - iter 1590/2650 - loss 0.02397118 - time (sec): 116.67 - samples/sec: 7514.80 - lr: 0.050000
2023-04-06 01:35:36,055 epoch 56 - iter 1855/2650 - loss 0.02403955 - time (sec): 135.63 - samples/sec: 7542.35 - lr: 0.050000
2023-04-06 01:35:55,238 epoch 56 - iter 2120/2650 - loss 0.02403829 - time (sec): 154.81 - samples/sec: 7555.59 - lr: 0.050000
2023-04-06 01:36:14,399 epoch 56 - iter 2385/2650 - loss 0.02412982 - time (sec): 173.97 - samples/sec: 7557.59 - lr: 0.050000
2023-04-06 01:36:33,828 epoch 56 - iter 2650/2650 - loss 0.02428701 - time (sec): 193.40 - samples/sec: 7561.22 - lr: 0.050000
2023-04-06 01:36:33,828 ----------------------------------------------------------------------------------------------------
2023-04-06 01:36:33,828 EPOCH 56 done: loss 0.0243 - lr 0.050000
2023-04-06 01:36:33,828 BAD EPOCHS (no improvement): 0
2023-04-06 01:36:33,832 ----------------------------------------------------------------------------------------------------
2023-04-06 01:36:53,152 epoch 57 - iter 265/2650 - loss 0.02398013 - time (sec): 19.32 - samples/sec: 7533.07 - lr: 0.050000
2023-04-06 01:37:12,591 epoch 57 - iter 530/2650 - loss 0.02429248 - time (sec): 38.76 - samples/sec: 7542.41 - lr: 0.050000
2023-04-06 01:37:32,172 epoch 57 - iter 795/2650 - loss 0.02418072 - time (sec): 58.34 - samples/sec: 7538.34 - lr: 0.050000
2023-04-06 01:37:51,594 epoch 57 - iter 1060/2650 - loss 0.02418512 - time (sec): 77.76 - samples/sec: 7538.48 - lr: 0.050000
2023-04-06 01:38:10,575 epoch 57 - iter 1325/2650 - loss 0.02406297 - time (sec): 96.74 - samples/sec: 7565.41 - lr: 0.050000
2023-04-06 01:38:29,560 epoch 57 - iter 1590/2650 - loss 0.02409748 - time (sec): 115.73 - samples/sec: 7583.74 - lr: 0.050000
2023-04-06 01:38:48,964 epoch 57 - iter 1855/2650 - loss 0.02398304 - time (sec): 135.13 - samples/sec: 7579.53 - lr: 0.050000
2023-04-06 01:39:08,450 epoch 57 - iter 2120/2650 - loss 0.02405014 - time (sec): 154.62 - samples/sec: 7571.11 - lr: 0.050000
2023-04-06 01:39:27,959 epoch 57 - iter 2385/2650 - loss 0.02400264 - time (sec): 174.13 - samples/sec: 7564.21 - lr: 0.050000
2023-04-06 01:39:46,995 epoch 57 - iter 2650/2650 - loss 0.02417812 - time (sec): 193.16 - samples/sec: 7570.62 - lr: 0.050000
2023-04-06 01:39:46,995 ----------------------------------------------------------------------------------------------------
2023-04-06 01:39:46,995 EPOCH 57 done: loss 0.0242 - lr 0.050000
2023-04-06 01:39:46,995 BAD EPOCHS (no improvement): 0
2023-04-06 01:39:46,998 ----------------------------------------------------------------------------------------------------
2023-04-06 01:40:06,059 epoch 58 - iter 265/2650 - loss 0.02391883 - time (sec): 19.06 - samples/sec: 7554.10 - lr: 0.050000
2023-04-06 01:40:25,476 epoch 58 - iter 530/2650 - loss 0.02445235 - time (sec): 38.48 - samples/sec: 7534.71 - lr: 0.050000
2023-04-06 01:40:44,829 epoch 58 - iter 795/2650 - loss 0.02420618 - time (sec): 57.83 - samples/sec: 7572.99 - lr: 0.050000
2023-04-06 01:41:03,885 epoch 58 - iter 1060/2650 - loss 0.02416510 - time (sec): 76.89 - samples/sec: 7586.44 - lr: 0.050000
2023-04-06 01:41:23,497 epoch 58 - iter 1325/2650 - loss 0.02420534 - time (sec): 96.50 - samples/sec: 7549.23 - lr: 0.050000
2023-04-06 01:41:43,225 epoch 58 - iter 1590/2650 - loss 0.02406354 - time (sec): 116.23 - samples/sec: 7528.95 - lr: 0.050000
2023-04-06 01:42:03,252 epoch 58 - iter 1855/2650 - loss 0.02407472 - time (sec): 136.25 - samples/sec: 7512.06 - lr: 0.050000
2023-04-06 01:42:22,191 epoch 58 - iter 2120/2650 - loss 0.02412140 - time (sec): 155.19 - samples/sec: 7537.83 - lr: 0.050000
2023-04-06 01:42:41,533 epoch 58 - iter 2385/2650 - loss 0.02397507 - time (sec): 174.53 - samples/sec: 7544.20 - lr: 0.050000
2023-04-06 01:43:00,508 epoch 58 - iter 2650/2650 - loss 0.02401095 - time (sec): 193.51 - samples/sec: 7557.06 - lr: 0.050000
2023-04-06 01:43:00,508 ----------------------------------------------------------------------------------------------------
2023-04-06 01:43:00,508 EPOCH 58 done: loss 0.0240 - lr 0.050000
2023-04-06 01:43:00,508 BAD EPOCHS (no improvement): 0
2023-04-06 01:43:00,511 ----------------------------------------------------------------------------------------------------
2023-04-06 01:43:29,743 epoch 59 - iter 265/2650 - loss 0.02401676 - time (sec): 29.23 - samples/sec: 4985.83 - lr: 0.050000
2023-04-06 01:43:49,665 epoch 59 - iter 530/2650 - loss 0.02373717 - time (sec): 49.15 - samples/sec: 5958.01 - lr: 0.050000
2023-04-06 01:44:08,998 epoch 59 - iter 795/2650 - loss 0.02367060 - time (sec): 68.49 - samples/sec: 6416.11 - lr: 0.050000
2023-04-06 01:44:27,974 epoch 59 - iter 1060/2650 - loss 0.02394443 - time (sec): 87.46 - samples/sec: 6707.21 - lr: 0.050000
2023-04-06 01:44:47,346 epoch 59 - iter 1325/2650 - loss 0.02376119 - time (sec): 106.83 - samples/sec: 6875.43 - lr: 0.050000
2023-04-06 01:45:06,237 epoch 59 - iter 1590/2650 - loss 0.02397139 - time (sec): 125.73 - samples/sec: 6993.14 - lr: 0.050000
2023-04-06 01:45:25,520 epoch 59 - iter 1855/2650 - loss 0.02410068 - time (sec): 145.01 - samples/sec: 7072.06 - lr: 0.050000
2023-04-06 01:45:44,796 epoch 59 - iter 2120/2650 - loss 0.02416227 - time (sec): 164.28 - samples/sec: 7131.06 - lr: 0.050000
2023-04-06 01:46:04,055 epoch 59 - iter 2385/2650 - loss 0.02427166 - time (sec): 183.54 - samples/sec: 7171.22 - lr: 0.050000
2023-04-06 01:46:23,607 epoch 59 - iter 2650/2650 - loss 0.02429694 - time (sec): 203.10 - samples/sec: 7200.36 - lr: 0.050000
2023-04-06 01:46:23,608 ----------------------------------------------------------------------------------------------------
2023-04-06 01:46:23,608 EPOCH 59 done: loss 0.0243 - lr 0.050000
2023-04-06 01:46:23,608 BAD EPOCHS (no improvement): 1
2023-04-06 01:46:23,612 ----------------------------------------------------------------------------------------------------
2023-04-06 01:46:43,440 epoch 60 - iter 265/2650 - loss 0.02377375 - time (sec): 19.83 - samples/sec: 7434.15 - lr: 0.050000
2023-04-06 01:47:02,502 epoch 60 - iter 530/2650 - loss 0.02425963 - time (sec): 38.89 - samples/sec: 7541.74 - lr: 0.050000
2023-04-06 01:47:21,936 epoch 60 - iter 795/2650 - loss 0.02365385 - time (sec): 58.32 - samples/sec: 7510.84 - lr: 0.050000
2023-04-06 01:47:41,656 epoch 60 - iter 1060/2650 - loss 0.02378961 - time (sec): 78.04 - samples/sec: 7487.93 - lr: 0.050000
2023-04-06 01:48:01,510 epoch 60 - iter 1325/2650 - loss 0.02388184 - time (sec): 97.90 - samples/sec: 7474.98 - lr: 0.050000
2023-04-06 01:48:20,827 epoch 60 - iter 1590/2650 - loss 0.02398835 - time (sec): 117.22 - samples/sec: 7487.91 - lr: 0.050000
2023-04-06 01:48:39,680 epoch 60 - iter 1855/2650 - loss 0.02396476 - time (sec): 136.07 - samples/sec: 7523.92 - lr: 0.050000
2023-04-06 01:48:59,119 epoch 60 - iter 2120/2650 - loss 0.02391949 - time (sec): 155.51 - samples/sec: 7528.10 - lr: 0.050000
2023-04-06 01:49:18,645 epoch 60 - iter 2385/2650 - loss 0.02394978 - time (sec): 175.03 - samples/sec: 7521.83 - lr: 0.050000
2023-04-06 01:49:37,311 epoch 60 - iter 2650/2650 - loss 0.02405185 - time (sec): 193.70 - samples/sec: 7549.66 - lr: 0.050000
2023-04-06 01:49:37,311 ----------------------------------------------------------------------------------------------------
2023-04-06 01:49:37,311 EPOCH 60 done: loss 0.0241 - lr 0.050000
2023-04-06 01:49:37,311 BAD EPOCHS (no improvement): 2
2023-04-06 01:49:37,314 ----------------------------------------------------------------------------------------------------
2023-04-06 01:49:57,331 epoch 61 - iter 265/2650 - loss 0.02314493 - time (sec): 20.02 - samples/sec: 7346.36 - lr: 0.050000
2023-04-06 01:50:16,962 epoch 61 - iter 530/2650 - loss 0.02354502 - time (sec): 39.65 - samples/sec: 7428.09 - lr: 0.050000
2023-04-06 01:50:36,953 epoch 61 - iter 795/2650 - loss 0.02334702 - time (sec): 59.64 - samples/sec: 7399.68 - lr: 0.050000
2023-04-06 01:50:55,747 epoch 61 - iter 1060/2650 - loss 0.02314119 - time (sec): 78.43 - samples/sec: 7481.04 - lr: 0.050000
2023-04-06 01:51:14,671 epoch 61 - iter 1325/2650 - loss 0.02323366 - time (sec): 97.36 - samples/sec: 7519.98 - lr: 0.050000
2023-04-06 01:51:33,426 epoch 61 - iter 1590/2650 - loss 0.02328936 - time (sec): 116.11 - samples/sec: 7552.74 - lr: 0.050000
2023-04-06 01:51:52,857 epoch 61 - iter 1855/2650 - loss 0.02346059 - time (sec): 135.54 - samples/sec: 7560.67 - lr: 0.050000
2023-04-06 01:52:11,964 epoch 61 - iter 2120/2650 - loss 0.02365344 - time (sec): 154.65 - samples/sec: 7564.04 - lr: 0.050000
2023-04-06 01:52:31,516 epoch 61 - iter 2385/2650 - loss 0.02360404 - time (sec): 174.20 - samples/sec: 7562.99 - lr: 0.050000
2023-04-06 01:52:50,450 epoch 61 - iter 2650/2650 - loss 0.02365696 - time (sec): 193.14 - samples/sec: 7571.68 - lr: 0.050000
2023-04-06 01:52:50,451 ----------------------------------------------------------------------------------------------------
2023-04-06 01:52:50,451 EPOCH 61 done: loss 0.0237 - lr 0.050000
2023-04-06 01:52:50,451 BAD EPOCHS (no improvement): 0
2023-04-06 01:52:50,454 ----------------------------------------------------------------------------------------------------
2023-04-06 01:53:09,375 epoch 62 - iter 265/2650 - loss 0.02199520 - time (sec): 18.92 - samples/sec: 7697.59 - lr: 0.050000
2023-04-06 01:53:28,458 epoch 62 - iter 530/2650 - loss 0.02213204 - time (sec): 38.00 - samples/sec: 7643.91 - lr: 0.050000
2023-04-06 01:53:48,180 epoch 62 - iter 795/2650 - loss 0.02257149 - time (sec): 57.73 - samples/sec: 7583.14 - lr: 0.050000
2023-04-06 01:54:07,794 epoch 62 - iter 1060/2650 - loss 0.02305612 - time (sec): 77.34 - samples/sec: 7560.76 - lr: 0.050000
2023-04-06 01:54:27,055 epoch 62 - iter 1325/2650 - loss 0.02306570 - time (sec): 96.60 - samples/sec: 7567.68 - lr: 0.050000
2023-04-06 01:54:46,278 epoch 62 - iter 1590/2650 - loss 0.02325935 - time (sec): 115.82 - samples/sec: 7570.73 - lr: 0.050000
2023-04-06 01:55:05,033 epoch 62 - iter 1855/2650 - loss 0.02333108 - time (sec): 134.58 - samples/sec: 7591.17 - lr: 0.050000
2023-04-06 01:55:24,646 epoch 62 - iter 2120/2650 - loss 0.02324183 - time (sec): 154.19 - samples/sec: 7580.73 - lr: 0.050000
2023-04-06 01:55:43,810 epoch 62 - iter 2385/2650 - loss 0.02313471 - time (sec): 173.36 - samples/sec: 7586.10 - lr: 0.050000
2023-04-06 01:56:03,592 epoch 62 - iter 2650/2650 - loss 0.02312860 - time (sec): 193.14 - samples/sec: 7571.58 - lr: 0.050000
2023-04-06 01:56:03,593 ----------------------------------------------------------------------------------------------------
2023-04-06 01:56:03,593 EPOCH 62 done: loss 0.0231 - lr 0.050000
2023-04-06 01:56:03,593 BAD EPOCHS (no improvement): 0
2023-04-06 01:56:03,596 ----------------------------------------------------------------------------------------------------
2023-04-06 01:56:22,959 epoch 63 - iter 265/2650 - loss 0.02258374 - time (sec): 19.36 - samples/sec: 7544.21 - lr: 0.050000
2023-04-06 01:56:41,606 epoch 63 - iter 530/2650 - loss 0.02268069 - time (sec): 38.01 - samples/sec: 7580.64 - lr: 0.050000
2023-04-06 01:57:00,821 epoch 63 - iter 795/2650 - loss 0.02266222 - time (sec): 57.22 - samples/sec: 7611.11 - lr: 0.050000
2023-04-06 01:57:19,730 epoch 63 - iter 1060/2650 - loss 0.02299915 - time (sec): 76.13 - samples/sec: 7638.57 - lr: 0.050000
2023-04-06 01:57:39,117 epoch 63 - iter 1325/2650 - loss 0.02301035 - time (sec): 95.52 - samples/sec: 7620.75 - lr: 0.050000
2023-04-06 01:57:59,259 epoch 63 - iter 1590/2650 - loss 0.02327957 - time (sec): 115.66 - samples/sec: 7567.01 - lr: 0.050000
2023-04-06 01:58:18,795 epoch 63 - iter 1855/2650 - loss 0.02326396 - time (sec): 135.20 - samples/sec: 7564.16 - lr: 0.050000
2023-04-06 01:58:38,026 epoch 63 - iter 2120/2650 - loss 0.02342097 - time (sec): 154.43 - samples/sec: 7575.37 - lr: 0.050000
2023-04-06 01:58:57,439 epoch 63 - iter 2385/2650 - loss 0.02347558 - time (sec): 173.84 - samples/sec: 7573.20 - lr: 0.050000
2023-04-06 01:59:16,838 epoch 63 - iter 2650/2650 - loss 0.02347955 - time (sec): 193.24 - samples/sec: 7567.53 - lr: 0.050000
2023-04-06 01:59:16,838 ----------------------------------------------------------------------------------------------------
2023-04-06 01:59:16,838 EPOCH 63 done: loss 0.0235 - lr 0.050000
2023-04-06 01:59:16,838 BAD EPOCHS (no improvement): 1
2023-04-06 01:59:16,842 ----------------------------------------------------------------------------------------------------
2023-04-06 01:59:35,954 epoch 64 - iter 265/2650 - loss 0.02286926 - time (sec): 19.11 - samples/sec: 7552.47 - lr: 0.050000
2023-04-06 01:59:55,675 epoch 64 - iter 530/2650 - loss 0.02258219 - time (sec): 38.83 - samples/sec: 7523.03 - lr: 0.050000
2023-04-06 02:00:15,374 epoch 64 - iter 795/2650 - loss 0.02226003 - time (sec): 58.53 - samples/sec: 7498.28 - lr: 0.050000
2023-04-06 02:00:34,600 epoch 64 - iter 1060/2650 - loss 0.02266443 - time (sec): 77.76 - samples/sec: 7534.54 - lr: 0.050000
2023-04-06 02:00:53,993 epoch 64 - iter 1325/2650 - loss 0.02277547 - time (sec): 97.15 - samples/sec: 7517.59 - lr: 0.050000
2023-04-06 02:01:13,129 epoch 64 - iter 1590/2650 - loss 0.02285390 - time (sec): 116.29 - samples/sec: 7538.17 - lr: 0.050000
2023-04-06 02:01:32,920 epoch 64 - iter 1855/2650 - loss 0.02273884 - time (sec): 136.08 - samples/sec: 7528.09 - lr: 0.050000
2023-04-06 02:01:52,455 epoch 64 - iter 2120/2650 - loss 0.02292576 - time (sec): 155.61 - samples/sec: 7523.95 - lr: 0.050000
2023-04-06 02:02:11,476 epoch 64 - iter 2385/2650 - loss 0.02302581 - time (sec): 174.63 - samples/sec: 7537.45 - lr: 0.050000
2023-04-06 02:02:30,604 epoch 64 - iter 2650/2650 - loss 0.02311827 - time (sec): 193.76 - samples/sec: 7547.21 - lr: 0.050000
2023-04-06 02:02:30,605 ----------------------------------------------------------------------------------------------------
2023-04-06 02:02:30,605 EPOCH 64 done: loss 0.0231 - lr 0.050000
2023-04-06 02:02:30,605 BAD EPOCHS (no improvement): 0
2023-04-06 02:02:30,608 ----------------------------------------------------------------------------------------------------
2023-04-06 02:02:49,959 epoch 65 - iter 265/2650 - loss 0.02145897 - time (sec): 19.35 - samples/sec: 7592.50 - lr: 0.050000
2023-04-06 02:03:09,235 epoch 65 - iter 530/2650 - loss 0.02177615 - time (sec): 38.63 - samples/sec: 7632.40 - lr: 0.050000
2023-04-06 02:03:28,707 epoch 65 - iter 795/2650 - loss 0.02203147 - time (sec): 58.10 - samples/sec: 7592.61 - lr: 0.050000
2023-04-06 02:03:48,187 epoch 65 - iter 1060/2650 - loss 0.02216671 - time (sec): 77.58 - samples/sec: 7582.59 - lr: 0.050000
2023-04-06 02:04:07,463 epoch 65 - iter 1325/2650 - loss 0.02224406 - time (sec): 96.85 - samples/sec: 7574.96 - lr: 0.050000
2023-04-06 02:04:27,519 epoch 65 - iter 1590/2650 - loss 0.02235511 - time (sec): 116.91 - samples/sec: 7541.14 - lr: 0.050000
2023-04-06 02:04:46,972 epoch 65 - iter 1855/2650 - loss 0.02269270 - time (sec): 136.36 - samples/sec: 7535.08 - lr: 0.050000
2023-04-06 02:05:05,650 epoch 65 - iter 2120/2650 - loss 0.02296573 - time (sec): 155.04 - samples/sec: 7553.61 - lr: 0.050000
2023-04-06 02:05:24,696 epoch 65 - iter 2385/2650 - loss 0.02303537 - time (sec): 174.09 - samples/sec: 7561.03 - lr: 0.050000
2023-04-06 02:05:44,097 epoch 65 - iter 2650/2650 - loss 0.02305247 - time (sec): 193.49 - samples/sec: 7557.88 - lr: 0.050000
2023-04-06 02:05:44,097 ----------------------------------------------------------------------------------------------------
2023-04-06 02:05:44,098 EPOCH 65 done: loss 0.0231 - lr 0.050000
2023-04-06 02:05:44,098 BAD EPOCHS (no improvement): 0
2023-04-06 02:05:44,104 ----------------------------------------------------------------------------------------------------
2023-04-06 02:06:03,358 epoch 66 - iter 265/2650 - loss 0.02208292 - time (sec): 19.25 - samples/sec: 7588.07 - lr: 0.050000
2023-04-06 02:06:22,953 epoch 66 - iter 530/2650 - loss 0.02263528 - time (sec): 38.85 - samples/sec: 7542.83 - lr: 0.050000
2023-04-06 02:06:42,879 epoch 66 - iter 795/2650 - loss 0.02226817 - time (sec): 58.77 - samples/sec: 7504.66 - lr: 0.050000
2023-04-06 02:07:02,304 epoch 66 - iter 1060/2650 - loss 0.02280661 - time (sec): 78.20 - samples/sec: 7498.00 - lr: 0.050000
2023-04-06 02:07:21,646 epoch 66 - iter 1325/2650 - loss 0.02293911 - time (sec): 97.54 - samples/sec: 7509.08 - lr: 0.050000
2023-04-06 02:07:41,504 epoch 66 - iter 1590/2650 - loss 0.02291120 - time (sec): 117.40 - samples/sec: 7479.78 - lr: 0.050000
2023-04-06 02:08:00,381 epoch 66 - iter 1855/2650 - loss 0.02294978 - time (sec): 136.28 - samples/sec: 7515.20 - lr: 0.050000
2023-04-06 02:08:19,847 epoch 66 - iter 2120/2650 - loss 0.02298986 - time (sec): 155.74 - samples/sec: 7508.62 - lr: 0.050000
2023-04-06 02:08:39,638 epoch 66 - iter 2385/2650 - loss 0.02288823 - time (sec): 175.53 - samples/sec: 7504.41 - lr: 0.050000
2023-04-06 02:08:58,978 epoch 66 - iter 2650/2650 - loss 0.02295924 - time (sec): 194.87 - samples/sec: 7504.13 - lr: 0.050000
2023-04-06 02:08:58,979 ----------------------------------------------------------------------------------------------------
2023-04-06 02:08:58,979 EPOCH 66 done: loss 0.0230 - lr 0.050000
2023-04-06 02:08:58,979 BAD EPOCHS (no improvement): 0
2023-04-06 02:08:58,983 ----------------------------------------------------------------------------------------------------
2023-04-06 02:09:18,543 epoch 67 - iter 265/2650 - loss 0.02270088 - time (sec): 19.56 - samples/sec: 7542.19 - lr: 0.050000
2023-04-06 02:09:38,184 epoch 67 - iter 530/2650 - loss 0.02266399 - time (sec): 39.20 - samples/sec: 7515.29 - lr: 0.050000
2023-04-06 02:09:57,194 epoch 67 - iter 795/2650 - loss 0.02250087 - time (sec): 58.21 - samples/sec: 7568.40 - lr: 0.050000
2023-04-06 02:10:16,038 epoch 67 - iter 1060/2650 - loss 0.02244498 - time (sec): 77.06 - samples/sec: 7594.56 - lr: 0.050000
2023-04-06 02:10:35,481 epoch 67 - iter 1325/2650 - loss 0.02244627 - time (sec): 96.50 - samples/sec: 7570.32 - lr: 0.050000
2023-04-06 02:10:54,900 epoch 67 - iter 1590/2650 - loss 0.02243636 - time (sec): 115.92 - samples/sec: 7551.17 - lr: 0.050000
2023-04-06 02:11:14,566 epoch 67 - iter 1855/2650 - loss 0.02249549 - time (sec): 135.58 - samples/sec: 7534.42 - lr: 0.050000
2023-04-06 02:11:34,013 epoch 67 - iter 2120/2650 - loss 0.02240109 - time (sec): 155.03 - samples/sec: 7534.93 - lr: 0.050000
2023-04-06 02:11:53,475 epoch 67 - iter 2385/2650 - loss 0.02255590 - time (sec): 174.49 - samples/sec: 7539.04 - lr: 0.050000
2023-04-06 02:12:13,168 epoch 67 - iter 2650/2650 - loss 0.02262329 - time (sec): 194.18 - samples/sec: 7530.77 - lr: 0.050000
2023-04-06 02:12:13,168 ----------------------------------------------------------------------------------------------------
2023-04-06 02:12:13,168 EPOCH 67 done: loss 0.0226 - lr 0.050000
2023-04-06 02:12:13,168 BAD EPOCHS (no improvement): 0
2023-04-06 02:12:13,172 ----------------------------------------------------------------------------------------------------
2023-04-06 02:12:32,814 epoch 68 - iter 265/2650 - loss 0.02206876 - time (sec): 19.64 - samples/sec: 7495.31 - lr: 0.050000
2023-04-06 02:12:52,286 epoch 68 - iter 530/2650 - loss 0.02203861 - time (sec): 39.11 - samples/sec: 7500.05 - lr: 0.050000
2023-04-06 02:13:12,383 epoch 68 - iter 795/2650 - loss 0.02227440 - time (sec): 59.21 - samples/sec: 7442.11 - lr: 0.050000
2023-04-06 02:13:31,791 epoch 68 - iter 1060/2650 - loss 0.02262573 - time (sec): 78.62 - samples/sec: 7470.98 - lr: 0.050000
2023-04-06 02:13:51,208 epoch 68 - iter 1325/2650 - loss 0.02273658 - time (sec): 98.04 - samples/sec: 7491.21 - lr: 0.050000
2023-04-06 02:14:10,333 epoch 68 - iter 1590/2650 - loss 0.02261569 - time (sec): 117.16 - samples/sec: 7496.80 - lr: 0.050000
2023-04-06 02:14:29,564 epoch 68 - iter 1855/2650 - loss 0.02274152 - time (sec): 136.39 - samples/sec: 7508.86 - lr: 0.050000
2023-04-06 02:14:48,371 epoch 68 - iter 2120/2650 - loss 0.02262592 - time (sec): 155.20 - samples/sec: 7519.42 - lr: 0.050000
2023-04-06 02:15:08,258 epoch 68 - iter 2385/2650 - loss 0.02274996 - time (sec): 175.09 - samples/sec: 7505.90 - lr: 0.050000
2023-04-06 02:15:27,863 epoch 68 - iter 2650/2650 - loss 0.02265242 - time (sec): 194.69 - samples/sec: 7511.21 - lr: 0.050000
2023-04-06 02:15:27,863 ----------------------------------------------------------------------------------------------------
2023-04-06 02:15:27,863 EPOCH 68 done: loss 0.0227 - lr 0.050000
2023-04-06 02:15:27,863 BAD EPOCHS (no improvement): 1
2023-04-06 02:15:27,868 ----------------------------------------------------------------------------------------------------
2023-04-06 02:15:47,380 epoch 69 - iter 265/2650 - loss 0.02158817 - time (sec): 19.51 - samples/sec: 7505.22 - lr: 0.050000
2023-04-06 02:16:06,515 epoch 69 - iter 530/2650 - loss 0.02168226 - time (sec): 38.65 - samples/sec: 7531.70 - lr: 0.050000
2023-04-06 02:16:26,586 epoch 69 - iter 795/2650 - loss 0.02192219 - time (sec): 58.72 - samples/sec: 7484.84 - lr: 0.050000
2023-04-06 02:16:45,822 epoch 69 - iter 1060/2650 - loss 0.02191979 - time (sec): 77.95 - samples/sec: 7515.55 - lr: 0.050000
2023-04-06 02:17:05,411 epoch 69 - iter 1325/2650 - loss 0.02210794 - time (sec): 97.54 - samples/sec: 7511.52 - lr: 0.050000
2023-04-06 02:17:24,756 epoch 69 - iter 1590/2650 - loss 0.02218762 - time (sec): 116.89 - samples/sec: 7524.19 - lr: 0.050000
2023-04-06 02:17:44,480 epoch 69 - iter 1855/2650 - loss 0.02234665 - time (sec): 136.61 - samples/sec: 7505.18 - lr: 0.050000
2023-04-06 02:18:03,276 epoch 69 - iter 2120/2650 - loss 0.02240301 - time (sec): 155.41 - samples/sec: 7527.95 - lr: 0.050000
2023-04-06 02:18:22,368 epoch 69 - iter 2385/2650 - loss 0.02257441 - time (sec): 174.50 - samples/sec: 7541.12 - lr: 0.050000
2023-04-06 02:18:42,046 epoch 69 - iter 2650/2650 - loss 0.02257588 - time (sec): 194.18 - samples/sec: 7531.03 - lr: 0.050000
2023-04-06 02:18:42,046 ----------------------------------------------------------------------------------------------------
2023-04-06 02:18:42,047 EPOCH 69 done: loss 0.0226 - lr 0.050000
2023-04-06 02:18:42,047 BAD EPOCHS (no improvement): 0
2023-04-06 02:18:42,050 ----------------------------------------------------------------------------------------------------
2023-04-06 02:19:01,031 epoch 70 - iter 265/2650 - loss 0.02203687 - time (sec): 18.98 - samples/sec: 7696.80 - lr: 0.050000
2023-04-06 02:19:20,687 epoch 70 - iter 530/2650 - loss 0.02185210 - time (sec): 38.64 - samples/sec: 7574.17 - lr: 0.050000
2023-04-06 02:19:39,553 epoch 70 - iter 795/2650 - loss 0.02205071 - time (sec): 57.50 - samples/sec: 7618.20 - lr: 0.050000
2023-04-06 02:19:59,726 epoch 70 - iter 1060/2650 - loss 0.02188608 - time (sec): 77.68 - samples/sec: 7535.11 - lr: 0.050000
2023-04-06 02:20:19,575 epoch 70 - iter 1325/2650 - loss 0.02202391 - time (sec): 97.52 - samples/sec: 7516.88 - lr: 0.050000
2023-04-06 02:20:39,578 epoch 70 - iter 1590/2650 - loss 0.02203704 - time (sec): 117.53 - samples/sec: 7483.12 - lr: 0.050000
2023-04-06 02:20:59,101 epoch 70 - iter 1855/2650 - loss 0.02213297 - time (sec): 137.05 - samples/sec: 7482.42 - lr: 0.050000
2023-04-06 02:21:17,841 epoch 70 - iter 2120/2650 - loss 0.02231277 - time (sec): 155.79 - samples/sec: 7514.31 - lr: 0.050000
2023-04-06 02:21:36,651 epoch 70 - iter 2385/2650 - loss 0.02240361 - time (sec): 174.60 - samples/sec: 7536.55 - lr: 0.050000
2023-04-06 02:21:56,093 epoch 70 - iter 2650/2650 - loss 0.02241748 - time (sec): 194.04 - samples/sec: 7536.27 - lr: 0.050000
2023-04-06 02:21:56,094 ----------------------------------------------------------------------------------------------------
2023-04-06 02:21:56,094 EPOCH 70 done: loss 0.0224 - lr 0.050000
2023-04-06 02:21:56,094 BAD EPOCHS (no improvement): 0
2023-04-06 02:21:56,097 ----------------------------------------------------------------------------------------------------
2023-04-06 02:22:14,933 epoch 71 - iter 265/2650 - loss 0.02190180 - time (sec): 18.84 - samples/sec: 7672.98 - lr: 0.050000
2023-04-06 02:22:34,378 epoch 71 - iter 530/2650 - loss 0.02177919 - time (sec): 38.28 - samples/sec: 7597.54 - lr: 0.050000
2023-04-06 02:22:53,376 epoch 71 - iter 795/2650 - loss 0.02250363 - time (sec): 57.28 - samples/sec: 7622.24 - lr: 0.050000
2023-04-06 02:23:12,966 epoch 71 - iter 1060/2650 - loss 0.02258949 - time (sec): 76.87 - samples/sec: 7565.81 - lr: 0.050000
2023-04-06 02:23:32,920 epoch 71 - iter 1325/2650 - loss 0.02256175 - time (sec): 96.82 - samples/sec: 7543.42 - lr: 0.050000
2023-04-06 02:23:52,879 epoch 71 - iter 1590/2650 - loss 0.02268694 - time (sec): 116.78 - samples/sec: 7522.15 - lr: 0.050000
2023-04-06 02:24:11,933 epoch 71 - iter 1855/2650 - loss 0.02257947 - time (sec): 135.84 - samples/sec: 7542.70 - lr: 0.050000
2023-04-06 02:24:31,288 epoch 71 - iter 2120/2650 - loss 0.02260316 - time (sec): 155.19 - samples/sec: 7552.96 - lr: 0.050000
2023-04-06 02:24:50,876 epoch 71 - iter 2385/2650 - loss 0.02261346 - time (sec): 174.78 - samples/sec: 7542.28 - lr: 0.050000
2023-04-06 02:25:10,271 epoch 71 - iter 2650/2650 - loss 0.02253409 - time (sec): 194.17 - samples/sec: 7531.20 - lr: 0.050000
2023-04-06 02:25:10,272 ----------------------------------------------------------------------------------------------------
2023-04-06 02:25:10,272 EPOCH 71 done: loss 0.0225 - lr 0.050000
2023-04-06 02:25:10,272 BAD EPOCHS (no improvement): 1
2023-04-06 02:25:10,275 ----------------------------------------------------------------------------------------------------
2023-04-06 02:25:29,751 epoch 72 - iter 265/2650 - loss 0.02058375 - time (sec): 19.48 - samples/sec: 7532.05 - lr: 0.050000
2023-04-06 02:25:49,234 epoch 72 - iter 530/2650 - loss 0.02161829 - time (sec): 38.96 - samples/sec: 7529.28 - lr: 0.050000
2023-04-06 02:26:08,907 epoch 72 - iter 795/2650 - loss 0.02180041 - time (sec): 58.63 - samples/sec: 7512.36 - lr: 0.050000
2023-04-06 02:26:28,079 epoch 72 - iter 1060/2650 - loss 0.02164555 - time (sec): 77.80 - samples/sec: 7543.19 - lr: 0.050000
2023-04-06 02:26:47,524 epoch 72 - iter 1325/2650 - loss 0.02125576 - time (sec): 97.25 - samples/sec: 7538.35 - lr: 0.050000
2023-04-06 02:27:06,980 epoch 72 - iter 1590/2650 - loss 0.02137661 - time (sec): 116.71 - samples/sec: 7540.21 - lr: 0.050000
2023-04-06 02:27:25,804 epoch 72 - iter 1855/2650 - loss 0.02146511 - time (sec): 135.53 - samples/sec: 7564.82 - lr: 0.050000
2023-04-06 02:27:45,028 epoch 72 - iter 2120/2650 - loss 0.02164974 - time (sec): 154.75 - samples/sec: 7559.42 - lr: 0.050000
2023-04-06 02:28:04,262 epoch 72 - iter 2385/2650 - loss 0.02186554 - time (sec): 173.99 - samples/sec: 7557.59 - lr: 0.050000
2023-04-06 02:28:23,415 epoch 72 - iter 2650/2650 - loss 0.02192431 - time (sec): 193.14 - samples/sec: 7571.54 - lr: 0.050000
2023-04-06 02:28:23,415 ----------------------------------------------------------------------------------------------------
2023-04-06 02:28:23,415 EPOCH 72 done: loss 0.0219 - lr 0.050000
2023-04-06 02:28:23,415 BAD EPOCHS (no improvement): 0
2023-04-06 02:28:23,418 ----------------------------------------------------------------------------------------------------
2023-04-06 02:28:42,660 epoch 73 - iter 265/2650 - loss 0.02040867 - time (sec): 19.24 - samples/sec: 7601.41 - lr: 0.050000
2023-04-06 02:29:02,420 epoch 73 - iter 530/2650 - loss 0.02108392 - time (sec): 39.00 - samples/sec: 7541.29 - lr: 0.050000
2023-04-06 02:29:22,037 epoch 73 - iter 795/2650 - loss 0.02132792 - time (sec): 58.62 - samples/sec: 7506.55 - lr: 0.050000
2023-04-06 02:29:41,838 epoch 73 - iter 1060/2650 - loss 0.02141061 - time (sec): 78.42 - samples/sec: 7494.64 - lr: 0.050000
2023-04-06 02:30:01,037 epoch 73 - iter 1325/2650 - loss 0.02164467 - time (sec): 97.62 - samples/sec: 7518.71 - lr: 0.050000
2023-04-06 02:30:20,164 epoch 73 - iter 1590/2650 - loss 0.02182246 - time (sec): 116.75 - samples/sec: 7524.69 - lr: 0.050000
2023-04-06 02:30:39,222 epoch 73 - iter 1855/2650 - loss 0.02194189 - time (sec): 135.80 - samples/sec: 7542.48 - lr: 0.050000
2023-04-06 02:30:58,066 epoch 73 - iter 2120/2650 - loss 0.02203676 - time (sec): 154.65 - samples/sec: 7567.84 - lr: 0.050000
2023-04-06 02:31:17,264 epoch 73 - iter 2385/2650 - loss 0.02213209 - time (sec): 173.85 - samples/sec: 7572.98 - lr: 0.050000
2023-04-06 02:31:36,473 epoch 73 - iter 2650/2650 - loss 0.02218015 - time (sec): 193.05 - samples/sec: 7574.87 - lr: 0.050000
2023-04-06 02:31:36,473 ----------------------------------------------------------------------------------------------------
2023-04-06 02:31:36,473 EPOCH 73 done: loss 0.0222 - lr 0.050000
2023-04-06 02:31:36,473 BAD EPOCHS (no improvement): 1
2023-04-06 02:31:36,476 ----------------------------------------------------------------------------------------------------
2023-04-06 02:31:56,197 epoch 74 - iter 265/2650 - loss 0.02288612 - time (sec): 19.72 - samples/sec: 7437.45 - lr: 0.050000
2023-04-06 02:32:15,648 epoch 74 - iter 530/2650 - loss 0.02176070 - time (sec): 39.17 - samples/sec: 7491.16 - lr: 0.050000
2023-04-06 02:32:35,321 epoch 74 - iter 795/2650 - loss 0.02135439 - time (sec): 58.84 - samples/sec: 7462.87 - lr: 0.050000
2023-04-06 02:32:54,653 epoch 74 - iter 1060/2650 - loss 0.02141306 - time (sec): 78.18 - samples/sec: 7500.81 - lr: 0.050000
2023-04-06 02:33:14,067 epoch 74 - iter 1325/2650 - loss 0.02122902 - time (sec): 97.59 - samples/sec: 7505.46 - lr: 0.050000
2023-04-06 02:33:43,049 epoch 74 - iter 1590/2650 - loss 0.02158316 - time (sec): 126.57 - samples/sec: 6956.66 - lr: 0.050000
2023-04-06 02:34:01,816 epoch 74 - iter 1855/2650 - loss 0.02152591 - time (sec): 145.34 - samples/sec: 7056.11 - lr: 0.050000
2023-04-06 02:34:20,988 epoch 74 - iter 2120/2650 - loss 0.02162005 - time (sec): 164.51 - samples/sec: 7117.86 - lr: 0.050000
2023-04-06 02:34:40,162 epoch 74 - iter 2385/2650 - loss 0.02176786 - time (sec): 183.69 - samples/sec: 7170.50 - lr: 0.050000
2023-04-06 02:34:59,047 epoch 74 - iter 2650/2650 - loss 0.02169257 - time (sec): 202.57 - samples/sec: 7219.02 - lr: 0.050000
2023-04-06 02:34:59,047 ----------------------------------------------------------------------------------------------------
2023-04-06 02:34:59,047 EPOCH 74 done: loss 0.0217 - lr 0.050000
2023-04-06 02:34:59,047 BAD EPOCHS (no improvement): 0
2023-04-06 02:34:59,051 ----------------------------------------------------------------------------------------------------
2023-04-06 02:35:18,418 epoch 75 - iter 265/2650 - loss 0.02188341 - time (sec): 19.37 - samples/sec: 7538.19 - lr: 0.050000
2023-04-06 02:35:38,179 epoch 75 - iter 530/2650 - loss 0.02203009 - time (sec): 39.13 - samples/sec: 7491.93 - lr: 0.050000
2023-04-06 02:35:57,305 epoch 75 - iter 795/2650 - loss 0.02187496 - time (sec): 58.25 - samples/sec: 7523.01 - lr: 0.050000
2023-04-06 02:36:16,041 epoch 75 - iter 1060/2650 - loss 0.02193217 - time (sec): 76.99 - samples/sec: 7576.77 - lr: 0.050000
2023-04-06 02:36:35,088 epoch 75 - iter 1325/2650 - loss 0.02178260 - time (sec): 96.04 - samples/sec: 7587.68 - lr: 0.050000
2023-04-06 02:36:54,778 epoch 75 - iter 1590/2650 - loss 0.02194509 - time (sec): 115.73 - samples/sec: 7571.82 - lr: 0.050000
2023-04-06 02:37:13,715 epoch 75 - iter 1855/2650 - loss 0.02176340 - time (sec): 134.66 - samples/sec: 7592.99 - lr: 0.050000
2023-04-06 02:37:33,973 epoch 75 - iter 2120/2650 - loss 0.02173343 - time (sec): 154.92 - samples/sec: 7556.28 - lr: 0.050000
2023-04-06 02:37:52,839 epoch 75 - iter 2385/2650 - loss 0.02170370 - time (sec): 173.79 - samples/sec: 7567.58 - lr: 0.050000
2023-04-06 02:38:12,708 epoch 75 - iter 2650/2650 - loss 0.02190977 - time (sec): 193.66 - samples/sec: 7551.32 - lr: 0.050000
2023-04-06 02:38:12,708 ----------------------------------------------------------------------------------------------------
2023-04-06 02:38:12,708 EPOCH 75 done: loss 0.0219 - lr 0.050000
2023-04-06 02:38:12,709 BAD EPOCHS (no improvement): 1
2023-04-06 02:38:12,712 ----------------------------------------------------------------------------------------------------
2023-04-06 02:38:31,731 epoch 76 - iter 265/2650 - loss 0.02082060 - time (sec): 19.02 - samples/sec: 7695.36 - lr: 0.050000
2023-04-06 02:38:50,963 epoch 76 - iter 530/2650 - loss 0.02118057 - time (sec): 38.25 - samples/sec: 7584.77 - lr: 0.050000
2023-04-06 02:39:10,148 epoch 76 - iter 795/2650 - loss 0.02085589 - time (sec): 57.44 - samples/sec: 7594.63 - lr: 0.050000
2023-04-06 02:39:29,430 epoch 76 - iter 1060/2650 - loss 0.02099816 - time (sec): 76.72 - samples/sec: 7619.33 - lr: 0.050000
2023-04-06 02:39:49,204 epoch 76 - iter 1325/2650 - loss 0.02113610 - time (sec): 96.49 - samples/sec: 7593.02 - lr: 0.050000
2023-04-06 02:40:08,216 epoch 76 - iter 1590/2650 - loss 0.02113944 - time (sec): 115.50 - samples/sec: 7601.32 - lr: 0.050000
2023-04-06 02:40:27,793 epoch 76 - iter 1855/2650 - loss 0.02116299 - time (sec): 135.08 - samples/sec: 7587.09 - lr: 0.050000
2023-04-06 02:40:46,902 epoch 76 - iter 2120/2650 - loss 0.02112250 - time (sec): 154.19 - samples/sec: 7582.46 - lr: 0.050000
2023-04-06 02:41:05,938 epoch 76 - iter 2385/2650 - loss 0.02119870 - time (sec): 173.23 - samples/sec: 7591.27 - lr: 0.050000
2023-04-06 02:41:25,549 epoch 76 - iter 2650/2650 - loss 0.02131471 - time (sec): 192.84 - samples/sec: 7583.44 - lr: 0.050000
2023-04-06 02:41:25,549 ----------------------------------------------------------------------------------------------------
2023-04-06 02:41:25,549 EPOCH 76 done: loss 0.0213 - lr 0.050000
2023-04-06 02:41:25,549 BAD EPOCHS (no improvement): 0
2023-04-06 02:41:25,553 ----------------------------------------------------------------------------------------------------
2023-04-06 02:41:44,038 epoch 77 - iter 265/2650 - loss 0.02037441 - time (sec): 18.48 - samples/sec: 7805.79 - lr: 0.050000
2023-04-06 02:42:03,501 epoch 77 - iter 530/2650 - loss 0.02150523 - time (sec): 37.95 - samples/sec: 7681.25 - lr: 0.050000
2023-04-06 02:42:22,702 epoch 77 - iter 795/2650 - loss 0.02170566 - time (sec): 57.15 - samples/sec: 7641.01 - lr: 0.050000
2023-04-06 02:42:41,801 epoch 77 - iter 1060/2650 - loss 0.02184497 - time (sec): 76.25 - samples/sec: 7637.36 - lr: 0.050000
2023-04-06 02:43:01,022 epoch 77 - iter 1325/2650 - loss 0.02182261 - time (sec): 95.47 - samples/sec: 7639.35 - lr: 0.050000
2023-04-06 02:43:20,771 epoch 77 - iter 1590/2650 - loss 0.02190412 - time (sec): 115.22 - samples/sec: 7610.74 - lr: 0.050000
2023-04-06 02:43:40,245 epoch 77 - iter 1855/2650 - loss 0.02186369 - time (sec): 134.69 - samples/sec: 7596.49 - lr: 0.050000
2023-04-06 02:43:59,443 epoch 77 - iter 2120/2650 - loss 0.02182253 - time (sec): 153.89 - samples/sec: 7604.75 - lr: 0.050000
2023-04-06 02:44:18,788 epoch 77 - iter 2385/2650 - loss 0.02178251 - time (sec): 173.23 - samples/sec: 7592.76 - lr: 0.050000
2023-04-06 02:44:38,653 epoch 77 - iter 2650/2650 - loss 0.02176863 - time (sec): 193.10 - samples/sec: 7573.12 - lr: 0.050000
2023-04-06 02:44:38,653 ----------------------------------------------------------------------------------------------------
2023-04-06 02:44:38,653 EPOCH 77 done: loss 0.0218 - lr 0.050000
2023-04-06 02:44:38,653 BAD EPOCHS (no improvement): 1
2023-04-06 02:44:38,657 ----------------------------------------------------------------------------------------------------
2023-04-06 02:44:57,786 epoch 78 - iter 265/2650 - loss 0.02005819 - time (sec): 19.13 - samples/sec: 7634.22 - lr: 0.050000
2023-04-06 02:45:17,192 epoch 78 - iter 530/2650 - loss 0.02070203 - time (sec): 38.53 - samples/sec: 7620.82 - lr: 0.050000
2023-04-06 02:45:36,732 epoch 78 - iter 795/2650 - loss 0.02121885 - time (sec): 58.08 - samples/sec: 7586.78 - lr: 0.050000
2023-04-06 02:45:56,151 epoch 78 - iter 1060/2650 - loss 0.02145587 - time (sec): 77.49 - samples/sec: 7571.92 - lr: 0.050000
2023-04-06 02:46:16,077 epoch 78 - iter 1325/2650 - loss 0.02159165 - time (sec): 97.42 - samples/sec: 7535.74 - lr: 0.050000
2023-04-06 02:46:35,260 epoch 78 - iter 1590/2650 - loss 0.02140849 - time (sec): 116.60 - samples/sec: 7542.94 - lr: 0.050000
2023-04-06 02:46:54,581 epoch 78 - iter 1855/2650 - loss 0.02138362 - time (sec): 135.92 - samples/sec: 7552.01 - lr: 0.050000
2023-04-06 02:47:13,846 epoch 78 - iter 2120/2650 - loss 0.02128844 - time (sec): 155.19 - samples/sec: 7555.89 - lr: 0.050000
2023-04-06 02:47:33,078 epoch 78 - iter 2385/2650 - loss 0.02125868 - time (sec): 174.42 - samples/sec: 7556.21 - lr: 0.050000
2023-04-06 02:47:52,259 epoch 78 - iter 2650/2650 - loss 0.02144732 - time (sec): 193.60 - samples/sec: 7553.46 - lr: 0.050000
2023-04-06 02:47:52,259 ----------------------------------------------------------------------------------------------------
2023-04-06 02:47:52,259 EPOCH 78 done: loss 0.0214 - lr 0.050000
2023-04-06 02:47:52,259 BAD EPOCHS (no improvement): 2
2023-04-06 02:47:52,263 ----------------------------------------------------------------------------------------------------
2023-04-06 02:48:11,542 epoch 79 - iter 265/2650 - loss 0.02137987 - time (sec): 19.28 - samples/sec: 7635.29 - lr: 0.050000
2023-04-06 02:48:30,890 epoch 79 - iter 530/2650 - loss 0.02108487 - time (sec): 38.63 - samples/sec: 7573.68 - lr: 0.050000
2023-04-06 02:48:49,807 epoch 79 - iter 795/2650 - loss 0.02117952 - time (sec): 57.54 - samples/sec: 7637.53 - lr: 0.050000
2023-04-06 02:49:08,729 epoch 79 - iter 1060/2650 - loss 0.02127776 - time (sec): 76.47 - samples/sec: 7645.05 - lr: 0.050000
2023-04-06 02:49:28,292 epoch 79 - iter 1325/2650 - loss 0.02148286 - time (sec): 96.03 - samples/sec: 7610.92 - lr: 0.050000
2023-04-06 02:49:47,474 epoch 79 - iter 1590/2650 - loss 0.02128983 - time (sec): 115.21 - samples/sec: 7600.67 - lr: 0.050000
2023-04-06 02:50:07,477 epoch 79 - iter 1855/2650 - loss 0.02132846 - time (sec): 135.21 - samples/sec: 7570.59 - lr: 0.050000
2023-04-06 02:50:26,920 epoch 79 - iter 2120/2650 - loss 0.02132384 - time (sec): 154.66 - samples/sec: 7560.13 - lr: 0.050000
2023-04-06 02:50:46,565 epoch 79 - iter 2385/2650 - loss 0.02136009 - time (sec): 174.30 - samples/sec: 7555.75 - lr: 0.050000
2023-04-06 02:51:05,749 epoch 79 - iter 2650/2650 - loss 0.02138881 - time (sec): 193.49 - samples/sec: 7557.98 - lr: 0.050000
2023-04-06 02:51:05,749 ----------------------------------------------------------------------------------------------------
2023-04-06 02:51:05,749 EPOCH 79 done: loss 0.0214 - lr 0.050000
2023-04-06 02:51:05,749 BAD EPOCHS (no improvement): 3
2023-04-06 02:51:05,752 ----------------------------------------------------------------------------------------------------
2023-04-06 02:51:25,370 epoch 80 - iter 265/2650 - loss 0.02085673 - time (sec): 19.62 - samples/sec: 7574.79 - lr: 0.050000
2023-04-06 02:51:45,238 epoch 80 - iter 530/2650 - loss 0.02161601 - time (sec): 39.49 - samples/sec: 7481.45 - lr: 0.050000
2023-04-06 02:52:04,892 epoch 80 - iter 795/2650 - loss 0.02164308 - time (sec): 59.14 - samples/sec: 7509.44 - lr: 0.050000
2023-04-06 02:52:23,525 epoch 80 - iter 1060/2650 - loss 0.02155891 - time (sec): 77.77 - samples/sec: 7570.04 - lr: 0.050000
2023-04-06 02:52:43,442 epoch 80 - iter 1325/2650 - loss 0.02139747 - time (sec): 97.69 - samples/sec: 7524.85 - lr: 0.050000
2023-04-06 02:53:02,386 epoch 80 - iter 1590/2650 - loss 0.02128583 - time (sec): 116.63 - samples/sec: 7542.75 - lr: 0.050000
2023-04-06 02:53:21,223 epoch 80 - iter 1855/2650 - loss 0.02123327 - time (sec): 135.47 - samples/sec: 7568.87 - lr: 0.050000
2023-04-06 02:53:39,954 epoch 80 - iter 2120/2650 - loss 0.02121860 - time (sec): 154.20 - samples/sec: 7579.95 - lr: 0.050000
2023-04-06 02:53:59,345 epoch 80 - iter 2385/2650 - loss 0.02114601 - time (sec): 173.59 - samples/sec: 7582.08 - lr: 0.050000
2023-04-06 02:54:18,721 epoch 80 - iter 2650/2650 - loss 0.02113247 - time (sec): 192.97 - samples/sec: 7578.26 - lr: 0.050000
2023-04-06 02:54:18,721 ----------------------------------------------------------------------------------------------------
2023-04-06 02:54:18,721 EPOCH 80 done: loss 0.0211 - lr 0.050000
2023-04-06 02:54:18,721 BAD EPOCHS (no improvement): 0
2023-04-06 02:54:18,724 ----------------------------------------------------------------------------------------------------
2023-04-06 02:54:37,875 epoch 81 - iter 265/2650 - loss 0.02031509 - time (sec): 19.15 - samples/sec: 7562.46 - lr: 0.050000
2023-04-06 02:54:57,385 epoch 81 - iter 530/2650 - loss 0.02033203 - time (sec): 38.66 - samples/sec: 7545.89 - lr: 0.050000
2023-04-06 02:55:17,035 epoch 81 - iter 795/2650 - loss 0.02023107 - time (sec): 58.31 - samples/sec: 7509.93 - lr: 0.050000
2023-04-06 02:55:36,125 epoch 81 - iter 1060/2650 - loss 0.02027924 - time (sec): 77.40 - samples/sec: 7547.45 - lr: 0.050000
2023-04-06 02:55:55,550 epoch 81 - iter 1325/2650 - loss 0.02043279 - time (sec): 96.83 - samples/sec: 7555.71 - lr: 0.050000
2023-04-06 02:56:15,178 epoch 81 - iter 1590/2650 - loss 0.02074000 - time (sec): 116.45 - samples/sec: 7537.63 - lr: 0.050000
2023-04-06 02:56:34,127 epoch 81 - iter 1855/2650 - loss 0.02089810 - time (sec): 135.40 - samples/sec: 7567.10 - lr: 0.050000
2023-04-06 02:56:53,871 epoch 81 - iter 2120/2650 - loss 0.02085589 - time (sec): 155.15 - samples/sec: 7543.74 - lr: 0.050000
2023-04-06 02:57:12,906 epoch 81 - iter 2385/2650 - loss 0.02076801 - time (sec): 174.18 - samples/sec: 7559.36 - lr: 0.050000
2023-04-06 02:57:32,010 epoch 81 - iter 2650/2650 - loss 0.02096402 - time (sec): 193.29 - samples/sec: 7565.82 - lr: 0.050000
2023-04-06 02:57:32,010 ----------------------------------------------------------------------------------------------------
2023-04-06 02:57:32,010 EPOCH 81 done: loss 0.0210 - lr 0.050000
2023-04-06 02:57:32,010 BAD EPOCHS (no improvement): 0
2023-04-06 02:57:32,013 ----------------------------------------------------------------------------------------------------
2023-04-06 02:57:51,491 epoch 82 - iter 265/2650 - loss 0.02072531 - time (sec): 19.48 - samples/sec: 7520.93 - lr: 0.050000
2023-04-06 02:58:11,046 epoch 82 - iter 530/2650 - loss 0.02115192 - time (sec): 39.03 - samples/sec: 7508.94 - lr: 0.050000
2023-04-06 02:58:29,977 epoch 82 - iter 795/2650 - loss 0.02117868 - time (sec): 57.96 - samples/sec: 7557.58 - lr: 0.050000
2023-04-06 02:58:49,769 epoch 82 - iter 1060/2650 - loss 0.02084953 - time (sec): 77.75 - samples/sec: 7527.91 - lr: 0.050000
2023-04-06 02:59:09,113 epoch 82 - iter 1325/2650 - loss 0.02079207 - time (sec): 97.10 - samples/sec: 7547.26 - lr: 0.050000
2023-04-06 02:59:28,544 epoch 82 - iter 1590/2650 - loss 0.02084338 - time (sec): 116.53 - samples/sec: 7540.04 - lr: 0.050000
2023-04-06 02:59:47,806 epoch 82 - iter 1855/2650 - loss 0.02089967 - time (sec): 135.79 - samples/sec: 7541.97 - lr: 0.050000
2023-04-06 03:00:06,825 epoch 82 - iter 2120/2650 - loss 0.02095215 - time (sec): 154.81 - samples/sec: 7553.08 - lr: 0.050000
2023-04-06 03:00:26,343 epoch 82 - iter 2385/2650 - loss 0.02084949 - time (sec): 174.33 - samples/sec: 7553.22 - lr: 0.050000
2023-04-06 03:00:45,472 epoch 82 - iter 2650/2650 - loss 0.02068391 - time (sec): 193.46 - samples/sec: 7559.05 - lr: 0.050000
2023-04-06 03:00:45,472 ----------------------------------------------------------------------------------------------------
2023-04-06 03:00:45,472 EPOCH 82 done: loss 0.0207 - lr 0.050000
2023-04-06 03:00:45,472 BAD EPOCHS (no improvement): 0
2023-04-06 03:00:45,476 ----------------------------------------------------------------------------------------------------
2023-04-06 03:01:05,218 epoch 83 - iter 265/2650 - loss 0.02074123 - time (sec): 19.74 - samples/sec: 7448.19 - lr: 0.050000
2023-04-06 03:01:24,444 epoch 83 - iter 530/2650 - loss 0.02023541 - time (sec): 38.97 - samples/sec: 7524.14 - lr: 0.050000
2023-04-06 03:01:43,599 epoch 83 - iter 795/2650 - loss 0.02037469 - time (sec): 58.12 - samples/sec: 7540.36 - lr: 0.050000
2023-04-06 03:02:02,843 epoch 83 - iter 1060/2650 - loss 0.02079845 - time (sec): 77.37 - samples/sec: 7552.81 - lr: 0.050000
2023-04-06 03:02:22,320 epoch 83 - iter 1325/2650 - loss 0.02110778 - time (sec): 96.84 - samples/sec: 7558.98 - lr: 0.050000
2023-04-06 03:02:41,801 epoch 83 - iter 1590/2650 - loss 0.02082870 - time (sec): 116.33 - samples/sec: 7566.04 - lr: 0.050000
2023-04-06 03:03:00,465 epoch 83 - iter 1855/2650 - loss 0.02085920 - time (sec): 134.99 - samples/sec: 7600.02 - lr: 0.050000
2023-04-06 03:03:19,890 epoch 83 - iter 2120/2650 - loss 0.02093997 - time (sec): 154.41 - samples/sec: 7587.51 - lr: 0.050000
2023-04-06 03:03:39,317 epoch 83 - iter 2385/2650 - loss 0.02100696 - time (sec): 173.84 - samples/sec: 7577.72 - lr: 0.050000
2023-04-06 03:03:59,329 epoch 83 - iter 2650/2650 - loss 0.02104342 - time (sec): 193.85 - samples/sec: 7543.66 - lr: 0.050000
2023-04-06 03:03:59,330 ----------------------------------------------------------------------------------------------------
2023-04-06 03:03:59,330 EPOCH 83 done: loss 0.0210 - lr 0.050000
2023-04-06 03:03:59,330 BAD EPOCHS (no improvement): 1
2023-04-06 03:03:59,334 ----------------------------------------------------------------------------------------------------
2023-04-06 03:04:19,034 epoch 84 - iter 265/2650 - loss 0.02098481 - time (sec): 19.70 - samples/sec: 7441.69 - lr: 0.050000
2023-04-06 03:04:38,111 epoch 84 - iter 530/2650 - loss 0.02085077 - time (sec): 38.78 - samples/sec: 7507.39 - lr: 0.050000
2023-04-06 03:04:57,534 epoch 84 - iter 795/2650 - loss 0.02060763 - time (sec): 58.20 - samples/sec: 7508.78 - lr: 0.050000
2023-04-06 03:05:16,653 epoch 84 - iter 1060/2650 - loss 0.02075731 - time (sec): 77.32 - samples/sec: 7538.07 - lr: 0.050000
2023-04-06 03:05:36,033 epoch 84 - iter 1325/2650 - loss 0.02108550 - time (sec): 96.70 - samples/sec: 7528.45 - lr: 0.050000
2023-04-06 03:05:55,244 epoch 84 - iter 1590/2650 - loss 0.02114228 - time (sec): 115.91 - samples/sec: 7545.31 - lr: 0.050000
2023-04-06 03:06:15,177 epoch 84 - iter 1855/2650 - loss 0.02101618 - time (sec): 135.84 - samples/sec: 7522.49 - lr: 0.050000
2023-04-06 03:06:34,475 epoch 84 - iter 2120/2650 - loss 0.02116557 - time (sec): 155.14 - samples/sec: 7525.41 - lr: 0.050000
2023-04-06 03:06:53,707 epoch 84 - iter 2385/2650 - loss 0.02113085 - time (sec): 174.37 - samples/sec: 7535.89 - lr: 0.050000
2023-04-06 03:07:13,304 epoch 84 - iter 2650/2650 - loss 0.02118390 - time (sec): 193.97 - samples/sec: 7539.13 - lr: 0.050000
2023-04-06 03:07:13,304 ----------------------------------------------------------------------------------------------------
2023-04-06 03:07:13,304 EPOCH 84 done: loss 0.0212 - lr 0.050000
2023-04-06 03:07:13,305 BAD EPOCHS (no improvement): 2
2023-04-06 03:07:13,308 ----------------------------------------------------------------------------------------------------
2023-04-06 03:07:33,233 epoch 85 - iter 265/2650 - loss 0.02052031 - time (sec): 19.92 - samples/sec: 7392.80 - lr: 0.050000
2023-04-06 03:07:52,779 epoch 85 - iter 530/2650 - loss 0.02058148 - time (sec): 39.47 - samples/sec: 7432.20 - lr: 0.050000
2023-04-06 03:08:11,945 epoch 85 - iter 795/2650 - loss 0.02101608 - time (sec): 58.64 - samples/sec: 7476.27 - lr: 0.050000
2023-04-06 03:08:31,032 epoch 85 - iter 1060/2650 - loss 0.02106849 - time (sec): 77.72 - samples/sec: 7528.62 - lr: 0.050000
2023-04-06 03:08:49,824 epoch 85 - iter 1325/2650 - loss 0.02110864 - time (sec): 96.52 - samples/sec: 7553.03 - lr: 0.050000
2023-04-06 03:09:09,204 epoch 85 - iter 1590/2650 - loss 0.02122742 - time (sec): 115.90 - samples/sec: 7560.96 - lr: 0.050000
2023-04-06 03:09:28,800 epoch 85 - iter 1855/2650 - loss 0.02098129 - time (sec): 135.49 - samples/sec: 7548.34 - lr: 0.050000
2023-04-06 03:09:48,358 epoch 85 - iter 2120/2650 - loss 0.02098534 - time (sec): 155.05 - samples/sec: 7542.51 - lr: 0.050000
2023-04-06 03:10:07,365 epoch 85 - iter 2385/2650 - loss 0.02101951 - time (sec): 174.06 - samples/sec: 7558.47 - lr: 0.050000
2023-04-06 03:10:27,024 epoch 85 - iter 2650/2650 - loss 0.02098455 - time (sec): 193.72 - samples/sec: 7549.02 - lr: 0.050000
2023-04-06 03:10:27,025 ----------------------------------------------------------------------------------------------------
2023-04-06 03:10:27,025 EPOCH 85 done: loss 0.0210 - lr 0.050000
2023-04-06 03:10:27,025 BAD EPOCHS (no improvement): 3
2023-04-06 03:10:27,029 ----------------------------------------------------------------------------------------------------
2023-04-06 03:10:46,114 epoch 86 - iter 265/2650 - loss 0.02016689 - time (sec): 19.08 - samples/sec: 7618.50 - lr: 0.050000
2023-04-06 03:11:05,378 epoch 86 - iter 530/2650 - loss 0.02002309 - time (sec): 38.35 - samples/sec: 7595.69 - lr: 0.050000
2023-04-06 03:11:25,165 epoch 86 - iter 795/2650 - loss 0.02004330 - time (sec): 58.14 - samples/sec: 7552.04 - lr: 0.050000
2023-04-06 03:11:44,889 epoch 86 - iter 1060/2650 - loss 0.02052691 - time (sec): 77.86 - samples/sec: 7516.91 - lr: 0.050000
2023-04-06 03:12:04,291 epoch 86 - iter 1325/2650 - loss 0.02051491 - time (sec): 97.26 - samples/sec: 7532.71 - lr: 0.050000
2023-04-06 03:12:23,376 epoch 86 - iter 1590/2650 - loss 0.02042649 - time (sec): 116.35 - samples/sec: 7560.88 - lr: 0.050000
2023-04-06 03:12:42,975 epoch 86 - iter 1855/2650 - loss 0.02071259 - time (sec): 135.95 - samples/sec: 7558.47 - lr: 0.050000
2023-04-06 03:13:02,008 epoch 86 - iter 2120/2650 - loss 0.02079442 - time (sec): 154.98 - samples/sec: 7563.74 - lr: 0.050000
2023-04-06 03:13:20,729 epoch 86 - iter 2385/2650 - loss 0.02077335 - time (sec): 173.70 - samples/sec: 7590.87 - lr: 0.050000
2023-04-06 03:13:39,908 epoch 86 - iter 2650/2650 - loss 0.02087374 - time (sec): 192.88 - samples/sec: 7581.76 - lr: 0.050000
2023-04-06 03:13:39,908 ----------------------------------------------------------------------------------------------------
2023-04-06 03:13:39,909 EPOCH 86 done: loss 0.0209 - lr 0.050000
2023-04-06 03:13:39,909 Epoch 86: reducing learning rate of group 0 to 2.5000e-02.
2023-04-06 03:13:39,909 BAD EPOCHS (no improvement): 4
2023-04-06 03:13:39,912 ----------------------------------------------------------------------------------------------------
2023-04-06 03:13:59,244 epoch 87 - iter 265/2650 - loss 0.01972096 - time (sec): 19.33 - samples/sec: 7599.16 - lr: 0.025000
2023-04-06 03:14:18,836 epoch 87 - iter 530/2650 - loss 0.02021924 - time (sec): 38.92 - samples/sec: 7532.11 - lr: 0.025000
2023-04-06 03:14:38,724 epoch 87 - iter 795/2650 - loss 0.02025133 - time (sec): 58.81 - samples/sec: 7452.60 - lr: 0.025000
2023-04-06 03:14:57,783 epoch 87 - iter 1060/2650 - loss 0.01983906 - time (sec): 77.87 - samples/sec: 7500.80 - lr: 0.025000
2023-04-06 03:15:17,445 epoch 87 - iter 1325/2650 - loss 0.01993750 - time (sec): 97.53 - samples/sec: 7511.42 - lr: 0.025000
2023-04-06 03:15:36,633 epoch 87 - iter 1590/2650 - loss 0.01973408 - time (sec): 116.72 - samples/sec: 7518.21 - lr: 0.025000
2023-04-06 03:15:55,856 epoch 87 - iter 1855/2650 - loss 0.01967572 - time (sec): 135.94 - samples/sec: 7520.00 - lr: 0.025000
2023-04-06 03:16:15,446 epoch 87 - iter 2120/2650 - loss 0.01976136 - time (sec): 155.53 - samples/sec: 7517.87 - lr: 0.025000
2023-04-06 03:16:34,635 epoch 87 - iter 2385/2650 - loss 0.01959570 - time (sec): 174.72 - samples/sec: 7532.82 - lr: 0.025000
2023-04-06 03:16:54,224 epoch 87 - iter 2650/2650 - loss 0.01960481 - time (sec): 194.31 - samples/sec: 7525.85 - lr: 0.025000
2023-04-06 03:16:54,224 ----------------------------------------------------------------------------------------------------
2023-04-06 03:16:54,224 EPOCH 87 done: loss 0.0196 - lr 0.025000
2023-04-06 03:16:54,224 BAD EPOCHS (no improvement): 0
2023-04-06 03:16:54,228 ----------------------------------------------------------------------------------------------------
2023-04-06 03:17:13,682 epoch 88 - iter 265/2650 - loss 0.01854281 - time (sec): 19.45 - samples/sec: 7533.88 - lr: 0.025000
2023-04-06 03:17:33,143 epoch 88 - iter 530/2650 - loss 0.01889028 - time (sec): 38.92 - samples/sec: 7549.68 - lr: 0.025000
2023-04-06 03:17:51,920 epoch 88 - iter 795/2650 - loss 0.01915780 - time (sec): 57.69 - samples/sec: 7628.89 - lr: 0.025000
2023-04-06 03:18:11,425 epoch 88 - iter 1060/2650 - loss 0.01896526 - time (sec): 77.20 - samples/sec: 7602.05 - lr: 0.025000
2023-04-06 03:18:30,447 epoch 88 - iter 1325/2650 - loss 0.01900077 - time (sec): 96.22 - samples/sec: 7617.11 - lr: 0.025000
2023-04-06 03:18:49,519 epoch 88 - iter 1590/2650 - loss 0.01920954 - time (sec): 115.29 - samples/sec: 7615.15 - lr: 0.025000
2023-04-06 03:19:09,761 epoch 88 - iter 1855/2650 - loss 0.01920469 - time (sec): 135.53 - samples/sec: 7570.49 - lr: 0.025000
2023-04-06 03:19:28,581 epoch 88 - iter 2120/2650 - loss 0.01913455 - time (sec): 154.35 - samples/sec: 7588.94 - lr: 0.025000
2023-04-06 03:19:48,277 epoch 88 - iter 2385/2650 - loss 0.01914383 - time (sec): 174.05 - samples/sec: 7568.17 - lr: 0.025000
2023-04-06 03:20:07,844 epoch 88 - iter 2650/2650 - loss 0.01906583 - time (sec): 193.62 - samples/sec: 7552.90 - lr: 0.025000
2023-04-06 03:20:07,844 ----------------------------------------------------------------------------------------------------
2023-04-06 03:20:07,844 EPOCH 88 done: loss 0.0191 - lr 0.025000
2023-04-06 03:20:07,844 BAD EPOCHS (no improvement): 0
2023-04-06 03:20:07,848 ----------------------------------------------------------------------------------------------------
2023-04-06 03:20:27,876 epoch 89 - iter 265/2650 - loss 0.01783701 - time (sec): 20.03 - samples/sec: 7318.19 - lr: 0.025000
2023-04-06 03:20:47,167 epoch 89 - iter 530/2650 - loss 0.01839823 - time (sec): 39.32 - samples/sec: 7421.99 - lr: 0.025000
2023-04-06 03:21:06,504 epoch 89 - iter 795/2650 - loss 0.01853782 - time (sec): 58.66 - samples/sec: 7469.32 - lr: 0.025000
2023-04-06 03:21:25,597 epoch 89 - iter 1060/2650 - loss 0.01832282 - time (sec): 77.75 - samples/sec: 7524.75 - lr: 0.025000
2023-04-06 03:21:45,013 epoch 89 - iter 1325/2650 - loss 0.01843820 - time (sec): 97.16 - samples/sec: 7510.41 - lr: 0.025000
2023-04-06 03:22:04,519 epoch 89 - iter 1590/2650 - loss 0.01826992 - time (sec): 116.67 - samples/sec: 7519.06 - lr: 0.025000
2023-04-06 03:22:24,043 epoch 89 - iter 1855/2650 - loss 0.01829987 - time (sec): 136.20 - samples/sec: 7512.48 - lr: 0.025000
2023-04-06 03:22:43,445 epoch 89 - iter 2120/2650 - loss 0.01844779 - time (sec): 155.60 - samples/sec: 7523.58 - lr: 0.025000
2023-04-06 03:23:02,101 epoch 89 - iter 2385/2650 - loss 0.01843164 - time (sec): 174.25 - samples/sec: 7559.16 - lr: 0.025000
2023-04-06 03:23:21,038 epoch 89 - iter 2650/2650 - loss 0.01851785 - time (sec): 193.19 - samples/sec: 7569.59 - lr: 0.025000
2023-04-06 03:23:21,038 ----------------------------------------------------------------------------------------------------
2023-04-06 03:23:21,038 EPOCH 89 done: loss 0.0185 - lr 0.025000
2023-04-06 03:23:21,038 BAD EPOCHS (no improvement): 0
2023-04-06 03:23:21,041 ----------------------------------------------------------------------------------------------------
2023-04-06 03:23:39,416 epoch 90 - iter 265/2650 - loss 0.01791392 - time (sec): 18.37 - samples/sec: 7864.03 - lr: 0.025000
2023-04-06 03:24:08,280 epoch 90 - iter 530/2650 - loss 0.01830173 - time (sec): 47.24 - samples/sec: 6156.15 - lr: 0.025000
2023-04-06 03:24:27,807 epoch 90 - iter 795/2650 - loss 0.01866672 - time (sec): 66.77 - samples/sec: 6558.85 - lr: 0.025000
2023-04-06 03:24:47,135 epoch 90 - iter 1060/2650 - loss 0.01888757 - time (sec): 86.09 - samples/sec: 6790.83 - lr: 0.025000
2023-04-06 03:25:07,035 epoch 90 - iter 1325/2650 - loss 0.01931051 - time (sec): 105.99 - samples/sec: 6916.52 - lr: 0.025000
2023-04-06 03:25:26,306 epoch 90 - iter 1590/2650 - loss 0.01911919 - time (sec): 125.26 - samples/sec: 7020.37 - lr: 0.025000
2023-04-06 03:25:45,105 epoch 90 - iter 1855/2650 - loss 0.01899358 - time (sec): 144.06 - samples/sec: 7098.02 - lr: 0.025000
2023-04-06 03:26:04,431 epoch 90 - iter 2120/2650 - loss 0.01896251 - time (sec): 163.39 - samples/sec: 7155.28 - lr: 0.025000
2023-04-06 03:26:24,084 epoch 90 - iter 2385/2650 - loss 0.01893267 - time (sec): 183.04 - samples/sec: 7184.12 - lr: 0.025000
2023-04-06 03:26:43,438 epoch 90 - iter 2650/2650 - loss 0.01876960 - time (sec): 202.40 - samples/sec: 7225.24 - lr: 0.025000
2023-04-06 03:26:43,438 ----------------------------------------------------------------------------------------------------
2023-04-06 03:26:43,438 EPOCH 90 done: loss 0.0188 - lr 0.025000
2023-04-06 03:26:43,438 BAD EPOCHS (no improvement): 1
2023-04-06 03:26:43,441 ----------------------------------------------------------------------------------------------------
2023-04-06 03:27:02,338 epoch 91 - iter 265/2650 - loss 0.01871084 - time (sec): 18.90 - samples/sec: 7678.69 - lr: 0.025000
2023-04-06 03:27:20,944 epoch 91 - iter 530/2650 - loss 0.01844487 - time (sec): 37.50 - samples/sec: 7710.32 - lr: 0.025000
2023-04-06 03:27:40,106 epoch 91 - iter 795/2650 - loss 0.01826494 - time (sec): 56.66 - samples/sec: 7679.37 - lr: 0.025000
2023-04-06 03:27:59,168 epoch 91 - iter 1060/2650 - loss 0.01803990 - time (sec): 75.73 - samples/sec: 7678.00 - lr: 0.025000
2023-04-06 03:28:18,968 epoch 91 - iter 1325/2650 - loss 0.01825044 - time (sec): 95.53 - samples/sec: 7627.99 - lr: 0.025000
2023-04-06 03:28:38,079 epoch 91 - iter 1590/2650 - loss 0.01821903 - time (sec): 114.64 - samples/sec: 7620.24 - lr: 0.025000
2023-04-06 03:28:57,989 epoch 91 - iter 1855/2650 - loss 0.01843527 - time (sec): 134.55 - samples/sec: 7603.36 - lr: 0.025000
2023-04-06 03:29:17,613 epoch 91 - iter 2120/2650 - loss 0.01848703 - time (sec): 154.17 - samples/sec: 7593.27 - lr: 0.025000
2023-04-06 03:29:37,045 epoch 91 - iter 2385/2650 - loss 0.01854293 - time (sec): 173.60 - samples/sec: 7590.28 - lr: 0.025000
2023-04-06 03:29:56,019 epoch 91 - iter 2650/2650 - loss 0.01852900 - time (sec): 192.58 - samples/sec: 7593.64 - lr: 0.025000
2023-04-06 03:29:56,019 ----------------------------------------------------------------------------------------------------
2023-04-06 03:29:56,019 EPOCH 91 done: loss 0.0185 - lr 0.025000
2023-04-06 03:29:56,019 BAD EPOCHS (no improvement): 2
2023-04-06 03:29:56,023 ----------------------------------------------------------------------------------------------------
2023-04-06 03:30:15,675 epoch 92 - iter 265/2650 - loss 0.01765611 - time (sec): 19.65 - samples/sec: 7349.68 - lr: 0.025000
2023-04-06 03:30:35,010 epoch 92 - iter 530/2650 - loss 0.01807907 - time (sec): 38.99 - samples/sec: 7476.03 - lr: 0.025000
2023-04-06 03:30:54,183 epoch 92 - iter 795/2650 - loss 0.01799083 - time (sec): 58.16 - samples/sec: 7540.31 - lr: 0.025000
2023-04-06 03:31:14,016 epoch 92 - iter 1060/2650 - loss 0.01819708 - time (sec): 77.99 - samples/sec: 7506.73 - lr: 0.025000
2023-04-06 03:31:33,486 epoch 92 - iter 1325/2650 - loss 0.01842859 - time (sec): 97.46 - samples/sec: 7504.91 - lr: 0.025000
2023-04-06 03:31:52,733 epoch 92 - iter 1590/2650 - loss 0.01829982 - time (sec): 116.71 - samples/sec: 7524.04 - lr: 0.025000
2023-04-06 03:32:11,655 epoch 92 - iter 1855/2650 - loss 0.01844895 - time (sec): 135.63 - samples/sec: 7543.81 - lr: 0.025000
2023-04-06 03:32:30,757 epoch 92 - iter 2120/2650 - loss 0.01833490 - time (sec): 154.73 - samples/sec: 7557.65 - lr: 0.025000
2023-04-06 03:32:50,151 epoch 92 - iter 2385/2650 - loss 0.01825138 - time (sec): 174.13 - samples/sec: 7561.44 - lr: 0.025000
2023-04-06 03:33:09,239 epoch 92 - iter 2650/2650 - loss 0.01821018 - time (sec): 193.22 - samples/sec: 7568.56 - lr: 0.025000
2023-04-06 03:33:09,239 ----------------------------------------------------------------------------------------------------
2023-04-06 03:33:09,239 EPOCH 92 done: loss 0.0182 - lr 0.025000
2023-04-06 03:33:09,239 BAD EPOCHS (no improvement): 0
2023-04-06 03:33:09,243 ----------------------------------------------------------------------------------------------------
2023-04-06 03:33:28,395 epoch 93 - iter 265/2650 - loss 0.01813219 - time (sec): 19.15 - samples/sec: 7659.04 - lr: 0.025000
2023-04-06 03:33:47,818 epoch 93 - iter 530/2650 - loss 0.01819734 - time (sec): 38.57 - samples/sec: 7615.66 - lr: 0.025000
2023-04-06 03:34:07,376 epoch 93 - iter 795/2650 - loss 0.01843689 - time (sec): 58.13 - samples/sec: 7577.98 - lr: 0.025000
2023-04-06 03:34:26,191 epoch 93 - iter 1060/2650 - loss 0.01808521 - time (sec): 76.95 - samples/sec: 7591.28 - lr: 0.025000
2023-04-06 03:34:45,098 epoch 93 - iter 1325/2650 - loss 0.01813144 - time (sec): 95.85 - samples/sec: 7621.03 - lr: 0.025000
2023-04-06 03:35:04,413 epoch 93 - iter 1590/2650 - loss 0.01802458 - time (sec): 115.17 - samples/sec: 7615.69 - lr: 0.025000
2023-04-06 03:35:23,887 epoch 93 - iter 1855/2650 - loss 0.01805462 - time (sec): 134.64 - samples/sec: 7608.42 - lr: 0.025000
2023-04-06 03:35:43,266 epoch 93 - iter 2120/2650 - loss 0.01814312 - time (sec): 154.02 - samples/sec: 7602.00 - lr: 0.025000
2023-04-06 03:36:02,471 epoch 93 - iter 2385/2650 - loss 0.01831666 - time (sec): 173.23 - samples/sec: 7594.17 - lr: 0.025000
2023-04-06 03:36:22,116 epoch 93 - iter 2650/2650 - loss 0.01820257 - time (sec): 192.87 - samples/sec: 7582.03 - lr: 0.025000
2023-04-06 03:36:22,116 ----------------------------------------------------------------------------------------------------
2023-04-06 03:36:22,116 EPOCH 93 done: loss 0.0182 - lr 0.025000
2023-04-06 03:36:22,116 BAD EPOCHS (no improvement): 0
2023-04-06 03:36:22,120 ----------------------------------------------------------------------------------------------------
2023-04-06 03:36:41,318 epoch 94 - iter 265/2650 - loss 0.01711928 - time (sec): 19.20 - samples/sec: 7532.56 - lr: 0.025000
2023-04-06 03:37:00,262 epoch 94 - iter 530/2650 - loss 0.01787308 - time (sec): 38.14 - samples/sec: 7594.06 - lr: 0.025000
2023-04-06 03:37:19,627 epoch 94 - iter 795/2650 - loss 0.01805023 - time (sec): 57.51 - samples/sec: 7597.37 - lr: 0.025000
2023-04-06 03:37:38,814 epoch 94 - iter 1060/2650 - loss 0.01790143 - time (sec): 76.69 - samples/sec: 7591.23 - lr: 0.025000
2023-04-06 03:37:58,362 epoch 94 - iter 1325/2650 - loss 0.01803606 - time (sec): 96.24 - samples/sec: 7579.72 - lr: 0.025000
2023-04-06 03:38:17,969 epoch 94 - iter 1590/2650 - loss 0.01791083 - time (sec): 115.85 - samples/sec: 7557.05 - lr: 0.025000
2023-04-06 03:38:37,230 epoch 94 - iter 1855/2650 - loss 0.01786170 - time (sec): 135.11 - samples/sec: 7561.42 - lr: 0.025000
2023-04-06 03:38:56,894 epoch 94 - iter 2120/2650 - loss 0.01793272 - time (sec): 154.77 - samples/sec: 7549.65 - lr: 0.025000
2023-04-06 03:39:16,332 epoch 94 - iter 2385/2650 - loss 0.01802128 - time (sec): 174.21 - samples/sec: 7551.15 - lr: 0.025000
2023-04-06 03:39:35,760 epoch 94 - iter 2650/2650 - loss 0.01797402 - time (sec): 193.64 - samples/sec: 7551.98 - lr: 0.025000
2023-04-06 03:39:35,761 ----------------------------------------------------------------------------------------------------
2023-04-06 03:39:35,761 EPOCH 94 done: loss 0.0180 - lr 0.025000
2023-04-06 03:39:35,761 BAD EPOCHS (no improvement): 0
2023-04-06 03:39:35,765 ----------------------------------------------------------------------------------------------------
2023-04-06 03:39:55,059 epoch 95 - iter 265/2650 - loss 0.01790416 - time (sec): 19.29 - samples/sec: 7557.57 - lr: 0.025000
2023-04-06 03:40:14,274 epoch 95 - iter 530/2650 - loss 0.01731565 - time (sec): 38.51 - samples/sec: 7560.72 - lr: 0.025000
2023-04-06 03:40:33,909 epoch 95 - iter 795/2650 - loss 0.01791248 - time (sec): 58.14 - samples/sec: 7508.33 - lr: 0.025000
2023-04-06 03:40:53,791 epoch 95 - iter 1060/2650 - loss 0.01808322 - time (sec): 78.03 - samples/sec: 7483.79 - lr: 0.025000
2023-04-06 03:41:12,789 epoch 95 - iter 1325/2650 - loss 0.01797099 - time (sec): 97.02 - samples/sec: 7517.48 - lr: 0.025000
2023-04-06 03:41:31,863 epoch 95 - iter 1590/2650 - loss 0.01792932 - time (sec): 116.10 - samples/sec: 7548.10 - lr: 0.025000
2023-04-06 03:41:51,382 epoch 95 - iter 1855/2650 - loss 0.01800752 - time (sec): 135.62 - samples/sec: 7546.39 - lr: 0.025000
2023-04-06 03:42:10,965 epoch 95 - iter 2120/2650 - loss 0.01803854 - time (sec): 155.20 - samples/sec: 7533.71 - lr: 0.025000
2023-04-06 03:42:30,075 epoch 95 - iter 2385/2650 - loss 0.01798750 - time (sec): 174.31 - samples/sec: 7549.28 - lr: 0.025000
2023-04-06 03:42:49,093 epoch 95 - iter 2650/2650 - loss 0.01811374 - time (sec): 193.33 - samples/sec: 7564.15 - lr: 0.025000
2023-04-06 03:42:49,094 ----------------------------------------------------------------------------------------------------
2023-04-06 03:42:49,094 EPOCH 95 done: loss 0.0181 - lr 0.025000
2023-04-06 03:42:49,094 BAD EPOCHS (no improvement): 1
2023-04-06 03:42:49,098 ----------------------------------------------------------------------------------------------------
2023-04-06 03:43:08,404 epoch 96 - iter 265/2650 - loss 0.01810625 - time (sec): 19.31 - samples/sec: 7551.71 - lr: 0.025000
2023-04-06 03:43:27,540 epoch 96 - iter 530/2650 - loss 0.01815441 - time (sec): 38.44 - samples/sec: 7588.52 - lr: 0.025000
2023-04-06 03:43:47,347 epoch 96 - iter 795/2650 - loss 0.01773251 - time (sec): 58.25 - samples/sec: 7530.07 - lr: 0.025000
2023-04-06 03:44:06,042 epoch 96 - iter 1060/2650 - loss 0.01791772 - time (sec): 76.94 - samples/sec: 7587.15 - lr: 0.025000
2023-04-06 03:44:25,513 epoch 96 - iter 1325/2650 - loss 0.01809560 - time (sec): 96.41 - samples/sec: 7588.94 - lr: 0.025000
2023-04-06 03:44:44,972 epoch 96 - iter 1590/2650 - loss 0.01810324 - time (sec): 115.87 - samples/sec: 7574.38 - lr: 0.025000
2023-04-06 03:45:04,707 epoch 96 - iter 1855/2650 - loss 0.01827558 - time (sec): 135.61 - samples/sec: 7558.64 - lr: 0.025000
2023-04-06 03:45:24,257 epoch 96 - iter 2120/2650 - loss 0.01821144 - time (sec): 155.16 - samples/sec: 7547.39 - lr: 0.025000
2023-04-06 03:45:43,890 epoch 96 - iter 2385/2650 - loss 0.01803239 - time (sec): 174.79 - samples/sec: 7534.69 - lr: 0.025000
2023-04-06 03:46:02,768 epoch 96 - iter 2650/2650 - loss 0.01790687 - time (sec): 193.67 - samples/sec: 7550.79 - lr: 0.025000
2023-04-06 03:46:02,769 ----------------------------------------------------------------------------------------------------
2023-04-06 03:46:02,769 EPOCH 96 done: loss 0.0179 - lr 0.025000
2023-04-06 03:46:02,769 BAD EPOCHS (no improvement): 0
2023-04-06 03:46:02,772 ----------------------------------------------------------------------------------------------------
2023-04-06 03:46:21,808 epoch 97 - iter 265/2650 - loss 0.01842335 - time (sec): 19.04 - samples/sec: 7651.18 - lr: 0.025000
2023-04-06 03:46:40,904 epoch 97 - iter 530/2650 - loss 0.01836702 - time (sec): 38.13 - samples/sec: 7624.98 - lr: 0.025000
2023-04-06 03:46:59,787 epoch 97 - iter 795/2650 - loss 0.01766383 - time (sec): 57.01 - samples/sec: 7684.09 - lr: 0.025000
2023-04-06 03:47:19,214 epoch 97 - iter 1060/2650 - loss 0.01778333 - time (sec): 76.44 - samples/sec: 7635.82 - lr: 0.025000
2023-04-06 03:47:38,205 epoch 97 - iter 1325/2650 - loss 0.01764984 - time (sec): 95.43 - samples/sec: 7646.86 - lr: 0.025000
2023-04-06 03:47:57,910 epoch 97 - iter 1590/2650 - loss 0.01779518 - time (sec): 115.14 - samples/sec: 7607.87 - lr: 0.025000
2023-04-06 03:48:17,644 epoch 97 - iter 1855/2650 - loss 0.01777098 - time (sec): 134.87 - samples/sec: 7580.50 - lr: 0.025000
2023-04-06 03:48:36,952 epoch 97 - iter 2120/2650 - loss 0.01778203 - time (sec): 154.18 - samples/sec: 7575.63 - lr: 0.025000
2023-04-06 03:48:56,316 epoch 97 - iter 2385/2650 - loss 0.01781730 - time (sec): 173.54 - samples/sec: 7577.50 - lr: 0.025000
2023-04-06 03:49:16,281 epoch 97 - iter 2650/2650 - loss 0.01778663 - time (sec): 193.51 - samples/sec: 7557.11 - lr: 0.025000
2023-04-06 03:49:16,281 ----------------------------------------------------------------------------------------------------
2023-04-06 03:49:16,281 EPOCH 97 done: loss 0.0178 - lr 0.025000
2023-04-06 03:49:16,281 BAD EPOCHS (no improvement): 0
2023-04-06 03:49:16,285 ----------------------------------------------------------------------------------------------------
2023-04-06 03:49:35,582 epoch 98 - iter 265/2650 - loss 0.01873937 - time (sec): 19.30 - samples/sec: 7534.37 - lr: 0.025000
2023-04-06 03:49:54,804 epoch 98 - iter 530/2650 - loss 0.01764710 - time (sec): 38.52 - samples/sec: 7580.86 - lr: 0.025000
2023-04-06 03:50:14,042 epoch 98 - iter 795/2650 - loss 0.01768454 - time (sec): 57.76 - samples/sec: 7588.24 - lr: 0.025000
2023-04-06 03:50:33,263 epoch 98 - iter 1060/2650 - loss 0.01774067 - time (sec): 76.98 - samples/sec: 7603.12 - lr: 0.025000
2023-04-06 03:50:52,872 epoch 98 - iter 1325/2650 - loss 0.01775046 - time (sec): 96.59 - samples/sec: 7571.95 - lr: 0.025000
2023-04-06 03:51:12,010 epoch 98 - iter 1590/2650 - loss 0.01765197 - time (sec): 115.72 - samples/sec: 7574.35 - lr: 0.025000
2023-04-06 03:51:31,442 epoch 98 - iter 1855/2650 - loss 0.01765816 - time (sec): 135.16 - samples/sec: 7570.01 - lr: 0.025000
2023-04-06 03:51:50,361 epoch 98 - iter 2120/2650 - loss 0.01758818 - time (sec): 154.08 - samples/sec: 7585.09 - lr: 0.025000
2023-04-06 03:52:09,653 epoch 98 - iter 2385/2650 - loss 0.01754489 - time (sec): 173.37 - samples/sec: 7587.51 - lr: 0.025000
2023-04-06 03:52:29,103 epoch 98 - iter 2650/2650 - loss 0.01757753 - time (sec): 192.82 - samples/sec: 7584.16 - lr: 0.025000
2023-04-06 03:52:29,103 ----------------------------------------------------------------------------------------------------
2023-04-06 03:52:29,103 EPOCH 98 done: loss 0.0176 - lr 0.025000
2023-04-06 03:52:29,103 BAD EPOCHS (no improvement): 0
2023-04-06 03:52:29,107 ----------------------------------------------------------------------------------------------------
2023-04-06 03:52:48,533 epoch 99 - iter 265/2650 - loss 0.01717458 - time (sec): 19.43 - samples/sec: 7514.37 - lr: 0.025000
2023-04-06 03:53:08,232 epoch 99 - iter 530/2650 - loss 0.01716876 - time (sec): 39.12 - samples/sec: 7477.50 - lr: 0.025000
2023-04-06 03:53:27,878 epoch 99 - iter 795/2650 - loss 0.01756395 - time (sec): 58.77 - samples/sec: 7486.66 - lr: 0.025000
2023-04-06 03:53:47,273 epoch 99 - iter 1060/2650 - loss 0.01735170 - time (sec): 78.17 - samples/sec: 7502.60 - lr: 0.025000
2023-04-06 03:54:06,939 epoch 99 - iter 1325/2650 - loss 0.01725931 - time (sec): 97.83 - samples/sec: 7476.96 - lr: 0.025000
2023-04-06 03:54:26,141 epoch 99 - iter 1590/2650 - loss 0.01742408 - time (sec): 117.03 - samples/sec: 7498.20 - lr: 0.025000
2023-04-06 03:54:45,480 epoch 99 - iter 1855/2650 - loss 0.01750477 - time (sec): 136.37 - samples/sec: 7509.30 - lr: 0.025000
2023-04-06 03:55:04,472 epoch 99 - iter 2120/2650 - loss 0.01758883 - time (sec): 155.37 - samples/sec: 7527.48 - lr: 0.025000
2023-04-06 03:55:23,893 epoch 99 - iter 2385/2650 - loss 0.01747955 - time (sec): 174.79 - samples/sec: 7534.47 - lr: 0.025000
2023-04-06 03:55:42,700 epoch 99 - iter 2650/2650 - loss 0.01750235 - time (sec): 193.59 - samples/sec: 7553.81 - lr: 0.025000
2023-04-06 03:55:42,700 ----------------------------------------------------------------------------------------------------
2023-04-06 03:55:42,700 EPOCH 99 done: loss 0.0175 - lr 0.025000
2023-04-06 03:55:42,700 BAD EPOCHS (no improvement): 0
2023-04-06 03:55:42,703 ----------------------------------------------------------------------------------------------------
2023-04-06 03:56:01,924 epoch 100 - iter 265/2650 - loss 0.01653783 - time (sec): 19.22 - samples/sec: 7591.19 - lr: 0.025000
2023-04-06 03:56:20,775 epoch 100 - iter 530/2650 - loss 0.01675143 - time (sec): 38.07 - samples/sec: 7645.45 - lr: 0.025000
2023-04-06 03:56:40,019 epoch 100 - iter 795/2650 - loss 0.01721218 - time (sec): 57.32 - samples/sec: 7636.22 - lr: 0.025000
2023-04-06 03:56:59,734 epoch 100 - iter 1060/2650 - loss 0.01744620 - time (sec): 77.03 - samples/sec: 7574.79 - lr: 0.025000
2023-04-06 03:57:20,084 epoch 100 - iter 1325/2650 - loss 0.01772068 - time (sec): 97.38 - samples/sec: 7512.66 - lr: 0.025000
2023-04-06 03:57:39,413 epoch 100 - iter 1590/2650 - loss 0.01755216 - time (sec): 116.71 - samples/sec: 7520.37 - lr: 0.025000
2023-04-06 03:57:58,445 epoch 100 - iter 1855/2650 - loss 0.01745576 - time (sec): 135.74 - samples/sec: 7545.76 - lr: 0.025000
2023-04-06 03:58:17,193 epoch 100 - iter 2120/2650 - loss 0.01750841 - time (sec): 154.49 - samples/sec: 7568.44 - lr: 0.025000
2023-04-06 03:58:36,863 epoch 100 - iter 2385/2650 - loss 0.01759284 - time (sec): 174.16 - samples/sec: 7551.47 - lr: 0.025000
2023-04-06 03:58:56,527 epoch 100 - iter 2650/2650 - loss 0.01757857 - time (sec): 193.82 - samples/sec: 7544.83 - lr: 0.025000
2023-04-06 03:58:56,527 ----------------------------------------------------------------------------------------------------
2023-04-06 03:58:56,527 EPOCH 100 done: loss 0.0176 - lr 0.025000
2023-04-06 03:58:56,527 BAD EPOCHS (no improvement): 1
2023-04-06 03:58:56,530 ----------------------------------------------------------------------------------------------------
2023-04-06 03:59:16,593 epoch 101 - iter 265/2650 - loss 0.01708355 - time (sec): 20.06 - samples/sec: 7356.97 - lr: 0.025000
2023-04-06 03:59:36,126 epoch 101 - iter 530/2650 - loss 0.01676864 - time (sec): 39.60 - samples/sec: 7424.09 - lr: 0.025000
2023-04-06 03:59:55,855 epoch 101 - iter 795/2650 - loss 0.01737744 - time (sec): 59.33 - samples/sec: 7443.52 - lr: 0.025000
2023-04-06 04:00:14,915 epoch 101 - iter 1060/2650 - loss 0.01718627 - time (sec): 78.39 - samples/sec: 7492.13 - lr: 0.025000
2023-04-06 04:00:34,324 epoch 101 - iter 1325/2650 - loss 0.01738846 - time (sec): 97.79 - samples/sec: 7485.67 - lr: 0.025000
2023-04-06 04:00:53,518 epoch 101 - iter 1590/2650 - loss 0.01738547 - time (sec): 116.99 - samples/sec: 7505.97 - lr: 0.025000
2023-04-06 04:01:12,622 epoch 101 - iter 1855/2650 - loss 0.01723389 - time (sec): 136.09 - samples/sec: 7507.76 - lr: 0.025000
2023-04-06 04:01:31,829 epoch 101 - iter 2120/2650 - loss 0.01720360 - time (sec): 155.30 - samples/sec: 7521.89 - lr: 0.025000
2023-04-06 04:01:52,037 epoch 101 - iter 2385/2650 - loss 0.01715515 - time (sec): 175.51 - samples/sec: 7496.87 - lr: 0.025000
2023-04-06 04:02:11,900 epoch 101 - iter 2650/2650 - loss 0.01716259 - time (sec): 195.37 - samples/sec: 7485.07 - lr: 0.025000
2023-04-06 04:02:11,901 ----------------------------------------------------------------------------------------------------
2023-04-06 04:02:11,901 EPOCH 101 done: loss 0.0172 - lr 0.025000
2023-04-06 04:02:11,901 BAD EPOCHS (no improvement): 0
2023-04-06 04:02:11,905 ----------------------------------------------------------------------------------------------------
2023-04-06 04:02:31,292 epoch 102 - iter 265/2650 - loss 0.01771113 - time (sec): 19.39 - samples/sec: 7562.66 - lr: 0.025000
2023-04-06 04:02:50,751 epoch 102 - iter 530/2650 - loss 0.01683032 - time (sec): 38.85 - samples/sec: 7549.14 - lr: 0.025000
2023-04-06 04:03:10,579 epoch 102 - iter 795/2650 - loss 0.01718120 - time (sec): 58.67 - samples/sec: 7475.45 - lr: 0.025000
2023-04-06 04:03:29,851 epoch 102 - iter 1060/2650 - loss 0.01712816 - time (sec): 77.95 - samples/sec: 7502.14 - lr: 0.025000
2023-04-06 04:03:49,513 epoch 102 - iter 1325/2650 - loss 0.01701223 - time (sec): 97.61 - samples/sec: 7493.42 - lr: 0.025000
2023-04-06 04:04:08,672 epoch 102 - iter 1590/2650 - loss 0.01708399 - time (sec): 116.77 - samples/sec: 7505.23 - lr: 0.025000
2023-04-06 04:04:28,038 epoch 102 - iter 1855/2650 - loss 0.01709347 - time (sec): 136.13 - samples/sec: 7521.19 - lr: 0.025000
2023-04-06 04:04:47,120 epoch 102 - iter 2120/2650 - loss 0.01708681 - time (sec): 155.21 - samples/sec: 7543.42 - lr: 0.025000
2023-04-06 04:05:06,751 epoch 102 - iter 2385/2650 - loss 0.01710372 - time (sec): 174.85 - samples/sec: 7529.01 - lr: 0.025000
2023-04-06 04:05:26,431 epoch 102 - iter 2650/2650 - loss 0.01713520 - time (sec): 194.53 - samples/sec: 7517.55 - lr: 0.025000
2023-04-06 04:05:26,432 ----------------------------------------------------------------------------------------------------
2023-04-06 04:05:26,432 EPOCH 102 done: loss 0.0171 - lr 0.025000
2023-04-06 04:05:26,432 BAD EPOCHS (no improvement): 0
2023-04-06 04:05:26,436 ----------------------------------------------------------------------------------------------------
2023-04-06 04:05:46,838 epoch 103 - iter 265/2650 - loss 0.01693677 - time (sec): 20.40 - samples/sec: 7235.82 - lr: 0.025000
2023-04-06 04:06:05,956 epoch 103 - iter 530/2650 - loss 0.01693291 - time (sec): 39.52 - samples/sec: 7412.55 - lr: 0.025000
2023-04-06 04:06:25,030 epoch 103 - iter 795/2650 - loss 0.01713680 - time (sec): 58.59 - samples/sec: 7476.81 - lr: 0.025000
2023-04-06 04:06:44,335 epoch 103 - iter 1060/2650 - loss 0.01710151 - time (sec): 77.90 - samples/sec: 7478.01 - lr: 0.025000
2023-04-06 04:07:04,468 epoch 103 - iter 1325/2650 - loss 0.01730369 - time (sec): 98.03 - samples/sec: 7437.24 - lr: 0.025000
2023-04-06 04:07:23,745 epoch 103 - iter 1590/2650 - loss 0.01727228 - time (sec): 117.31 - samples/sec: 7464.26 - lr: 0.025000
2023-04-06 04:07:43,283 epoch 103 - iter 1855/2650 - loss 0.01722724 - time (sec): 136.85 - samples/sec: 7477.51 - lr: 0.025000
2023-04-06 04:08:02,896 epoch 103 - iter 2120/2650 - loss 0.01716343 - time (sec): 156.46 - samples/sec: 7476.75 - lr: 0.025000
2023-04-06 04:08:22,176 epoch 103 - iter 2385/2650 - loss 0.01711566 - time (sec): 175.74 - samples/sec: 7495.27 - lr: 0.025000
2023-04-06 04:08:41,029 epoch 103 - iter 2650/2650 - loss 0.01716014 - time (sec): 194.59 - samples/sec: 7514.97 - lr: 0.025000
2023-04-06 04:08:41,029 ----------------------------------------------------------------------------------------------------
2023-04-06 04:08:41,029 EPOCH 103 done: loss 0.0172 - lr 0.025000
2023-04-06 04:08:41,029 BAD EPOCHS (no improvement): 1
2023-04-06 04:08:41,033 ----------------------------------------------------------------------------------------------------
2023-04-06 04:09:01,535 epoch 104 - iter 265/2650 - loss 0.01693417 - time (sec): 20.50 - samples/sec: 7155.75 - lr: 0.025000
2023-04-06 04:09:20,609 epoch 104 - iter 530/2650 - loss 0.01685370 - time (sec): 39.58 - samples/sec: 7400.11 - lr: 0.025000
2023-04-06 04:09:39,567 epoch 104 - iter 795/2650 - loss 0.01692015 - time (sec): 58.53 - samples/sec: 7498.01 - lr: 0.025000
2023-04-06 04:09:59,074 epoch 104 - iter 1060/2650 - loss 0.01705000 - time (sec): 78.04 - samples/sec: 7488.39 - lr: 0.025000
2023-04-06 04:10:18,738 epoch 104 - iter 1325/2650 - loss 0.01719044 - time (sec): 97.71 - samples/sec: 7499.43 - lr: 0.025000
2023-04-06 04:10:38,381 epoch 104 - iter 1590/2650 - loss 0.01725429 - time (sec): 117.35 - samples/sec: 7503.62 - lr: 0.025000
2023-04-06 04:10:57,497 epoch 104 - iter 1855/2650 - loss 0.01709761 - time (sec): 136.46 - samples/sec: 7514.84 - lr: 0.025000
2023-04-06 04:11:17,167 epoch 104 - iter 2120/2650 - loss 0.01711381 - time (sec): 156.13 - samples/sec: 7506.61 - lr: 0.025000
2023-04-06 04:11:36,525 epoch 104 - iter 2385/2650 - loss 0.01709983 - time (sec): 175.49 - samples/sec: 7509.00 - lr: 0.025000
2023-04-06 04:11:55,654 epoch 104 - iter 2650/2650 - loss 0.01710809 - time (sec): 194.62 - samples/sec: 7513.89 - lr: 0.025000
2023-04-06 04:11:55,655 ----------------------------------------------------------------------------------------------------
2023-04-06 04:11:55,655 EPOCH 104 done: loss 0.0171 - lr 0.025000
2023-04-06 04:11:55,655 BAD EPOCHS (no improvement): 0
2023-04-06 04:11:55,658 ----------------------------------------------------------------------------------------------------
2023-04-06 04:12:15,128 epoch 105 - iter 265/2650 - loss 0.01631357 - time (sec): 19.47 - samples/sec: 7510.77 - lr: 0.025000
2023-04-06 04:12:34,317 epoch 105 - iter 530/2650 - loss 0.01673442 - time (sec): 38.66 - samples/sec: 7510.69 - lr: 0.025000
2023-04-06 04:12:53,939 epoch 105 - iter 795/2650 - loss 0.01688012 - time (sec): 58.28 - samples/sec: 7466.04 - lr: 0.025000
2023-04-06 04:13:12,920 epoch 105 - iter 1060/2650 - loss 0.01650347 - time (sec): 77.26 - samples/sec: 7503.40 - lr: 0.025000
2023-04-06 04:13:32,641 epoch 105 - iter 1325/2650 - loss 0.01656678 - time (sec): 96.98 - samples/sec: 7488.10 - lr: 0.025000
2023-04-06 04:13:52,608 epoch 105 - iter 1590/2650 - loss 0.01675649 - time (sec): 116.95 - samples/sec: 7480.16 - lr: 0.025000
2023-04-06 04:14:11,887 epoch 105 - iter 1855/2650 - loss 0.01683192 - time (sec): 136.23 - samples/sec: 7501.78 - lr: 0.025000
2023-04-06 04:14:40,662 epoch 105 - iter 2120/2650 - loss 0.01692088 - time (sec): 165.00 - samples/sec: 7079.35 - lr: 0.025000
2023-04-06 04:15:00,227 epoch 105 - iter 2385/2650 - loss 0.01687815 - time (sec): 184.57 - samples/sec: 7126.79 - lr: 0.025000
2023-04-06 04:15:19,973 epoch 105 - iter 2650/2650 - loss 0.01692683 - time (sec): 204.32 - samples/sec: 7157.39 - lr: 0.025000
2023-04-06 04:15:19,974 ----------------------------------------------------------------------------------------------------
2023-04-06 04:15:19,974 EPOCH 105 done: loss 0.0169 - lr 0.025000
2023-04-06 04:15:19,974 BAD EPOCHS (no improvement): 0
2023-04-06 04:15:19,977 ----------------------------------------------------------------------------------------------------
2023-04-06 04:15:39,817 epoch 106 - iter 265/2650 - loss 0.01673648 - time (sec): 19.84 - samples/sec: 7458.28 - lr: 0.025000
2023-04-06 04:15:59,784 epoch 106 - iter 530/2650 - loss 0.01715912 - time (sec): 39.81 - samples/sec: 7442.78 - lr: 0.025000
2023-04-06 04:16:20,040 epoch 106 - iter 795/2650 - loss 0.01675035 - time (sec): 60.06 - samples/sec: 7396.12 - lr: 0.025000
2023-04-06 04:16:38,975 epoch 106 - iter 1060/2650 - loss 0.01695581 - time (sec): 79.00 - samples/sec: 7453.50 - lr: 0.025000
2023-04-06 04:16:58,872 epoch 106 - iter 1325/2650 - loss 0.01665794 - time (sec): 98.89 - samples/sec: 7430.63 - lr: 0.025000
2023-04-06 04:17:18,918 epoch 106 - iter 1590/2650 - loss 0.01669753 - time (sec): 118.94 - samples/sec: 7414.86 - lr: 0.025000
2023-04-06 04:17:38,280 epoch 106 - iter 1855/2650 - loss 0.01666533 - time (sec): 138.30 - samples/sec: 7436.68 - lr: 0.025000
2023-04-06 04:17:56,821 epoch 106 - iter 2120/2650 - loss 0.01664147 - time (sec): 156.84 - samples/sec: 7471.10 - lr: 0.025000
2023-04-06 04:18:16,098 epoch 106 - iter 2385/2650 - loss 0.01665772 - time (sec): 176.12 - samples/sec: 7479.77 - lr: 0.025000
2023-04-06 04:18:35,545 epoch 106 - iter 2650/2650 - loss 0.01672930 - time (sec): 195.57 - samples/sec: 7477.51 - lr: 0.025000
2023-04-06 04:18:35,546 ----------------------------------------------------------------------------------------------------
2023-04-06 04:18:35,546 EPOCH 106 done: loss 0.0167 - lr 0.025000
2023-04-06 04:18:35,546 BAD EPOCHS (no improvement): 0
2023-04-06 04:18:35,549 ----------------------------------------------------------------------------------------------------
2023-04-06 04:18:55,116 epoch 107 - iter 265/2650 - loss 0.01700508 - time (sec): 19.57 - samples/sec: 7511.28 - lr: 0.025000
2023-04-06 04:19:14,129 epoch 107 - iter 530/2650 - loss 0.01666121 - time (sec): 38.58 - samples/sec: 7604.94 - lr: 0.025000
2023-04-06 04:19:33,583 epoch 107 - iter 795/2650 - loss 0.01687065 - time (sec): 58.03 - samples/sec: 7542.21 - lr: 0.025000
2023-04-06 04:19:53,108 epoch 107 - iter 1060/2650 - loss 0.01674413 - time (sec): 77.56 - samples/sec: 7525.50 - lr: 0.025000
2023-04-06 04:20:13,013 epoch 107 - iter 1325/2650 - loss 0.01678768 - time (sec): 97.46 - samples/sec: 7495.03 - lr: 0.025000
2023-04-06 04:20:32,140 epoch 107 - iter 1590/2650 - loss 0.01692871 - time (sec): 116.59 - samples/sec: 7534.34 - lr: 0.025000
2023-04-06 04:20:51,488 epoch 107 - iter 1855/2650 - loss 0.01694868 - time (sec): 135.94 - samples/sec: 7528.95 - lr: 0.025000
2023-04-06 04:21:11,046 epoch 107 - iter 2120/2650 - loss 0.01675980 - time (sec): 155.50 - samples/sec: 7527.58 - lr: 0.025000
2023-04-06 04:21:30,648 epoch 107 - iter 2385/2650 - loss 0.01670803 - time (sec): 175.10 - samples/sec: 7515.14 - lr: 0.025000
2023-04-06 04:21:50,596 epoch 107 - iter 2650/2650 - loss 0.01666834 - time (sec): 195.05 - samples/sec: 7497.46 - lr: 0.025000
2023-04-06 04:21:50,597 ----------------------------------------------------------------------------------------------------
2023-04-06 04:21:50,597 EPOCH 107 done: loss 0.0167 - lr 0.025000
2023-04-06 04:21:50,597 BAD EPOCHS (no improvement): 0
2023-04-06 04:21:50,600 ----------------------------------------------------------------------------------------------------
2023-04-06 04:22:09,566 epoch 108 - iter 265/2650 - loss 0.01716220 - time (sec): 18.97 - samples/sec: 7621.41 - lr: 0.025000
2023-04-06 04:22:28,648 epoch 108 - iter 530/2650 - loss 0.01683258 - time (sec): 38.05 - samples/sec: 7587.19 - lr: 0.025000
2023-04-06 04:22:48,430 epoch 108 - iter 795/2650 - loss 0.01684678 - time (sec): 57.83 - samples/sec: 7504.31 - lr: 0.025000
2023-04-06 04:23:08,549 epoch 108 - iter 1060/2650 - loss 0.01679694 - time (sec): 77.95 - samples/sec: 7459.97 - lr: 0.025000
2023-04-06 04:23:27,790 epoch 108 - iter 1325/2650 - loss 0.01653009 - time (sec): 97.19 - samples/sec: 7483.36 - lr: 0.025000
2023-04-06 04:23:47,274 epoch 108 - iter 1590/2650 - loss 0.01634798 - time (sec): 116.67 - samples/sec: 7494.83 - lr: 0.025000
2023-04-06 04:24:07,013 epoch 108 - iter 1855/2650 - loss 0.01655042 - time (sec): 136.41 - samples/sec: 7487.22 - lr: 0.025000
2023-04-06 04:24:26,489 epoch 108 - iter 2120/2650 - loss 0.01659093 - time (sec): 155.89 - samples/sec: 7487.43 - lr: 0.025000
2023-04-06 04:24:46,638 epoch 108 - iter 2385/2650 - loss 0.01656739 - time (sec): 176.04 - samples/sec: 7473.98 - lr: 0.025000
2023-04-06 04:25:05,799 epoch 108 - iter 2650/2650 - loss 0.01656419 - time (sec): 195.20 - samples/sec: 7491.64 - lr: 0.025000
2023-04-06 04:25:05,799 ----------------------------------------------------------------------------------------------------
2023-04-06 04:25:05,799 EPOCH 108 done: loss 0.0166 - lr 0.025000
2023-04-06 04:25:05,799 BAD EPOCHS (no improvement): 0
2023-04-06 04:25:05,802 ----------------------------------------------------------------------------------------------------
2023-04-06 04:25:25,059 epoch 109 - iter 265/2650 - loss 0.01699372 - time (sec): 19.26 - samples/sec: 7505.63 - lr: 0.025000
2023-04-06 04:25:44,922 epoch 109 - iter 530/2650 - loss 0.01680227 - time (sec): 39.12 - samples/sec: 7469.97 - lr: 0.025000
2023-04-06 04:26:04,672 epoch 109 - iter 795/2650 - loss 0.01718436 - time (sec): 58.87 - samples/sec: 7477.72 - lr: 0.025000
2023-04-06 04:26:23,887 epoch 109 - iter 1060/2650 - loss 0.01706081 - time (sec): 78.08 - samples/sec: 7502.46 - lr: 0.025000
2023-04-06 04:26:43,007 epoch 109 - iter 1325/2650 - loss 0.01710698 - time (sec): 97.21 - samples/sec: 7517.76 - lr: 0.025000
2023-04-06 04:27:02,407 epoch 109 - iter 1590/2650 - loss 0.01703925 - time (sec): 116.60 - samples/sec: 7518.70 - lr: 0.025000
2023-04-06 04:27:21,907 epoch 109 - iter 1855/2650 - loss 0.01701318 - time (sec): 136.10 - samples/sec: 7510.53 - lr: 0.025000
2023-04-06 04:27:41,434 epoch 109 - iter 2120/2650 - loss 0.01684682 - time (sec): 155.63 - samples/sec: 7514.95 - lr: 0.025000
2023-04-06 04:28:01,438 epoch 109 - iter 2385/2650 - loss 0.01683687 - time (sec): 175.64 - samples/sec: 7495.55 - lr: 0.025000
2023-04-06 04:28:21,167 epoch 109 - iter 2650/2650 - loss 0.01681963 - time (sec): 195.36 - samples/sec: 7485.30 - lr: 0.025000
2023-04-06 04:28:21,167 ----------------------------------------------------------------------------------------------------
2023-04-06 04:28:21,167 EPOCH 109 done: loss 0.0168 - lr 0.025000
2023-04-06 04:28:21,167 BAD EPOCHS (no improvement): 1
2023-04-06 04:28:21,170 ----------------------------------------------------------------------------------------------------
2023-04-06 04:28:40,916 epoch 110 - iter 265/2650 - loss 0.01628376 - time (sec): 19.75 - samples/sec: 7409.00 - lr: 0.025000
2023-04-06 04:29:00,112 epoch 110 - iter 530/2650 - loss 0.01609302 - time (sec): 38.94 - samples/sec: 7485.10 - lr: 0.025000
2023-04-06 04:29:19,558 epoch 110 - iter 795/2650 - loss 0.01640575 - time (sec): 58.39 - samples/sec: 7501.30 - lr: 0.025000
2023-04-06 04:29:38,710 epoch 110 - iter 1060/2650 - loss 0.01642829 - time (sec): 77.54 - samples/sec: 7517.67 - lr: 0.025000
2023-04-06 04:29:58,303 epoch 110 - iter 1325/2650 - loss 0.01654595 - time (sec): 97.13 - samples/sec: 7496.37 - lr: 0.025000
2023-04-06 04:30:17,415 epoch 110 - iter 1590/2650 - loss 0.01668240 - time (sec): 116.24 - samples/sec: 7522.48 - lr: 0.025000
2023-04-06 04:30:36,820 epoch 110 - iter 1855/2650 - loss 0.01666119 - time (sec): 135.65 - samples/sec: 7524.03 - lr: 0.025000
2023-04-06 04:30:57,228 epoch 110 - iter 2120/2650 - loss 0.01670959 - time (sec): 156.06 - samples/sec: 7491.56 - lr: 0.025000
2023-04-06 04:31:16,814 epoch 110 - iter 2385/2650 - loss 0.01654634 - time (sec): 175.64 - samples/sec: 7496.33 - lr: 0.025000
2023-04-06 04:31:36,498 epoch 110 - iter 2650/2650 - loss 0.01649117 - time (sec): 195.33 - samples/sec: 7486.71 - lr: 0.025000
2023-04-06 04:31:36,498 ----------------------------------------------------------------------------------------------------
2023-04-06 04:31:36,498 EPOCH 110 done: loss 0.0165 - lr 0.025000
2023-04-06 04:31:36,498 BAD EPOCHS (no improvement): 0
2023-04-06 04:31:36,502 ----------------------------------------------------------------------------------------------------
2023-04-06 04:31:55,822 epoch 111 - iter 265/2650 - loss 0.01547960 - time (sec): 19.32 - samples/sec: 7653.66 - lr: 0.025000
2023-04-06 04:32:15,205 epoch 111 - iter 530/2650 - loss 0.01568430 - time (sec): 38.70 - samples/sec: 7565.61 - lr: 0.025000
2023-04-06 04:32:34,897 epoch 111 - iter 795/2650 - loss 0.01607482 - time (sec): 58.39 - samples/sec: 7503.04 - lr: 0.025000
2023-04-06 04:32:54,654 epoch 111 - iter 1060/2650 - loss 0.01612207 - time (sec): 78.15 - samples/sec: 7464.00 - lr: 0.025000
2023-04-06 04:33:13,848 epoch 111 - iter 1325/2650 - loss 0.01611149 - time (sec): 97.35 - samples/sec: 7491.33 - lr: 0.025000
2023-04-06 04:33:33,709 epoch 111 - iter 1590/2650 - loss 0.01641411 - time (sec): 117.21 - samples/sec: 7475.68 - lr: 0.025000
2023-04-06 04:33:53,695 epoch 111 - iter 1855/2650 - loss 0.01632107 - time (sec): 137.19 - samples/sec: 7469.89 - lr: 0.025000
2023-04-06 04:34:13,108 epoch 111 - iter 2120/2650 - loss 0.01642444 - time (sec): 156.61 - samples/sec: 7478.19 - lr: 0.025000
2023-04-06 04:34:32,231 epoch 111 - iter 2385/2650 - loss 0.01634078 - time (sec): 175.73 - samples/sec: 7482.79 - lr: 0.025000
2023-04-06 04:34:52,337 epoch 111 - iter 2650/2650 - loss 0.01639498 - time (sec): 195.84 - samples/sec: 7467.30 - lr: 0.025000
2023-04-06 04:34:52,338 ----------------------------------------------------------------------------------------------------
2023-04-06 04:34:52,338 EPOCH 111 done: loss 0.0164 - lr 0.025000
2023-04-06 04:34:52,338 BAD EPOCHS (no improvement): 0
2023-04-06 04:34:52,340 ----------------------------------------------------------------------------------------------------
2023-04-06 04:35:11,809 epoch 112 - iter 265/2650 - loss 0.01689996 - time (sec): 19.47 - samples/sec: 7473.54 - lr: 0.025000
2023-04-06 04:35:31,695 epoch 112 - iter 530/2650 - loss 0.01663563 - time (sec): 39.35 - samples/sec: 7414.24 - lr: 0.025000
2023-04-06 04:35:51,241 epoch 112 - iter 795/2650 - loss 0.01646828 - time (sec): 58.90 - samples/sec: 7444.65 - lr: 0.025000
2023-04-06 04:36:10,563 epoch 112 - iter 1060/2650 - loss 0.01630354 - time (sec): 78.22 - samples/sec: 7486.80 - lr: 0.025000
2023-04-06 04:36:30,165 epoch 112 - iter 1325/2650 - loss 0.01636160 - time (sec): 97.82 - samples/sec: 7493.21 - lr: 0.025000
2023-04-06 04:36:49,512 epoch 112 - iter 1590/2650 - loss 0.01635887 - time (sec): 117.17 - samples/sec: 7500.02 - lr: 0.025000
2023-04-06 04:37:08,296 epoch 112 - iter 1855/2650 - loss 0.01639079 - time (sec): 135.96 - samples/sec: 7526.10 - lr: 0.025000
2023-04-06 04:37:28,064 epoch 112 - iter 2120/2650 - loss 0.01649152 - time (sec): 155.72 - samples/sec: 7511.91 - lr: 0.025000
2023-04-06 04:37:47,686 epoch 112 - iter 2385/2650 - loss 0.01636407 - time (sec): 175.35 - samples/sec: 7514.33 - lr: 0.025000
2023-04-06 04:38:07,327 epoch 112 - iter 2650/2650 - loss 0.01643489 - time (sec): 194.99 - samples/sec: 7499.80 - lr: 0.025000
2023-04-06 04:38:07,328 ----------------------------------------------------------------------------------------------------
2023-04-06 04:38:07,328 EPOCH 112 done: loss 0.0164 - lr 0.025000
2023-04-06 04:38:07,328 BAD EPOCHS (no improvement): 1
2023-04-06 04:38:07,331 ----------------------------------------------------------------------------------------------------
2023-04-06 04:38:26,839 epoch 113 - iter 265/2650 - loss 0.01588915 - time (sec): 19.51 - samples/sec: 7449.40 - lr: 0.025000
2023-04-06 04:38:46,314 epoch 113 - iter 530/2650 - loss 0.01567703 - time (sec): 38.98 - samples/sec: 7494.46 - lr: 0.025000
2023-04-06 04:39:05,624 epoch 113 - iter 795/2650 - loss 0.01571463 - time (sec): 58.29 - samples/sec: 7532.94 - lr: 0.025000
2023-04-06 04:39:24,892 epoch 113 - iter 1060/2650 - loss 0.01554186 - time (sec): 77.56 - samples/sec: 7554.24 - lr: 0.025000
2023-04-06 04:39:44,216 epoch 113 - iter 1325/2650 - loss 0.01570454 - time (sec): 96.88 - samples/sec: 7549.37 - lr: 0.025000
2023-04-06 04:40:04,074 epoch 113 - iter 1590/2650 - loss 0.01562611 - time (sec): 116.74 - samples/sec: 7519.75 - lr: 0.025000
2023-04-06 04:40:24,155 epoch 113 - iter 1855/2650 - loss 0.01569171 - time (sec): 136.82 - samples/sec: 7481.73 - lr: 0.025000
2023-04-06 04:40:43,424 epoch 113 - iter 2120/2650 - loss 0.01575266 - time (sec): 156.09 - samples/sec: 7496.72 - lr: 0.025000
2023-04-06 04:41:02,890 epoch 113 - iter 2385/2650 - loss 0.01588818 - time (sec): 175.56 - samples/sec: 7495.26 - lr: 0.025000
2023-04-06 04:41:22,410 epoch 113 - iter 2650/2650 - loss 0.01589302 - time (sec): 195.08 - samples/sec: 7496.28 - lr: 0.025000
2023-04-06 04:41:22,410 ----------------------------------------------------------------------------------------------------
2023-04-06 04:41:22,410 EPOCH 113 done: loss 0.0159 - lr 0.025000
2023-04-06 04:41:22,410 BAD EPOCHS (no improvement): 0
2023-04-06 04:41:22,414 ----------------------------------------------------------------------------------------------------
2023-04-06 04:41:41,875 epoch 114 - iter 265/2650 - loss 0.01641695 - time (sec): 19.46 - samples/sec: 7484.16 - lr: 0.025000
2023-04-06 04:42:01,777 epoch 114 - iter 530/2650 - loss 0.01602753 - time (sec): 39.36 - samples/sec: 7417.53 - lr: 0.025000
2023-04-06 04:42:20,785 epoch 114 - iter 795/2650 - loss 0.01602222 - time (sec): 58.37 - samples/sec: 7500.14 - lr: 0.025000
2023-04-06 04:42:39,912 epoch 114 - iter 1060/2650 - loss 0.01601872 - time (sec): 77.50 - samples/sec: 7536.27 - lr: 0.025000
2023-04-06 04:42:59,322 epoch 114 - iter 1325/2650 - loss 0.01606059 - time (sec): 96.91 - samples/sec: 7536.92 - lr: 0.025000
2023-04-06 04:43:18,274 epoch 114 - iter 1590/2650 - loss 0.01620848 - time (sec): 115.86 - samples/sec: 7556.75 - lr: 0.025000
2023-04-06 04:43:38,052 epoch 114 - iter 1855/2650 - loss 0.01604542 - time (sec): 135.64 - samples/sec: 7536.29 - lr: 0.025000
2023-04-06 04:43:58,144 epoch 114 - iter 2120/2650 - loss 0.01623785 - time (sec): 155.73 - samples/sec: 7513.91 - lr: 0.025000
2023-04-06 04:44:18,005 epoch 114 - iter 2385/2650 - loss 0.01632508 - time (sec): 175.59 - samples/sec: 7502.14 - lr: 0.025000
2023-04-06 04:44:37,152 epoch 114 - iter 2650/2650 - loss 0.01639981 - time (sec): 194.74 - samples/sec: 7509.39 - lr: 0.025000
2023-04-06 04:44:37,152 ----------------------------------------------------------------------------------------------------
2023-04-06 04:44:37,152 EPOCH 114 done: loss 0.0164 - lr 0.025000
2023-04-06 04:44:37,152 BAD EPOCHS (no improvement): 1
2023-04-06 04:44:37,156 ----------------------------------------------------------------------------------------------------
2023-04-06 04:44:57,155 epoch 115 - iter 265/2650 - loss 0.01589350 - time (sec): 20.00 - samples/sec: 7412.51 - lr: 0.025000
2023-04-06 04:45:16,335 epoch 115 - iter 530/2650 - loss 0.01542737 - time (sec): 39.18 - samples/sec: 7501.90 - lr: 0.025000
2023-04-06 04:45:36,109 epoch 115 - iter 795/2650 - loss 0.01603309 - time (sec): 58.95 - samples/sec: 7444.78 - lr: 0.025000
2023-04-06 04:45:56,056 epoch 115 - iter 1060/2650 - loss 0.01606387 - time (sec): 78.90 - samples/sec: 7427.18 - lr: 0.025000
2023-04-06 04:46:15,235 epoch 115 - iter 1325/2650 - loss 0.01614928 - time (sec): 98.08 - samples/sec: 7459.92 - lr: 0.025000
2023-04-06 04:46:34,601 epoch 115 - iter 1590/2650 - loss 0.01621075 - time (sec): 117.45 - samples/sec: 7473.39 - lr: 0.025000
2023-04-06 04:46:54,036 epoch 115 - iter 1855/2650 - loss 0.01621625 - time (sec): 136.88 - samples/sec: 7483.58 - lr: 0.025000
2023-04-06 04:47:13,430 epoch 115 - iter 2120/2650 - loss 0.01625940 - time (sec): 156.27 - samples/sec: 7487.70 - lr: 0.025000
2023-04-06 04:47:33,464 epoch 115 - iter 2385/2650 - loss 0.01622751 - time (sec): 176.31 - samples/sec: 7470.01 - lr: 0.025000
2023-04-06 04:47:52,640 epoch 115 - iter 2650/2650 - loss 0.01630412 - time (sec): 195.48 - samples/sec: 7480.74 - lr: 0.025000
2023-04-06 04:47:52,640 ----------------------------------------------------------------------------------------------------
2023-04-06 04:47:52,640 EPOCH 115 done: loss 0.0163 - lr 0.025000
2023-04-06 04:47:52,640 BAD EPOCHS (no improvement): 2
2023-04-06 04:47:52,643 ----------------------------------------------------------------------------------------------------
2023-04-06 04:48:11,932 epoch 116 - iter 265/2650 - loss 0.01536506 - time (sec): 19.29 - samples/sec: 7535.20 - lr: 0.025000
2023-04-06 04:48:31,269 epoch 116 - iter 530/2650 - loss 0.01609121 - time (sec): 38.63 - samples/sec: 7530.97 - lr: 0.025000
2023-04-06 04:48:50,319 epoch 116 - iter 795/2650 - loss 0.01622454 - time (sec): 57.68 - samples/sec: 7575.80 - lr: 0.025000
2023-04-06 04:49:10,411 epoch 116 - iter 1060/2650 - loss 0.01612029 - time (sec): 77.77 - samples/sec: 7536.24 - lr: 0.025000
2023-04-06 04:49:30,066 epoch 116 - iter 1325/2650 - loss 0.01615203 - time (sec): 97.42 - samples/sec: 7528.20 - lr: 0.025000
2023-04-06 04:49:49,394 epoch 116 - iter 1590/2650 - loss 0.01613982 - time (sec): 116.75 - samples/sec: 7524.63 - lr: 0.025000
2023-04-06 04:50:09,161 epoch 116 - iter 1855/2650 - loss 0.01635254 - time (sec): 136.52 - samples/sec: 7502.44 - lr: 0.025000
2023-04-06 04:50:29,033 epoch 116 - iter 2120/2650 - loss 0.01640006 - time (sec): 156.39 - samples/sec: 7491.37 - lr: 0.025000
2023-04-06 04:50:48,337 epoch 116 - iter 2385/2650 - loss 0.01637971 - time (sec): 175.69 - samples/sec: 7502.04 - lr: 0.025000
2023-04-06 04:51:07,827 epoch 116 - iter 2650/2650 - loss 0.01640377 - time (sec): 195.18 - samples/sec: 7492.25 - lr: 0.025000
2023-04-06 04:51:07,827 ----------------------------------------------------------------------------------------------------
2023-04-06 04:51:07,827 EPOCH 116 done: loss 0.0164 - lr 0.025000
2023-04-06 04:51:07,827 BAD EPOCHS (no improvement): 3
2023-04-06 04:51:07,830 ----------------------------------------------------------------------------------------------------
2023-04-06 04:51:27,010 epoch 117 - iter 265/2650 - loss 0.01633434 - time (sec): 19.18 - samples/sec: 7570.02 - lr: 0.025000
2023-04-06 04:51:46,495 epoch 117 - iter 530/2650 - loss 0.01653491 - time (sec): 38.67 - samples/sec: 7545.14 - lr: 0.025000
2023-04-06 04:52:06,320 epoch 117 - iter 795/2650 - loss 0.01642501 - time (sec): 58.49 - samples/sec: 7496.55 - lr: 0.025000
2023-04-06 04:52:26,413 epoch 117 - iter 1060/2650 - loss 0.01635945 - time (sec): 78.58 - samples/sec: 7469.19 - lr: 0.025000
2023-04-06 04:52:45,899 epoch 117 - iter 1325/2650 - loss 0.01626213 - time (sec): 98.07 - samples/sec: 7470.09 - lr: 0.025000
2023-04-06 04:53:05,123 epoch 117 - iter 1590/2650 - loss 0.01606817 - time (sec): 117.29 - samples/sec: 7488.13 - lr: 0.025000
2023-04-06 04:53:24,306 epoch 117 - iter 1855/2650 - loss 0.01599580 - time (sec): 136.48 - samples/sec: 7512.75 - lr: 0.025000
2023-04-06 04:53:43,664 epoch 117 - iter 2120/2650 - loss 0.01603104 - time (sec): 155.83 - samples/sec: 7505.18 - lr: 0.025000
2023-04-06 04:54:03,572 epoch 117 - iter 2385/2650 - loss 0.01609921 - time (sec): 175.74 - samples/sec: 7489.18 - lr: 0.025000
2023-04-06 04:54:23,263 epoch 117 - iter 2650/2650 - loss 0.01608671 - time (sec): 195.43 - samples/sec: 7482.66 - lr: 0.025000
2023-04-06 04:54:23,264 ----------------------------------------------------------------------------------------------------
2023-04-06 04:54:23,264 EPOCH 117 done: loss 0.0161 - lr 0.025000
2023-04-06 04:54:23,264 Epoch 117: reducing learning rate of group 0 to 1.2500e-02.
2023-04-06 04:54:23,264 BAD EPOCHS (no improvement): 4
2023-04-06 04:54:23,267 ----------------------------------------------------------------------------------------------------
2023-04-06 04:54:43,207 epoch 118 - iter 265/2650 - loss 0.01572736 - time (sec): 19.94 - samples/sec: 7397.38 - lr: 0.012500
2023-04-06 04:55:03,167 epoch 118 - iter 530/2650 - loss 0.01576683 - time (sec): 39.90 - samples/sec: 7359.14 - lr: 0.012500
2023-04-06 04:55:22,565 epoch 118 - iter 795/2650 - loss 0.01577112 - time (sec): 59.30 - samples/sec: 7418.71 - lr: 0.012500
2023-04-06 04:55:42,039 epoch 118 - iter 1060/2650 - loss 0.01594208 - time (sec): 78.77 - samples/sec: 7441.57 - lr: 0.012500
2023-04-06 04:56:01,557 epoch 118 - iter 1325/2650 - loss 0.01577340 - time (sec): 98.29 - samples/sec: 7448.52 - lr: 0.012500
2023-04-06 04:56:21,217 epoch 118 - iter 1590/2650 - loss 0.01589593 - time (sec): 117.95 - samples/sec: 7454.72 - lr: 0.012500
2023-04-06 04:56:40,511 epoch 118 - iter 1855/2650 - loss 0.01566565 - time (sec): 137.24 - samples/sec: 7461.78 - lr: 0.012500
2023-04-06 04:56:59,741 epoch 118 - iter 2120/2650 - loss 0.01577624 - time (sec): 156.47 - samples/sec: 7477.95 - lr: 0.012500
2023-04-06 04:57:19,407 epoch 118 - iter 2385/2650 - loss 0.01564434 - time (sec): 176.14 - samples/sec: 7478.04 - lr: 0.012500
2023-04-06 04:57:38,335 epoch 118 - iter 2650/2650 - loss 0.01564190 - time (sec): 195.07 - samples/sec: 7496.68 - lr: 0.012500
2023-04-06 04:57:38,335 ----------------------------------------------------------------------------------------------------
2023-04-06 04:57:38,335 EPOCH 118 done: loss 0.0156 - lr 0.012500
2023-04-06 04:57:38,335 BAD EPOCHS (no improvement): 0
2023-04-06 04:57:38,338 ----------------------------------------------------------------------------------------------------
2023-04-06 04:57:57,840 epoch 119 - iter 265/2650 - loss 0.01544832 - time (sec): 19.50 - samples/sec: 7484.69 - lr: 0.012500
2023-04-06 04:58:17,000 epoch 119 - iter 530/2650 - loss 0.01587479 - time (sec): 38.66 - samples/sec: 7518.91 - lr: 0.012500
2023-04-06 04:58:36,746 epoch 119 - iter 795/2650 - loss 0.01555283 - time (sec): 58.41 - samples/sec: 7491.82 - lr: 0.012500
2023-04-06 04:58:56,828 epoch 119 - iter 1060/2650 - loss 0.01553353 - time (sec): 78.49 - samples/sec: 7462.88 - lr: 0.012500
2023-04-06 04:59:16,461 epoch 119 - iter 1325/2650 - loss 0.01547635 - time (sec): 98.12 - samples/sec: 7461.17 - lr: 0.012500
2023-04-06 04:59:36,238 epoch 119 - iter 1590/2650 - loss 0.01561315 - time (sec): 117.90 - samples/sec: 7446.70 - lr: 0.012500
2023-04-06 04:59:55,961 epoch 119 - iter 1855/2650 - loss 0.01553643 - time (sec): 137.62 - samples/sec: 7448.51 - lr: 0.012500
2023-04-06 05:00:15,689 epoch 119 - iter 2120/2650 - loss 0.01549607 - time (sec): 157.35 - samples/sec: 7441.51 - lr: 0.012500
2023-04-06 05:00:35,122 epoch 119 - iter 2385/2650 - loss 0.01548650 - time (sec): 176.78 - samples/sec: 7449.04 - lr: 0.012500
2023-04-06 05:00:54,424 epoch 119 - iter 2650/2650 - loss 0.01550406 - time (sec): 196.09 - samples/sec: 7457.77 - lr: 0.012500
2023-04-06 05:00:54,424 ----------------------------------------------------------------------------------------------------
2023-04-06 05:00:54,424 EPOCH 119 done: loss 0.0155 - lr 0.012500
2023-04-06 05:00:54,425 BAD EPOCHS (no improvement): 0
2023-04-06 05:00:54,427 ----------------------------------------------------------------------------------------------------
2023-04-06 05:01:13,848 epoch 120 - iter 265/2650 - loss 0.01597121 - time (sec): 19.42 - samples/sec: 7571.36 - lr: 0.012500
2023-04-06 05:01:32,913 epoch 120 - iter 530/2650 - loss 0.01562538 - time (sec): 38.49 - samples/sec: 7560.34 - lr: 0.012500
2023-04-06 05:01:52,116 epoch 120 - iter 795/2650 - loss 0.01504738 - time (sec): 57.69 - samples/sec: 7577.00 - lr: 0.012500
2023-04-06 05:02:11,577 epoch 120 - iter 1060/2650 - loss 0.01522187 - time (sec): 77.15 - samples/sec: 7560.62 - lr: 0.012500
2023-04-06 05:02:31,395 epoch 120 - iter 1325/2650 - loss 0.01543737 - time (sec): 96.97 - samples/sec: 7536.76 - lr: 0.012500
2023-04-06 05:02:50,522 epoch 120 - iter 1590/2650 - loss 0.01524426 - time (sec): 116.10 - samples/sec: 7558.10 - lr: 0.012500
2023-04-06 05:03:09,981 epoch 120 - iter 1855/2650 - loss 0.01537529 - time (sec): 135.55 - samples/sec: 7544.10 - lr: 0.012500
2023-04-06 05:03:29,407 epoch 120 - iter 2120/2650 - loss 0.01547088 - time (sec): 154.98 - samples/sec: 7546.16 - lr: 0.012500
2023-04-06 05:03:49,386 epoch 120 - iter 2385/2650 - loss 0.01553172 - time (sec): 174.96 - samples/sec: 7521.27 - lr: 0.012500
2023-04-06 05:04:09,003 epoch 120 - iter 2650/2650 - loss 0.01543942 - time (sec): 194.58 - samples/sec: 7515.65 - lr: 0.012500
2023-04-06 05:04:09,003 ----------------------------------------------------------------------------------------------------
2023-04-06 05:04:09,003 EPOCH 120 done: loss 0.0154 - lr 0.012500
2023-04-06 05:04:09,004 BAD EPOCHS (no improvement): 0
2023-04-06 05:04:09,009 ----------------------------------------------------------------------------------------------------
2023-04-06 05:04:28,765 epoch 121 - iter 265/2650 - loss 0.01497412 - time (sec): 19.76 - samples/sec: 7400.69 - lr: 0.012500
2023-04-06 05:04:48,575 epoch 121 - iter 530/2650 - loss 0.01491218 - time (sec): 39.57 - samples/sec: 7371.06 - lr: 0.012500
2023-04-06 05:05:07,424 epoch 121 - iter 795/2650 - loss 0.01539899 - time (sec): 58.42 - samples/sec: 7452.82 - lr: 0.012500
2023-04-06 05:05:36,724 epoch 121 - iter 1060/2650 - loss 0.01533294 - time (sec): 87.72 - samples/sec: 6641.88 - lr: 0.012500
2023-04-06 05:05:56,474 epoch 121 - iter 1325/2650 - loss 0.01573708 - time (sec): 107.47 - samples/sec: 6797.94 - lr: 0.012500
2023-04-06 05:06:16,239 epoch 121 - iter 1590/2650 - loss 0.01573506 - time (sec): 127.23 - samples/sec: 6893.72 - lr: 0.012500
2023-04-06 05:06:36,098 epoch 121 - iter 1855/2650 - loss 0.01554577 - time (sec): 147.09 - samples/sec: 6973.30 - lr: 0.012500
2023-04-06 05:06:55,557 epoch 121 - iter 2120/2650 - loss 0.01547218 - time (sec): 166.55 - samples/sec: 7036.89 - lr: 0.012500
2023-04-06 05:07:14,558 epoch 121 - iter 2385/2650 - loss 0.01542829 - time (sec): 185.55 - samples/sec: 7105.84 - lr: 0.012500
2023-04-06 05:07:33,788 epoch 121 - iter 2650/2650 - loss 0.01539740 - time (sec): 204.78 - samples/sec: 7141.16 - lr: 0.012500
2023-04-06 05:07:33,789 ----------------------------------------------------------------------------------------------------
2023-04-06 05:07:33,789 EPOCH 121 done: loss 0.0154 - lr 0.012500
2023-04-06 05:07:33,789 BAD EPOCHS (no improvement): 0
2023-04-06 05:07:33,792 ----------------------------------------------------------------------------------------------------
2023-04-06 05:07:52,848 epoch 122 - iter 265/2650 - loss 0.01522095 - time (sec): 19.06 - samples/sec: 7632.38 - lr: 0.012500
2023-04-06 05:08:11,998 epoch 122 - iter 530/2650 - loss 0.01540688 - time (sec): 38.21 - samples/sec: 7635.93 - lr: 0.012500
2023-04-06 05:08:32,034 epoch 122 - iter 795/2650 - loss 0.01539960 - time (sec): 58.24 - samples/sec: 7539.04 - lr: 0.012500
2023-04-06 05:08:51,610 epoch 122 - iter 1060/2650 - loss 0.01555039 - time (sec): 77.82 - samples/sec: 7521.02 - lr: 0.012500
2023-04-06 05:09:11,443 epoch 122 - iter 1325/2650 - loss 0.01545018 - time (sec): 97.65 - samples/sec: 7494.06 - lr: 0.012500
2023-04-06 05:09:31,793 epoch 122 - iter 1590/2650 - loss 0.01533201 - time (sec): 118.00 - samples/sec: 7446.09 - lr: 0.012500
2023-04-06 05:09:51,521 epoch 122 - iter 1855/2650 - loss 0.01518317 - time (sec): 137.73 - samples/sec: 7441.34 - lr: 0.012500
2023-04-06 05:10:11,666 epoch 122 - iter 2120/2650 - loss 0.01505003 - time (sec): 157.87 - samples/sec: 7425.26 - lr: 0.012500
2023-04-06 05:10:30,686 epoch 122 - iter 2385/2650 - loss 0.01508182 - time (sec): 176.89 - samples/sec: 7443.41 - lr: 0.012500
2023-04-06 05:10:50,143 epoch 122 - iter 2650/2650 - loss 0.01498612 - time (sec): 196.35 - samples/sec: 7447.69 - lr: 0.012500
2023-04-06 05:10:50,144 ----------------------------------------------------------------------------------------------------
2023-04-06 05:10:50,144 EPOCH 122 done: loss 0.0150 - lr 0.012500
2023-04-06 05:10:50,144 BAD EPOCHS (no improvement): 0
2023-04-06 05:10:50,146 ----------------------------------------------------------------------------------------------------
2023-04-06 05:11:10,063 epoch 123 - iter 265/2650 - loss 0.01470506 - time (sec): 19.92 - samples/sec: 7423.66 - lr: 0.012500
2023-04-06 05:11:29,486 epoch 123 - iter 530/2650 - loss 0.01487188 - time (sec): 39.34 - samples/sec: 7451.92 - lr: 0.012500
2023-04-06 05:11:49,222 epoch 123 - iter 795/2650 - loss 0.01489015 - time (sec): 59.08 - samples/sec: 7435.45 - lr: 0.012500
2023-04-06 05:12:08,947 epoch 123 - iter 1060/2650 - loss 0.01474841 - time (sec): 78.80 - samples/sec: 7418.42 - lr: 0.012500
2023-04-06 05:12:28,255 epoch 123 - iter 1325/2650 - loss 0.01467092 - time (sec): 98.11 - samples/sec: 7435.07 - lr: 0.012500
2023-04-06 05:12:48,166 epoch 123 - iter 1590/2650 - loss 0.01481652 - time (sec): 118.02 - samples/sec: 7422.88 - lr: 0.012500
2023-04-06 05:13:07,432 epoch 123 - iter 1855/2650 - loss 0.01484548 - time (sec): 137.29 - samples/sec: 7438.84 - lr: 0.012500
2023-04-06 05:13:27,212 epoch 123 - iter 2120/2650 - loss 0.01496353 - time (sec): 157.07 - samples/sec: 7429.72 - lr: 0.012500
2023-04-06 05:13:47,247 epoch 123 - iter 2385/2650 - loss 0.01503001 - time (sec): 177.10 - samples/sec: 7424.86 - lr: 0.012500
2023-04-06 05:14:06,852 epoch 123 - iter 2650/2650 - loss 0.01504476 - time (sec): 196.71 - samples/sec: 7434.29 - lr: 0.012500
2023-04-06 05:14:06,852 ----------------------------------------------------------------------------------------------------
2023-04-06 05:14:06,852 EPOCH 123 done: loss 0.0150 - lr 0.012500
2023-04-06 05:14:06,852 BAD EPOCHS (no improvement): 1
2023-04-06 05:14:06,855 ----------------------------------------------------------------------------------------------------
2023-04-06 05:14:25,863 epoch 124 - iter 265/2650 - loss 0.01539088 - time (sec): 19.01 - samples/sec: 7579.83 - lr: 0.012500
2023-04-06 05:14:45,041 epoch 124 - iter 530/2650 - loss 0.01506490 - time (sec): 38.19 - samples/sec: 7640.23 - lr: 0.012500
2023-04-06 05:15:04,858 epoch 124 - iter 795/2650 - loss 0.01504928 - time (sec): 58.00 - samples/sec: 7542.27 - lr: 0.012500
2023-04-06 05:15:24,342 epoch 124 - iter 1060/2650 - loss 0.01497258 - time (sec): 77.49 - samples/sec: 7516.65 - lr: 0.012500
2023-04-06 05:15:44,537 epoch 124 - iter 1325/2650 - loss 0.01504758 - time (sec): 97.68 - samples/sec: 7466.76 - lr: 0.012500
2023-04-06 05:16:04,348 epoch 124 - iter 1590/2650 - loss 0.01524514 - time (sec): 117.49 - samples/sec: 7464.53 - lr: 0.012500
2023-04-06 05:16:23,723 epoch 124 - iter 1855/2650 - loss 0.01527755 - time (sec): 136.87 - samples/sec: 7475.24 - lr: 0.012500
2023-04-06 05:16:43,793 epoch 124 - iter 2120/2650 - loss 0.01520258 - time (sec): 156.94 - samples/sec: 7460.86 - lr: 0.012500
2023-04-06 05:17:03,585 epoch 124 - iter 2385/2650 - loss 0.01519706 - time (sec): 176.73 - samples/sec: 7454.04 - lr: 0.012500
2023-04-06 05:17:22,742 epoch 124 - iter 2650/2650 - loss 0.01513811 - time (sec): 195.89 - samples/sec: 7465.32 - lr: 0.012500
2023-04-06 05:17:22,743 ----------------------------------------------------------------------------------------------------
2023-04-06 05:17:22,743 EPOCH 124 done: loss 0.0151 - lr 0.012500
2023-04-06 05:17:22,743 BAD EPOCHS (no improvement): 2
2023-04-06 05:17:22,747 ----------------------------------------------------------------------------------------------------
2023-04-06 05:17:42,155 epoch 125 - iter 265/2650 - loss 0.01519875 - time (sec): 19.41 - samples/sec: 7424.43 - lr: 0.012500
2023-04-06 05:18:02,164 epoch 125 - iter 530/2650 - loss 0.01453886 - time (sec): 39.42 - samples/sec: 7404.47 - lr: 0.012500
2023-04-06 05:18:22,162 epoch 125 - iter 795/2650 - loss 0.01475092 - time (sec): 59.42 - samples/sec: 7344.99 - lr: 0.012500
2023-04-06 05:18:41,660 epoch 125 - iter 1060/2650 - loss 0.01459919 - time (sec): 78.91 - samples/sec: 7392.98 - lr: 0.012500
2023-04-06 05:19:00,702 epoch 125 - iter 1325/2650 - loss 0.01459180 - time (sec): 97.95 - samples/sec: 7442.23 - lr: 0.012500
2023-04-06 05:19:20,153 epoch 125 - iter 1590/2650 - loss 0.01453606 - time (sec): 117.41 - samples/sec: 7447.28 - lr: 0.012500
2023-04-06 05:19:39,636 epoch 125 - iter 1855/2650 - loss 0.01454958 - time (sec): 136.89 - samples/sec: 7455.66 - lr: 0.012500
2023-04-06 05:19:59,960 epoch 125 - iter 2120/2650 - loss 0.01468428 - time (sec): 157.21 - samples/sec: 7438.56 - lr: 0.012500
2023-04-06 05:20:19,720 epoch 125 - iter 2385/2650 - loss 0.01479486 - time (sec): 176.97 - samples/sec: 7437.00 - lr: 0.012500
2023-04-06 05:20:39,274 epoch 125 - iter 2650/2650 - loss 0.01489035 - time (sec): 196.53 - samples/sec: 7441.03 - lr: 0.012500
2023-04-06 05:20:39,274 ----------------------------------------------------------------------------------------------------
2023-04-06 05:20:39,274 EPOCH 125 done: loss 0.0149 - lr 0.012500
2023-04-06 05:20:39,274 BAD EPOCHS (no improvement): 0
2023-04-06 05:20:39,277 ----------------------------------------------------------------------------------------------------
2023-04-06 05:20:58,860 epoch 126 - iter 265/2650 - loss 0.01443809 - time (sec): 19.58 - samples/sec: 7481.96 - lr: 0.012500
2023-04-06 05:21:18,302 epoch 126 - iter 530/2650 - loss 0.01480784 - time (sec): 39.02 - samples/sec: 7457.83 - lr: 0.012500
2023-04-06 05:21:37,497 epoch 126 - iter 795/2650 - loss 0.01472748 - time (sec): 58.22 - samples/sec: 7511.14 - lr: 0.012500
2023-04-06 05:21:57,661 epoch 126 - iter 1060/2650 - loss 0.01466737 - time (sec): 78.38 - samples/sec: 7465.04 - lr: 0.012500
2023-04-06 05:22:16,688 epoch 126 - iter 1325/2650 - loss 0.01479125 - time (sec): 97.41 - samples/sec: 7510.36 - lr: 0.012500
2023-04-06 05:22:36,933 epoch 126 - iter 1590/2650 - loss 0.01488758 - time (sec): 117.66 - samples/sec: 7452.42 - lr: 0.012500
2023-04-06 05:22:56,564 epoch 126 - iter 1855/2650 - loss 0.01478776 - time (sec): 137.29 - samples/sec: 7464.98 - lr: 0.012500
2023-04-06 05:23:16,327 epoch 126 - iter 2120/2650 - loss 0.01487469 - time (sec): 157.05 - samples/sec: 7458.96 - lr: 0.012500
2023-04-06 05:23:35,553 epoch 126 - iter 2385/2650 - loss 0.01481078 - time (sec): 176.27 - samples/sec: 7477.71 - lr: 0.012500
2023-04-06 05:23:54,331 epoch 126 - iter 2650/2650 - loss 0.01490695 - time (sec): 195.05 - samples/sec: 7497.26 - lr: 0.012500
2023-04-06 05:23:54,331 ----------------------------------------------------------------------------------------------------
2023-04-06 05:23:54,331 EPOCH 126 done: loss 0.0149 - lr 0.012500
2023-04-06 05:23:54,331 BAD EPOCHS (no improvement): 1
2023-04-06 05:23:54,335 ----------------------------------------------------------------------------------------------------
2023-04-06 05:24:13,560 epoch 127 - iter 265/2650 - loss 0.01494358 - time (sec): 19.22 - samples/sec: 7692.66 - lr: 0.012500
2023-04-06 05:24:31,934 epoch 127 - iter 530/2650 - loss 0.01438251 - time (sec): 37.60 - samples/sec: 7768.28 - lr: 0.012500
2023-04-06 05:24:50,828 epoch 127 - iter 795/2650 - loss 0.01453142 - time (sec): 56.49 - samples/sec: 7727.62 - lr: 0.012500
2023-04-06 05:25:10,570 epoch 127 - iter 1060/2650 - loss 0.01448338 - time (sec): 76.24 - samples/sec: 7646.57 - lr: 0.012500
2023-04-06 05:25:30,351 epoch 127 - iter 1325/2650 - loss 0.01470342 - time (sec): 96.02 - samples/sec: 7586.13 - lr: 0.012500
2023-04-06 05:25:49,210 epoch 127 - iter 1590/2650 - loss 0.01467779 - time (sec): 114.88 - samples/sec: 7618.79 - lr: 0.012500
2023-04-06 05:26:08,853 epoch 127 - iter 1855/2650 - loss 0.01483106 - time (sec): 134.52 - samples/sec: 7600.50 - lr: 0.012500
2023-04-06 05:26:28,519 epoch 127 - iter 2120/2650 - loss 0.01483370 - time (sec): 154.18 - samples/sec: 7586.21 - lr: 0.012500
2023-04-06 05:26:47,915 epoch 127 - iter 2385/2650 - loss 0.01489453 - time (sec): 173.58 - samples/sec: 7583.08 - lr: 0.012500
2023-04-06 05:27:07,077 epoch 127 - iter 2650/2650 - loss 0.01483909 - time (sec): 192.74 - samples/sec: 7587.17 - lr: 0.012500
2023-04-06 05:27:07,077 ----------------------------------------------------------------------------------------------------
2023-04-06 05:27:07,077 EPOCH 127 done: loss 0.0148 - lr 0.012500
2023-04-06 05:27:07,077 BAD EPOCHS (no improvement): 0
2023-04-06 05:27:07,080 ----------------------------------------------------------------------------------------------------
2023-04-06 05:27:26,185 epoch 128 - iter 265/2650 - loss 0.01330828 - time (sec): 19.10 - samples/sec: 7637.54 - lr: 0.012500
2023-04-06 05:27:45,170 epoch 128 - iter 530/2650 - loss 0.01389517 - time (sec): 38.09 - samples/sec: 7672.04 - lr: 0.012500
2023-04-06 05:28:04,920 epoch 128 - iter 795/2650 - loss 0.01439307 - time (sec): 57.84 - samples/sec: 7602.19 - lr: 0.012500
2023-04-06 05:28:23,969 epoch 128 - iter 1060/2650 - loss 0.01442468 - time (sec): 76.89 - samples/sec: 7623.01 - lr: 0.012500
2023-04-06 05:28:43,401 epoch 128 - iter 1325/2650 - loss 0.01435687 - time (sec): 96.32 - samples/sec: 7605.01 - lr: 0.012500
2023-04-06 05:29:02,493 epoch 128 - iter 1590/2650 - loss 0.01448037 - time (sec): 115.41 - samples/sec: 7606.04 - lr: 0.012500
2023-04-06 05:29:22,351 epoch 128 - iter 1855/2650 - loss 0.01457775 - time (sec): 135.27 - samples/sec: 7574.22 - lr: 0.012500
2023-04-06 05:29:41,201 epoch 128 - iter 2120/2650 - loss 0.01458015 - time (sec): 154.12 - samples/sec: 7596.71 - lr: 0.012500
2023-04-06 05:30:00,668 epoch 128 - iter 2385/2650 - loss 0.01455667 - time (sec): 173.59 - samples/sec: 7582.95 - lr: 0.012500
2023-04-06 05:30:20,331 epoch 128 - iter 2650/2650 - loss 0.01469323 - time (sec): 193.25 - samples/sec: 7567.17 - lr: 0.012500
2023-04-06 05:30:20,332 ----------------------------------------------------------------------------------------------------
2023-04-06 05:30:20,332 EPOCH 128 done: loss 0.0147 - lr 0.012500
2023-04-06 05:30:20,332 BAD EPOCHS (no improvement): 0
2023-04-06 05:30:20,334 ----------------------------------------------------------------------------------------------------
2023-04-06 05:30:39,071 epoch 129 - iter 265/2650 - loss 0.01529547 - time (sec): 18.74 - samples/sec: 7756.95 - lr: 0.012500
2023-04-06 05:30:58,404 epoch 129 - iter 530/2650 - loss 0.01535124 - time (sec): 38.07 - samples/sec: 7606.11 - lr: 0.012500
2023-04-06 05:31:18,098 epoch 129 - iter 795/2650 - loss 0.01554276 - time (sec): 57.76 - samples/sec: 7565.15 - lr: 0.012500
2023-04-06 05:31:37,527 epoch 129 - iter 1060/2650 - loss 0.01559818 - time (sec): 77.19 - samples/sec: 7566.23 - lr: 0.012500
2023-04-06 05:31:56,752 epoch 129 - iter 1325/2650 - loss 0.01520606 - time (sec): 96.42 - samples/sec: 7571.29 - lr: 0.012500
2023-04-06 05:32:15,710 epoch 129 - iter 1590/2650 - loss 0.01510911 - time (sec): 115.38 - samples/sec: 7591.14 - lr: 0.012500
2023-04-06 05:32:35,439 epoch 129 - iter 1855/2650 - loss 0.01509652 - time (sec): 135.10 - samples/sec: 7573.27 - lr: 0.012500
2023-04-06 05:32:54,959 epoch 129 - iter 2120/2650 - loss 0.01495564 - time (sec): 154.62 - samples/sec: 7563.53 - lr: 0.012500
2023-04-06 05:33:14,305 epoch 129 - iter 2385/2650 - loss 0.01505397 - time (sec): 173.97 - samples/sec: 7566.63 - lr: 0.012500
2023-04-06 05:33:33,699 epoch 129 - iter 2650/2650 - loss 0.01503245 - time (sec): 193.36 - samples/sec: 7562.72 - lr: 0.012500
2023-04-06 05:33:33,699 ----------------------------------------------------------------------------------------------------
2023-04-06 05:33:33,700 EPOCH 129 done: loss 0.0150 - lr 0.012500
2023-04-06 05:33:33,700 BAD EPOCHS (no improvement): 1
2023-04-06 05:33:33,704 ----------------------------------------------------------------------------------------------------
2023-04-06 05:33:52,916 epoch 130 - iter 265/2650 - loss 0.01440739 - time (sec): 19.21 - samples/sec: 7625.30 - lr: 0.012500
2023-04-06 05:34:12,542 epoch 130 - iter 530/2650 - loss 0.01467286 - time (sec): 38.84 - samples/sec: 7583.85 - lr: 0.012500
2023-04-06 05:34:31,842 epoch 130 - iter 795/2650 - loss 0.01481253 - time (sec): 58.14 - samples/sec: 7606.58 - lr: 0.012500
2023-04-06 05:34:50,991 epoch 130 - iter 1060/2650 - loss 0.01486888 - time (sec): 77.29 - samples/sec: 7587.94 - lr: 0.012500
2023-04-06 05:35:09,945 epoch 130 - iter 1325/2650 - loss 0.01495354 - time (sec): 96.24 - samples/sec: 7615.37 - lr: 0.012500
2023-04-06 05:35:29,735 epoch 130 - iter 1590/2650 - loss 0.01486580 - time (sec): 116.03 - samples/sec: 7571.54 - lr: 0.012500
2023-04-06 05:35:48,574 epoch 130 - iter 1855/2650 - loss 0.01489353 - time (sec): 134.87 - samples/sec: 7592.53 - lr: 0.012500
2023-04-06 05:36:07,525 epoch 130 - iter 2120/2650 - loss 0.01496772 - time (sec): 153.82 - samples/sec: 7606.63 - lr: 0.012500
2023-04-06 05:36:26,664 epoch 130 - iter 2385/2650 - loss 0.01511212 - time (sec): 172.96 - samples/sec: 7605.53 - lr: 0.012500
2023-04-06 05:36:46,245 epoch 130 - iter 2650/2650 - loss 0.01508575 - time (sec): 192.54 - samples/sec: 7595.07 - lr: 0.012500
2023-04-06 05:36:46,245 ----------------------------------------------------------------------------------------------------
2023-04-06 05:36:46,245 EPOCH 130 done: loss 0.0151 - lr 0.012500
2023-04-06 05:36:46,245 BAD EPOCHS (no improvement): 2
2023-04-06 05:36:46,248 ----------------------------------------------------------------------------------------------------
2023-04-06 05:37:05,511 epoch 131 - iter 265/2650 - loss 0.01446491 - time (sec): 19.26 - samples/sec: 7560.79 - lr: 0.012500
2023-04-06 05:37:25,319 epoch 131 - iter 530/2650 - loss 0.01506208 - time (sec): 39.07 - samples/sec: 7537.75 - lr: 0.012500
2023-04-06 05:37:44,215 epoch 131 - iter 795/2650 - loss 0.01514512 - time (sec): 57.97 - samples/sec: 7591.27 - lr: 0.012500
2023-04-06 05:38:03,157 epoch 131 - iter 1060/2650 - loss 0.01489944 - time (sec): 76.91 - samples/sec: 7599.56 - lr: 0.012500
2023-04-06 05:38:22,366 epoch 131 - iter 1325/2650 - loss 0.01488990 - time (sec): 96.12 - samples/sec: 7593.80 - lr: 0.012500
2023-04-06 05:38:41,400 epoch 131 - iter 1590/2650 - loss 0.01471249 - time (sec): 115.15 - samples/sec: 7597.70 - lr: 0.012500
2023-04-06 05:39:00,492 epoch 131 - iter 1855/2650 - loss 0.01469313 - time (sec): 134.24 - samples/sec: 7609.12 - lr: 0.012500
2023-04-06 05:39:20,145 epoch 131 - iter 2120/2650 - loss 0.01469184 - time (sec): 153.90 - samples/sec: 7588.20 - lr: 0.012500
2023-04-06 05:39:39,345 epoch 131 - iter 2385/2650 - loss 0.01467354 - time (sec): 173.10 - samples/sec: 7594.57 - lr: 0.012500
2023-04-06 05:39:58,977 epoch 131 - iter 2650/2650 - loss 0.01470156 - time (sec): 192.73 - samples/sec: 7587.70 - lr: 0.012500
2023-04-06 05:39:58,977 ----------------------------------------------------------------------------------------------------
2023-04-06 05:39:58,977 EPOCH 131 done: loss 0.0147 - lr 0.012500
2023-04-06 05:39:58,977 BAD EPOCHS (no improvement): 3
2023-04-06 05:39:58,981 ----------------------------------------------------------------------------------------------------
2023-04-06 05:40:18,410 epoch 132 - iter 265/2650 - loss 0.01402472 - time (sec): 19.43 - samples/sec: 7546.64 - lr: 0.012500
2023-04-06 05:40:37,951 epoch 132 - iter 530/2650 - loss 0.01387620 - time (sec): 38.97 - samples/sec: 7546.59 - lr: 0.012500
2023-04-06 05:40:57,265 epoch 132 - iter 795/2650 - loss 0.01447668 - time (sec): 58.28 - samples/sec: 7539.39 - lr: 0.012500
2023-04-06 05:41:16,845 epoch 132 - iter 1060/2650 - loss 0.01441731 - time (sec): 77.86 - samples/sec: 7524.46 - lr: 0.012500
2023-04-06 05:41:36,534 epoch 132 - iter 1325/2650 - loss 0.01455027 - time (sec): 97.55 - samples/sec: 7518.21 - lr: 0.012500
2023-04-06 05:41:56,027 epoch 132 - iter 1590/2650 - loss 0.01477634 - time (sec): 117.05 - samples/sec: 7503.18 - lr: 0.012500
2023-04-06 05:42:15,953 epoch 132 - iter 1855/2650 - loss 0.01460992 - time (sec): 136.97 - samples/sec: 7488.91 - lr: 0.012500
2023-04-06 05:42:34,943 epoch 132 - iter 2120/2650 - loss 0.01463616 - time (sec): 155.96 - samples/sec: 7520.09 - lr: 0.012500
2023-04-06 05:42:53,826 epoch 132 - iter 2385/2650 - loss 0.01453154 - time (sec): 174.85 - samples/sec: 7539.21 - lr: 0.012500
2023-04-06 05:43:12,514 epoch 132 - iter 2650/2650 - loss 0.01462108 - time (sec): 193.53 - samples/sec: 7556.13 - lr: 0.012500
2023-04-06 05:43:12,514 ----------------------------------------------------------------------------------------------------
2023-04-06 05:43:12,514 EPOCH 132 done: loss 0.0146 - lr 0.012500
2023-04-06 05:43:12,514 BAD EPOCHS (no improvement): 0
2023-04-06 05:43:12,518 ----------------------------------------------------------------------------------------------------
2023-04-06 05:43:31,289 epoch 133 - iter 265/2650 - loss 0.01458705 - time (sec): 18.77 - samples/sec: 7711.86 - lr: 0.012500
2023-04-06 05:43:50,662 epoch 133 - iter 530/2650 - loss 0.01442090 - time (sec): 38.14 - samples/sec: 7656.27 - lr: 0.012500
2023-04-06 05:44:09,901 epoch 133 - iter 795/2650 - loss 0.01468548 - time (sec): 57.38 - samples/sec: 7641.48 - lr: 0.012500
2023-04-06 05:44:29,827 epoch 133 - iter 1060/2650 - loss 0.01478404 - time (sec): 77.31 - samples/sec: 7580.43 - lr: 0.012500
2023-04-06 05:44:49,384 epoch 133 - iter 1325/2650 - loss 0.01488169 - time (sec): 96.87 - samples/sec: 7578.17 - lr: 0.012500
2023-04-06 05:45:09,169 epoch 133 - iter 1590/2650 - loss 0.01482854 - time (sec): 116.65 - samples/sec: 7555.91 - lr: 0.012500
2023-04-06 05:45:28,197 epoch 133 - iter 1855/2650 - loss 0.01475581 - time (sec): 135.68 - samples/sec: 7571.01 - lr: 0.012500
2023-04-06 05:45:47,271 epoch 133 - iter 2120/2650 - loss 0.01470370 - time (sec): 154.75 - samples/sec: 7573.67 - lr: 0.012500
2023-04-06 05:46:06,677 epoch 133 - iter 2385/2650 - loss 0.01462834 - time (sec): 174.16 - samples/sec: 7566.31 - lr: 0.012500
2023-04-06 05:46:25,642 epoch 133 - iter 2650/2650 - loss 0.01453582 - time (sec): 193.12 - samples/sec: 7572.14 - lr: 0.012500
2023-04-06 05:46:25,643 ----------------------------------------------------------------------------------------------------
2023-04-06 05:46:25,643 EPOCH 133 done: loss 0.0145 - lr 0.012500
2023-04-06 05:46:25,643 BAD EPOCHS (no improvement): 0
2023-04-06 05:46:25,646 ----------------------------------------------------------------------------------------------------
2023-04-06 05:46:44,673 epoch 134 - iter 265/2650 - loss 0.01378162 - time (sec): 19.03 - samples/sec: 7633.28 - lr: 0.012500
2023-04-06 05:47:03,953 epoch 134 - iter 530/2650 - loss 0.01430512 - time (sec): 38.31 - samples/sec: 7609.09 - lr: 0.012500
2023-04-06 05:47:22,821 epoch 134 - iter 795/2650 - loss 0.01462161 - time (sec): 57.17 - samples/sec: 7646.50 - lr: 0.012500
2023-04-06 05:47:42,398 epoch 134 - iter 1060/2650 - loss 0.01462484 - time (sec): 76.75 - samples/sec: 7612.45 - lr: 0.012500
2023-04-06 05:48:01,903 epoch 134 - iter 1325/2650 - loss 0.01463830 - time (sec): 96.26 - samples/sec: 7583.10 - lr: 0.012500
2023-04-06 05:48:21,970 epoch 134 - iter 1590/2650 - loss 0.01461591 - time (sec): 116.32 - samples/sec: 7553.78 - lr: 0.012500
2023-04-06 05:48:40,954 epoch 134 - iter 1855/2650 - loss 0.01471125 - time (sec): 135.31 - samples/sec: 7562.39 - lr: 0.012500
2023-04-06 05:49:00,216 epoch 134 - iter 2120/2650 - loss 0.01470502 - time (sec): 154.57 - samples/sec: 7562.36 - lr: 0.012500
2023-04-06 05:49:19,917 epoch 134 - iter 2385/2650 - loss 0.01477117 - time (sec): 174.27 - samples/sec: 7549.15 - lr: 0.012500
2023-04-06 05:49:39,209 epoch 134 - iter 2650/2650 - loss 0.01479876 - time (sec): 193.56 - samples/sec: 7555.00 - lr: 0.012500
2023-04-06 05:49:39,209 ----------------------------------------------------------------------------------------------------
2023-04-06 05:49:39,209 EPOCH 134 done: loss 0.0148 - lr 0.012500
2023-04-06 05:49:39,209 BAD EPOCHS (no improvement): 1
2023-04-06 05:49:39,213 ----------------------------------------------------------------------------------------------------
2023-04-06 05:49:58,684 epoch 135 - iter 265/2650 - loss 0.01536626 - time (sec): 19.47 - samples/sec: 7510.91 - lr: 0.012500
2023-04-06 05:50:17,766 epoch 135 - iter 530/2650 - loss 0.01467966 - time (sec): 38.55 - samples/sec: 7599.05 - lr: 0.012500
2023-04-06 05:50:36,752 epoch 135 - iter 795/2650 - loss 0.01483821 - time (sec): 57.54 - samples/sec: 7636.84 - lr: 0.012500
2023-04-06 05:50:56,102 epoch 135 - iter 1060/2650 - loss 0.01454432 - time (sec): 76.89 - samples/sec: 7629.91 - lr: 0.012500
2023-04-06 05:51:14,508 epoch 135 - iter 1325/2650 - loss 0.01465525 - time (sec): 95.29 - samples/sec: 7662.09 - lr: 0.012500
2023-04-06 05:51:34,360 epoch 135 - iter 1590/2650 - loss 0.01470205 - time (sec): 115.15 - samples/sec: 7613.35 - lr: 0.012500
2023-04-06 05:51:54,301 epoch 135 - iter 1855/2650 - loss 0.01484550 - time (sec): 135.09 - samples/sec: 7579.60 - lr: 0.012500
2023-04-06 05:52:13,391 epoch 135 - iter 2120/2650 - loss 0.01474346 - time (sec): 154.18 - samples/sec: 7587.99 - lr: 0.012500
2023-04-06 05:52:33,126 epoch 135 - iter 2385/2650 - loss 0.01471970 - time (sec): 173.91 - samples/sec: 7572.28 - lr: 0.012500
2023-04-06 05:52:52,946 epoch 135 - iter 2650/2650 - loss 0.01456113 - time (sec): 193.73 - samples/sec: 7548.36 - lr: 0.012500
2023-04-06 05:52:52,947 ----------------------------------------------------------------------------------------------------
2023-04-06 05:52:52,947 EPOCH 135 done: loss 0.0146 - lr 0.012500
2023-04-06 05:52:52,947 BAD EPOCHS (no improvement): 2
2023-04-06 05:52:52,950 ----------------------------------------------------------------------------------------------------
2023-04-06 05:53:12,206 epoch 136 - iter 265/2650 - loss 0.01441718 - time (sec): 19.26 - samples/sec: 7606.99 - lr: 0.012500
2023-04-06 05:53:31,136 epoch 136 - iter 530/2650 - loss 0.01404814 - time (sec): 38.19 - samples/sec: 7637.20 - lr: 0.012500
2023-04-06 05:53:50,751 epoch 136 - iter 795/2650 - loss 0.01430622 - time (sec): 57.80 - samples/sec: 7567.07 - lr: 0.012500
2023-04-06 05:54:09,669 epoch 136 - iter 1060/2650 - loss 0.01443636 - time (sec): 76.72 - samples/sec: 7617.86 - lr: 0.012500
2023-04-06 05:54:29,302 epoch 136 - iter 1325/2650 - loss 0.01435117 - time (sec): 96.35 - samples/sec: 7596.06 - lr: 0.012500
2023-04-06 05:54:48,401 epoch 136 - iter 1590/2650 - loss 0.01442731 - time (sec): 115.45 - samples/sec: 7603.86 - lr: 0.012500
2023-04-06 05:55:08,104 epoch 136 - iter 1855/2650 - loss 0.01440801 - time (sec): 135.15 - samples/sec: 7585.67 - lr: 0.012500
2023-04-06 05:55:27,737 epoch 136 - iter 2120/2650 - loss 0.01439947 - time (sec): 154.79 - samples/sec: 7568.15 - lr: 0.012500
2023-04-06 05:55:56,689 epoch 136 - iter 2385/2650 - loss 0.01448215 - time (sec): 183.74 - samples/sec: 7162.90 - lr: 0.012500
2023-04-06 05:56:16,139 epoch 136 - iter 2650/2650 - loss 0.01442108 - time (sec): 203.19 - samples/sec: 7197.04 - lr: 0.012500
2023-04-06 05:56:16,140 ----------------------------------------------------------------------------------------------------
2023-04-06 05:56:16,140 EPOCH 136 done: loss 0.0144 - lr 0.012500
2023-04-06 05:56:16,140 BAD EPOCHS (no improvement): 0
2023-04-06 05:56:16,143 ----------------------------------------------------------------------------------------------------
2023-04-06 05:56:35,544 epoch 137 - iter 265/2650 - loss 0.01418556 - time (sec): 19.40 - samples/sec: 7487.81 - lr: 0.012500
2023-04-06 05:56:54,880 epoch 137 - iter 530/2650 - loss 0.01440564 - time (sec): 38.74 - samples/sec: 7553.99 - lr: 0.012500
2023-04-06 05:57:14,854 epoch 137 - iter 795/2650 - loss 0.01466204 - time (sec): 58.71 - samples/sec: 7470.50 - lr: 0.012500
2023-04-06 05:57:34,415 epoch 137 - iter 1060/2650 - loss 0.01454787 - time (sec): 78.27 - samples/sec: 7494.09 - lr: 0.012500
2023-04-06 05:57:53,576 epoch 137 - iter 1325/2650 - loss 0.01459829 - time (sec): 97.43 - samples/sec: 7517.48 - lr: 0.012500
2023-04-06 05:58:12,892 epoch 137 - iter 1590/2650 - loss 0.01479954 - time (sec): 116.75 - samples/sec: 7532.83 - lr: 0.012500
2023-04-06 05:58:31,880 epoch 137 - iter 1855/2650 - loss 0.01476799 - time (sec): 135.74 - samples/sec: 7555.51 - lr: 0.012500
2023-04-06 05:58:51,109 epoch 137 - iter 2120/2650 - loss 0.01470614 - time (sec): 154.97 - samples/sec: 7550.95 - lr: 0.012500
2023-04-06 05:59:10,367 epoch 137 - iter 2385/2650 - loss 0.01464214 - time (sec): 174.22 - samples/sec: 7554.78 - lr: 0.012500
2023-04-06 05:59:29,923 epoch 137 - iter 2650/2650 - loss 0.01458605 - time (sec): 193.78 - samples/sec: 7546.53 - lr: 0.012500
2023-04-06 05:59:29,923 ----------------------------------------------------------------------------------------------------
2023-04-06 05:59:29,923 EPOCH 137 done: loss 0.0146 - lr 0.012500
2023-04-06 05:59:29,923 BAD EPOCHS (no improvement): 1
2023-04-06 05:59:29,925 ----------------------------------------------------------------------------------------------------
2023-04-06 05:59:49,431 epoch 138 - iter 265/2650 - loss 0.01387904 - time (sec): 19.51 - samples/sec: 7516.98 - lr: 0.012500
2023-04-06 06:00:08,708 epoch 138 - iter 530/2650 - loss 0.01413778 - time (sec): 38.78 - samples/sec: 7514.24 - lr: 0.012500
2023-04-06 06:00:28,031 epoch 138 - iter 795/2650 - loss 0.01417219 - time (sec): 58.11 - samples/sec: 7556.13 - lr: 0.012500
2023-04-06 06:00:47,304 epoch 138 - iter 1060/2650 - loss 0.01402947 - time (sec): 77.38 - samples/sec: 7557.51 - lr: 0.012500
2023-04-06 06:01:06,419 epoch 138 - iter 1325/2650 - loss 0.01410139 - time (sec): 96.49 - samples/sec: 7562.92 - lr: 0.012500
2023-04-06 06:01:25,520 epoch 138 - iter 1590/2650 - loss 0.01423930 - time (sec): 115.59 - samples/sec: 7585.03 - lr: 0.012500
2023-04-06 06:01:45,120 epoch 138 - iter 1855/2650 - loss 0.01419281 - time (sec): 135.19 - samples/sec: 7565.23 - lr: 0.012500
2023-04-06 06:02:04,645 epoch 138 - iter 2120/2650 - loss 0.01413906 - time (sec): 154.72 - samples/sec: 7560.99 - lr: 0.012500
2023-04-06 06:02:24,388 epoch 138 - iter 2385/2650 - loss 0.01407052 - time (sec): 174.46 - samples/sec: 7545.44 - lr: 0.012500
2023-04-06 06:02:44,140 epoch 138 - iter 2650/2650 - loss 0.01409925 - time (sec): 194.21 - samples/sec: 7529.62 - lr: 0.012500
2023-04-06 06:02:44,141 ----------------------------------------------------------------------------------------------------
2023-04-06 06:02:44,141 EPOCH 138 done: loss 0.0141 - lr 0.012500
2023-04-06 06:02:44,141 BAD EPOCHS (no improvement): 0
2023-04-06 06:02:44,145 ----------------------------------------------------------------------------------------------------
2023-04-06 06:03:03,505 epoch 139 - iter 265/2650 - loss 0.01371773 - time (sec): 19.36 - samples/sec: 7502.62 - lr: 0.012500
2023-04-06 06:03:22,884 epoch 139 - iter 530/2650 - loss 0.01398037 - time (sec): 38.74 - samples/sec: 7521.65 - lr: 0.012500
2023-04-06 06:03:42,336 epoch 139 - iter 795/2650 - loss 0.01426211 - time (sec): 58.19 - samples/sec: 7530.80 - lr: 0.012500
2023-04-06 06:04:01,748 epoch 139 - iter 1060/2650 - loss 0.01412886 - time (sec): 77.60 - samples/sec: 7530.13 - lr: 0.012500
2023-04-06 06:04:21,062 epoch 139 - iter 1325/2650 - loss 0.01436571 - time (sec): 96.92 - samples/sec: 7532.59 - lr: 0.012500
2023-04-06 06:04:40,471 epoch 139 - iter 1590/2650 - loss 0.01435607 - time (sec): 116.33 - samples/sec: 7544.25 - lr: 0.012500
2023-04-06 06:05:00,303 epoch 139 - iter 1855/2650 - loss 0.01431783 - time (sec): 136.16 - samples/sec: 7532.75 - lr: 0.012500
2023-04-06 06:05:19,735 epoch 139 - iter 2120/2650 - loss 0.01428623 - time (sec): 155.59 - samples/sec: 7538.18 - lr: 0.012500
2023-04-06 06:05:38,948 epoch 139 - iter 2385/2650 - loss 0.01427922 - time (sec): 174.80 - samples/sec: 7544.83 - lr: 0.012500
2023-04-06 06:05:58,023 epoch 139 - iter 2650/2650 - loss 0.01434115 - time (sec): 193.88 - samples/sec: 7542.70 - lr: 0.012500
2023-04-06 06:05:58,023 ----------------------------------------------------------------------------------------------------
2023-04-06 06:05:58,023 EPOCH 139 done: loss 0.0143 - lr 0.012500
2023-04-06 06:05:58,023 BAD EPOCHS (no improvement): 1
2023-04-06 06:05:58,026 ----------------------------------------------------------------------------------------------------
2023-04-06 06:06:17,155 epoch 140 - iter 265/2650 - loss 0.01388865 - time (sec): 19.13 - samples/sec: 7612.04 - lr: 0.012500
2023-04-06 06:06:36,950 epoch 140 - iter 530/2650 - loss 0.01412673 - time (sec): 38.92 - samples/sec: 7524.84 - lr: 0.012500
2023-04-06 06:06:56,634 epoch 140 - iter 795/2650 - loss 0.01444320 - time (sec): 58.61 - samples/sec: 7483.20 - lr: 0.012500
2023-04-06 06:07:15,709 epoch 140 - iter 1060/2650 - loss 0.01426230 - time (sec): 77.68 - samples/sec: 7497.09 - lr: 0.012500
2023-04-06 06:07:34,901 epoch 140 - iter 1325/2650 - loss 0.01404629 - time (sec): 96.87 - samples/sec: 7516.03 - lr: 0.012500
2023-04-06 06:07:54,177 epoch 140 - iter 1590/2650 - loss 0.01417075 - time (sec): 116.15 - samples/sec: 7537.23 - lr: 0.012500
2023-04-06 06:08:13,602 epoch 140 - iter 1855/2650 - loss 0.01410698 - time (sec): 135.58 - samples/sec: 7545.13 - lr: 0.012500
2023-04-06 06:08:32,973 epoch 140 - iter 2120/2650 - loss 0.01420417 - time (sec): 154.95 - samples/sec: 7543.83 - lr: 0.012500
2023-04-06 06:08:52,114 epoch 140 - iter 2385/2650 - loss 0.01431294 - time (sec): 174.09 - samples/sec: 7552.37 - lr: 0.012500
2023-04-06 06:09:11,976 epoch 140 - iter 2650/2650 - loss 0.01426967 - time (sec): 193.95 - samples/sec: 7539.93 - lr: 0.012500
2023-04-06 06:09:11,976 ----------------------------------------------------------------------------------------------------
2023-04-06 06:09:11,976 EPOCH 140 done: loss 0.0143 - lr 0.012500
2023-04-06 06:09:11,976 BAD EPOCHS (no improvement): 2
2023-04-06 06:09:11,979 ----------------------------------------------------------------------------------------------------
2023-04-06 06:09:31,085 epoch 141 - iter 265/2650 - loss 0.01483339 - time (sec): 19.11 - samples/sec: 7691.85 - lr: 0.012500
2023-04-06 06:09:50,692 epoch 141 - iter 530/2650 - loss 0.01421327 - time (sec): 38.71 - samples/sec: 7550.01 - lr: 0.012500
2023-04-06 06:10:09,939 epoch 141 - iter 795/2650 - loss 0.01421356 - time (sec): 57.96 - samples/sec: 7545.94 - lr: 0.012500
2023-04-06 06:10:28,557 epoch 141 - iter 1060/2650 - loss 0.01401051 - time (sec): 76.58 - samples/sec: 7598.22 - lr: 0.012500
2023-04-06 06:10:47,708 epoch 141 - iter 1325/2650 - loss 0.01386224 - time (sec): 95.73 - samples/sec: 7603.73 - lr: 0.012500
2023-04-06 06:11:07,253 epoch 141 - iter 1590/2650 - loss 0.01376792 - time (sec): 115.27 - samples/sec: 7589.31 - lr: 0.012500
2023-04-06 06:11:26,720 epoch 141 - iter 1855/2650 - loss 0.01388443 - time (sec): 134.74 - samples/sec: 7584.78 - lr: 0.012500
2023-04-06 06:11:45,849 epoch 141 - iter 2120/2650 - loss 0.01397421 - time (sec): 153.87 - samples/sec: 7586.05 - lr: 0.012500
2023-04-06 06:12:05,071 epoch 141 - iter 2385/2650 - loss 0.01401607 - time (sec): 173.09 - samples/sec: 7588.91 - lr: 0.012500
2023-04-06 06:12:25,643 epoch 141 - iter 2650/2650 - loss 0.01398971 - time (sec): 193.66 - samples/sec: 7551.04 - lr: 0.012500
2023-04-06 06:12:25,643 ----------------------------------------------------------------------------------------------------
2023-04-06 06:12:25,644 EPOCH 141 done: loss 0.0140 - lr 0.012500
2023-04-06 06:12:25,644 BAD EPOCHS (no improvement): 0
2023-04-06 06:12:25,647 ----------------------------------------------------------------------------------------------------
2023-04-06 06:12:45,272 epoch 142 - iter 265/2650 - loss 0.01492568 - time (sec): 19.63 - samples/sec: 7471.85 - lr: 0.012500
2023-04-06 06:13:04,403 epoch 142 - iter 530/2650 - loss 0.01491387 - time (sec): 38.76 - samples/sec: 7525.58 - lr: 0.012500
2023-04-06 06:13:23,827 epoch 142 - iter 795/2650 - loss 0.01493666 - time (sec): 58.18 - samples/sec: 7534.23 - lr: 0.012500
2023-04-06 06:13:43,009 epoch 142 - iter 1060/2650 - loss 0.01469487 - time (sec): 77.36 - samples/sec: 7552.93 - lr: 0.012500
2023-04-06 06:14:02,295 epoch 142 - iter 1325/2650 - loss 0.01457024 - time (sec): 96.65 - samples/sec: 7567.52 - lr: 0.012500
2023-04-06 06:14:21,615 epoch 142 - iter 1590/2650 - loss 0.01462338 - time (sec): 115.97 - samples/sec: 7580.24 - lr: 0.012500
2023-04-06 06:14:40,841 epoch 142 - iter 1855/2650 - loss 0.01455104 - time (sec): 135.19 - samples/sec: 7560.59 - lr: 0.012500
2023-04-06 06:15:00,266 epoch 142 - iter 2120/2650 - loss 0.01464407 - time (sec): 154.62 - samples/sec: 7567.31 - lr: 0.012500
2023-04-06 06:15:19,623 epoch 142 - iter 2385/2650 - loss 0.01442747 - time (sec): 173.98 - samples/sec: 7565.96 - lr: 0.012500
2023-04-06 06:15:39,477 epoch 142 - iter 2650/2650 - loss 0.01442749 - time (sec): 193.83 - samples/sec: 7544.58 - lr: 0.012500
2023-04-06 06:15:39,477 ----------------------------------------------------------------------------------------------------
2023-04-06 06:15:39,477 EPOCH 142 done: loss 0.0144 - lr 0.012500
2023-04-06 06:15:39,477 BAD EPOCHS (no improvement): 1
2023-04-06 06:15:39,480 ----------------------------------------------------------------------------------------------------
2023-04-06 06:15:58,813 epoch 143 - iter 265/2650 - loss 0.01342462 - time (sec): 19.33 - samples/sec: 7586.76 - lr: 0.012500
2023-04-06 06:16:17,904 epoch 143 - iter 530/2650 - loss 0.01399027 - time (sec): 38.42 - samples/sec: 7613.30 - lr: 0.012500
2023-04-06 06:16:37,406 epoch 143 - iter 795/2650 - loss 0.01393818 - time (sec): 57.93 - samples/sec: 7577.84 - lr: 0.012500
2023-04-06 06:16:57,208 epoch 143 - iter 1060/2650 - loss 0.01379454 - time (sec): 77.73 - samples/sec: 7524.77 - lr: 0.012500
2023-04-06 06:17:16,711 epoch 143 - iter 1325/2650 - loss 0.01415027 - time (sec): 97.23 - samples/sec: 7527.34 - lr: 0.012500
2023-04-06 06:17:35,816 epoch 143 - iter 1590/2650 - loss 0.01426226 - time (sec): 116.34 - samples/sec: 7564.19 - lr: 0.012500
2023-04-06 06:17:55,396 epoch 143 - iter 1855/2650 - loss 0.01416672 - time (sec): 135.92 - samples/sec: 7546.90 - lr: 0.012500
2023-04-06 06:18:14,599 epoch 143 - iter 2120/2650 - loss 0.01414412 - time (sec): 155.12 - samples/sec: 7560.38 - lr: 0.012500
2023-04-06 06:18:33,549 epoch 143 - iter 2385/2650 - loss 0.01417352 - time (sec): 174.07 - samples/sec: 7567.48 - lr: 0.012500
2023-04-06 06:18:52,945 epoch 143 - iter 2650/2650 - loss 0.01415801 - time (sec): 193.47 - samples/sec: 7558.80 - lr: 0.012500
2023-04-06 06:18:52,946 ----------------------------------------------------------------------------------------------------
2023-04-06 06:18:52,946 EPOCH 143 done: loss 0.0142 - lr 0.012500
2023-04-06 06:18:52,946 BAD EPOCHS (no improvement): 2
2023-04-06 06:18:52,950 ----------------------------------------------------------------------------------------------------
2023-04-06 06:19:13,230 epoch 144 - iter 265/2650 - loss 0.01439706 - time (sec): 20.28 - samples/sec: 7281.39 - lr: 0.012500
2023-04-06 06:19:32,097 epoch 144 - iter 530/2650 - loss 0.01460570 - time (sec): 39.15 - samples/sec: 7498.94 - lr: 0.012500
2023-04-06 06:19:51,181 epoch 144 - iter 795/2650 - loss 0.01447005 - time (sec): 58.23 - samples/sec: 7526.91 - lr: 0.012500
2023-04-06 06:20:10,808 epoch 144 - iter 1060/2650 - loss 0.01429584 - time (sec): 77.86 - samples/sec: 7518.26 - lr: 0.012500
2023-04-06 06:20:30,101 epoch 144 - iter 1325/2650 - loss 0.01433315 - time (sec): 97.15 - samples/sec: 7544.39 - lr: 0.012500
2023-04-06 06:20:49,528 epoch 144 - iter 1590/2650 - loss 0.01429718 - time (sec): 116.58 - samples/sec: 7542.67 - lr: 0.012500
2023-04-06 06:21:09,281 epoch 144 - iter 1855/2650 - loss 0.01432114 - time (sec): 136.33 - samples/sec: 7519.67 - lr: 0.012500
2023-04-06 06:21:28,496 epoch 144 - iter 2120/2650 - loss 0.01426102 - time (sec): 155.55 - samples/sec: 7528.75 - lr: 0.012500
2023-04-06 06:21:47,225 epoch 144 - iter 2385/2650 - loss 0.01412843 - time (sec): 174.28 - samples/sec: 7557.57 - lr: 0.012500
2023-04-06 06:22:06,502 epoch 144 - iter 2650/2650 - loss 0.01404888 - time (sec): 193.55 - samples/sec: 7555.41 - lr: 0.012500
2023-04-06 06:22:06,502 ----------------------------------------------------------------------------------------------------
2023-04-06 06:22:06,502 EPOCH 144 done: loss 0.0140 - lr 0.012500
2023-04-06 06:22:06,502 BAD EPOCHS (no improvement): 3
2023-04-06 06:22:06,506 ----------------------------------------------------------------------------------------------------
2023-04-06 06:22:25,653 epoch 145 - iter 265/2650 - loss 0.01311236 - time (sec): 19.15 - samples/sec: 7619.58 - lr: 0.012500
2023-04-06 06:22:44,408 epoch 145 - iter 530/2650 - loss 0.01345983 - time (sec): 37.90 - samples/sec: 7643.92 - lr: 0.012500
2023-04-06 06:23:03,812 epoch 145 - iter 795/2650 - loss 0.01331181 - time (sec): 57.31 - samples/sec: 7601.85 - lr: 0.012500
2023-04-06 06:23:24,110 epoch 145 - iter 1060/2650 - loss 0.01325242 - time (sec): 77.60 - samples/sec: 7538.83 - lr: 0.012500
2023-04-06 06:23:43,830 epoch 145 - iter 1325/2650 - loss 0.01335888 - time (sec): 97.32 - samples/sec: 7530.14 - lr: 0.012500
2023-04-06 06:24:02,494 epoch 145 - iter 1590/2650 - loss 0.01347614 - time (sec): 115.99 - samples/sec: 7558.09 - lr: 0.012500
2023-04-06 06:24:22,216 epoch 145 - iter 1855/2650 - loss 0.01356882 - time (sec): 135.71 - samples/sec: 7544.91 - lr: 0.012500
2023-04-06 06:24:41,584 epoch 145 - iter 2120/2650 - loss 0.01375189 - time (sec): 155.08 - samples/sec: 7547.38 - lr: 0.012500
2023-04-06 06:25:01,376 epoch 145 - iter 2385/2650 - loss 0.01385438 - time (sec): 174.87 - samples/sec: 7525.13 - lr: 0.012500
2023-04-06 06:25:20,693 epoch 145 - iter 2650/2650 - loss 0.01388776 - time (sec): 194.19 - samples/sec: 7530.71 - lr: 0.012500
2023-04-06 06:25:20,693 ----------------------------------------------------------------------------------------------------
2023-04-06 06:25:20,693 EPOCH 145 done: loss 0.0139 - lr 0.012500
2023-04-06 06:25:20,693 BAD EPOCHS (no improvement): 0
2023-04-06 06:25:20,697 ----------------------------------------------------------------------------------------------------
2023-04-06 06:25:40,333 epoch 146 - iter 265/2650 - loss 0.01455168 - time (sec): 19.64 - samples/sec: 7403.36 - lr: 0.012500
2023-04-06 06:25:59,169 epoch 146 - iter 530/2650 - loss 0.01398470 - time (sec): 38.47 - samples/sec: 7565.48 - lr: 0.012500
2023-04-06 06:26:18,869 epoch 146 - iter 795/2650 - loss 0.01401679 - time (sec): 58.17 - samples/sec: 7540.09 - lr: 0.012500
2023-04-06 06:26:38,052 epoch 146 - iter 1060/2650 - loss 0.01379408 - time (sec): 77.35 - samples/sec: 7549.56 - lr: 0.012500
2023-04-06 06:26:57,596 epoch 146 - iter 1325/2650 - loss 0.01412683 - time (sec): 96.90 - samples/sec: 7531.75 - lr: 0.012500
2023-04-06 06:27:16,587 epoch 146 - iter 1590/2650 - loss 0.01408474 - time (sec): 115.89 - samples/sec: 7563.33 - lr: 0.012500
2023-04-06 06:27:36,384 epoch 146 - iter 1855/2650 - loss 0.01411956 - time (sec): 135.69 - samples/sec: 7551.18 - lr: 0.012500
2023-04-06 06:27:56,469 epoch 146 - iter 2120/2650 - loss 0.01406314 - time (sec): 155.77 - samples/sec: 7515.15 - lr: 0.012500
2023-04-06 06:28:15,731 epoch 146 - iter 2385/2650 - loss 0.01403210 - time (sec): 175.03 - samples/sec: 7515.80 - lr: 0.012500
2023-04-06 06:28:34,767 epoch 146 - iter 2650/2650 - loss 0.01407356 - time (sec): 194.07 - samples/sec: 7535.24 - lr: 0.012500
2023-04-06 06:28:34,768 ----------------------------------------------------------------------------------------------------
2023-04-06 06:28:34,768 EPOCH 146 done: loss 0.0141 - lr 0.012500
2023-04-06 06:28:34,768 BAD EPOCHS (no improvement): 1
2023-04-06 06:28:34,771 ----------------------------------------------------------------------------------------------------
2023-04-06 06:28:54,217 epoch 147 - iter 265/2650 - loss 0.01418719 - time (sec): 19.45 - samples/sec: 7502.48 - lr: 0.012500
2023-04-06 06:29:13,710 epoch 147 - iter 530/2650 - loss 0.01411179 - time (sec): 38.94 - samples/sec: 7463.49 - lr: 0.012500
2023-04-06 06:29:33,020 epoch 147 - iter 795/2650 - loss 0.01412138 - time (sec): 58.25 - samples/sec: 7536.43 - lr: 0.012500
2023-04-06 06:29:52,262 epoch 147 - iter 1060/2650 - loss 0.01396798 - time (sec): 77.49 - samples/sec: 7551.24 - lr: 0.012500
2023-04-06 06:30:11,013 epoch 147 - iter 1325/2650 - loss 0.01399830 - time (sec): 96.24 - samples/sec: 7588.42 - lr: 0.012500
2023-04-06 06:30:30,030 epoch 147 - iter 1590/2650 - loss 0.01387271 - time (sec): 115.26 - samples/sec: 7599.16 - lr: 0.012500
2023-04-06 06:30:49,647 epoch 147 - iter 1855/2650 - loss 0.01384587 - time (sec): 134.88 - samples/sec: 7575.70 - lr: 0.012500
2023-04-06 06:31:09,479 epoch 147 - iter 2120/2650 - loss 0.01375414 - time (sec): 154.71 - samples/sec: 7552.97 - lr: 0.012500
2023-04-06 06:31:29,283 epoch 147 - iter 2385/2650 - loss 0.01386285 - time (sec): 174.51 - samples/sec: 7538.87 - lr: 0.012500
2023-04-06 06:31:48,474 epoch 147 - iter 2650/2650 - loss 0.01388996 - time (sec): 193.70 - samples/sec: 7549.55 - lr: 0.012500
2023-04-06 06:31:48,474 ----------------------------------------------------------------------------------------------------
2023-04-06 06:31:48,474 EPOCH 147 done: loss 0.0139 - lr 0.012500
2023-04-06 06:31:48,474 BAD EPOCHS (no improvement): 2
2023-04-06 06:31:48,478 ----------------------------------------------------------------------------------------------------
2023-04-06 06:32:07,471 epoch 148 - iter 265/2650 - loss 0.01381004 - time (sec): 18.99 - samples/sec: 7643.01 - lr: 0.012500
2023-04-06 06:32:27,043 epoch 148 - iter 530/2650 - loss 0.01378548 - time (sec): 38.56 - samples/sec: 7561.93 - lr: 0.012500
2023-04-06 06:32:46,643 epoch 148 - iter 795/2650 - loss 0.01400589 - time (sec): 58.16 - samples/sec: 7530.61 - lr: 0.012500
2023-04-06 06:33:05,984 epoch 148 - iter 1060/2650 - loss 0.01382955 - time (sec): 77.50 - samples/sec: 7517.63 - lr: 0.012500
2023-04-06 06:33:25,204 epoch 148 - iter 1325/2650 - loss 0.01385333 - time (sec): 96.73 - samples/sec: 7540.77 - lr: 0.012500
2023-04-06 06:33:44,251 epoch 148 - iter 1590/2650 - loss 0.01364464 - time (sec): 115.77 - samples/sec: 7572.58 - lr: 0.012500
2023-04-06 06:34:03,865 epoch 148 - iter 1855/2650 - loss 0.01356889 - time (sec): 135.39 - samples/sec: 7549.23 - lr: 0.012500
2023-04-06 06:34:23,007 epoch 148 - iter 2120/2650 - loss 0.01379351 - time (sec): 154.53 - samples/sec: 7558.19 - lr: 0.012500
2023-04-06 06:34:42,509 epoch 148 - iter 2385/2650 - loss 0.01379599 - time (sec): 174.03 - samples/sec: 7555.43 - lr: 0.012500
2023-04-06 06:35:02,098 epoch 148 - iter 2650/2650 - loss 0.01377780 - time (sec): 193.62 - samples/sec: 7552.78 - lr: 0.012500
2023-04-06 06:35:02,098 ----------------------------------------------------------------------------------------------------
2023-04-06 06:35:02,098 EPOCH 148 done: loss 0.0138 - lr 0.012500
2023-04-06 06:35:02,098 BAD EPOCHS (no improvement): 0
2023-04-06 06:35:02,102 ----------------------------------------------------------------------------------------------------
2023-04-06 06:35:21,337 epoch 149 - iter 265/2650 - loss 0.01349160 - time (sec): 19.23 - samples/sec: 7530.39 - lr: 0.012500
2023-04-06 06:35:40,832 epoch 149 - iter 530/2650 - loss 0.01332417 - time (sec): 38.73 - samples/sec: 7581.07 - lr: 0.012500
2023-04-06 06:35:59,732 epoch 149 - iter 795/2650 - loss 0.01344141 - time (sec): 57.63 - samples/sec: 7621.96 - lr: 0.012500
2023-04-06 06:36:18,867 epoch 149 - iter 1060/2650 - loss 0.01359838 - time (sec): 76.76 - samples/sec: 7605.84 - lr: 0.012500
2023-04-06 06:36:38,601 epoch 149 - iter 1325/2650 - loss 0.01373508 - time (sec): 96.50 - samples/sec: 7585.66 - lr: 0.012500
2023-04-06 06:36:58,180 epoch 149 - iter 1590/2650 - loss 0.01366690 - time (sec): 116.08 - samples/sec: 7574.86 - lr: 0.012500
2023-04-06 06:37:17,459 epoch 149 - iter 1855/2650 - loss 0.01368187 - time (sec): 135.36 - samples/sec: 7573.26 - lr: 0.012500
2023-04-06 06:37:36,823 epoch 149 - iter 2120/2650 - loss 0.01367709 - time (sec): 154.72 - samples/sec: 7573.34 - lr: 0.012500
2023-04-06 06:37:55,833 epoch 149 - iter 2385/2650 - loss 0.01376543 - time (sec): 173.73 - samples/sec: 7584.03 - lr: 0.012500
2023-04-06 06:38:15,407 epoch 149 - iter 2650/2650 - loss 0.01381202 - time (sec): 193.30 - samples/sec: 7565.07 - lr: 0.012500
2023-04-06 06:38:15,407 ----------------------------------------------------------------------------------------------------
2023-04-06 06:38:15,407 EPOCH 149 done: loss 0.0138 - lr 0.012500
2023-04-06 06:38:15,407 BAD EPOCHS (no improvement): 1
2023-04-06 06:38:15,411 ----------------------------------------------------------------------------------------------------
2023-04-06 06:38:34,363 epoch 150 - iter 265/2650 - loss 0.01307237 - time (sec): 18.95 - samples/sec: 7651.57 - lr: 0.012500
2023-04-06 06:38:53,718 epoch 150 - iter 530/2650 - loss 0.01364276 - time (sec): 38.31 - samples/sec: 7563.62 - lr: 0.012500
2023-04-06 06:39:12,803 epoch 150 - iter 795/2650 - loss 0.01353410 - time (sec): 57.39 - samples/sec: 7616.72 - lr: 0.012500
2023-04-06 06:39:32,501 epoch 150 - iter 1060/2650 - loss 0.01331620 - time (sec): 77.09 - samples/sec: 7579.86 - lr: 0.012500
2023-04-06 06:39:51,368 epoch 150 - iter 1325/2650 - loss 0.01338058 - time (sec): 95.96 - samples/sec: 7602.26 - lr: 0.012500
2023-04-06 06:40:11,198 epoch 150 - iter 1590/2650 - loss 0.01352856 - time (sec): 115.79 - samples/sec: 7567.79 - lr: 0.012500
2023-04-06 06:40:30,094 epoch 150 - iter 1855/2650 - loss 0.01353281 - time (sec): 134.68 - samples/sec: 7588.53 - lr: 0.012500
2023-04-06 06:40:49,339 epoch 150 - iter 2120/2650 - loss 0.01353930 - time (sec): 153.93 - samples/sec: 7591.61 - lr: 0.012500
2023-04-06 06:41:08,425 epoch 150 - iter 2385/2650 - loss 0.01353857 - time (sec): 173.01 - samples/sec: 7595.69 - lr: 0.012500
2023-04-06 06:41:28,425 epoch 150 - iter 2650/2650 - loss 0.01360721 - time (sec): 193.01 - samples/sec: 7576.47 - lr: 0.012500
2023-04-06 06:41:28,426 ----------------------------------------------------------------------------------------------------
2023-04-06 06:41:28,426 EPOCH 150 done: loss 0.0136 - lr 0.012500
2023-04-06 06:41:28,426 BAD EPOCHS (no improvement): 0
2023-04-06 06:41:36,515 ----------------------------------------------------------------------------------------------------
2023-04-06 06:41:36,515 Testing using last state of model ...
2023-04-06 06:42:14,804 Evaluating as a multi-label problem: False
2023-04-06 06:42:14,874 0.8966 0.8885 0.8926 0.8242
2023-04-06 06:42:14,874
Results:
- F-score (micro) 0.8926
- F-score (macro) 0.8017
- Accuracy 0.8242
By class:
precision recall f1-score support
GPE 0.9649 0.9576 0.9612 2240
PERSON 0.9410 0.9311 0.9360 1988
ORG 0.8985 0.8925 0.8955 1795
DATE 0.8679 0.8695 0.8687 1602
CARDINAL 0.8594 0.8299 0.8444 935
NORP 0.9010 0.9203 0.9106 841
PERCENT 0.9107 0.9054 0.9080 349
MONEY 0.9085 0.9172 0.9128 314
TIME 0.6605 0.6698 0.6651 212
ORDINAL 0.8020 0.8308 0.8161 195
LOC 0.7771 0.7598 0.7684 179
WORK_OF_ART 0.7007 0.6205 0.6581 166
FAC 0.7674 0.7333 0.7500 135
QUANTITY 0.7890 0.8190 0.8037 105
PRODUCT 0.7606 0.7105 0.7347 76
EVENT 0.6833 0.6508 0.6667 63
LAW 0.7241 0.5250 0.6087 40
LANGUAGE 0.9286 0.5909 0.7222 22
micro avg 0.8966 0.8885 0.8926 11257
macro avg 0.8247 0.7852 0.8017 11257
weighted avg 0.8962 0.8885 0.8921 11257
2023-04-06 06:42:14,875 ----------------------------------------------------------------------------------------------------