rwang5688 commited on
Commit
5ba6ef7
1 Parent(s): 123d313

End of training

Browse files
README.md CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
- - Loss: 3.6420
19
 
20
  ## Model description
21
 
@@ -35,8 +35,8 @@ More information needed
35
 
36
  The following hyperparameters were used during training:
37
  - learning_rate: 2e-05
38
- - train_batch_size: 8
39
- - eval_batch_size: 8
40
  - seed: 42
41
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
42
  - lr_scheduler_type: linear
@@ -46,14 +46,14 @@ The following hyperparameters were used during training:
46
 
47
  | Training Loss | Epoch | Step | Validation Loss |
48
  |:-------------:|:-----:|:----:|:---------------:|
49
- | 3.7501 | 1.0 | 2334 | 3.6669 |
50
- | 3.6498 | 2.0 | 4668 | 3.6464 |
51
- | 3.6023 | 3.0 | 7002 | 3.6420 |
52
 
53
 
54
  ### Framework versions
55
 
56
  - Transformers 4.40.2
57
- - Pytorch 2.3.0+cu121
58
- - Datasets 2.20.0
59
  - Tokenizers 0.19.1
 
15
 
16
  This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 3.6666
19
 
20
  ## Model description
21
 
 
35
 
36
  The following hyperparameters were used during training:
37
  - learning_rate: 2e-05
38
+ - train_batch_size: 32
39
+ - eval_batch_size: 32
40
  - seed: 42
41
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
42
  - lr_scheduler_type: linear
 
46
 
47
  | Training Loss | Epoch | Step | Validation Loss |
48
  |:-------------:|:-----:|:----:|:---------------:|
49
+ | 3.9133 | 1.0 | 584 | 3.6984 |
50
+ | 3.7477 | 2.0 | 1168 | 3.6721 |
51
+ | 3.7063 | 3.0 | 1752 | 3.6666 |
52
 
53
 
54
  ### Framework versions
55
 
56
  - Transformers 4.40.2
57
+ - Pytorch 2.4.0+cu121
58
+ - Datasets 2.19.2
59
  - Tokenizers 0.19.1
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:8fe83537b8e8436195f1f20e3988aeb74213e8325c659f2f2ea1bc9c785ae93b
3
  size 327657928
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1a4e11bfbfd6d2e3e3763b3907afd5d9996c6a648ea8e6abae5594263a1af823
3
  size 327657928
runs/Aug12_06-17-58_default/events.out.tfevents.1723443486.default.1690.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:acfb70d9176307ace9c718232f24bf5eb61c51bedb2c930265a81303cb2ecaad
3
- size 6230
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c3a94ac85162f220aa06ec81a856152b1ee02a48de80cb2d81eb3c02db185f17
3
+ size 6855
runs/Aug12_06-17-58_default/events.out.tfevents.1723444297.default.1690.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f6582acb7f746e76fb5173fcb2f3df313ce139653ed8c035767f195e96a8ce55
3
+ size 359