End of training

Files changed (4)

README.md CHANGED

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.6420
+- Loss: 3.6666
 ## Model description
@@ -35,8 +35,8 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 8
-- eval_batch_size: 8
+- train_batch_size: 32
+- eval_batch_size: 32
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
@@ -46,14 +46,14 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 3.7501        | 1.0   | 2334 | 3.6669          |
-| 3.6498        | 2.0   | 4668 | 3.6464          |
-| 3.6023        | 3.0   | 7002 | 3.6420          |
+| 3.9133        | 1.0   | 584  | 3.6984          |
+| 3.7477        | 2.0   | 1168 | 3.6721          |
+| 3.7063        | 3.0   | 1752 | 3.6666          |
 ### Framework versions
 - Transformers 4.40.2
-- Pytorch 2.3.0+cu121
-- Datasets 2.20.0
+- Pytorch 2.4.0+cu121
+- Datasets 2.19.2
 - Tokenizers 0.19.1
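
The step counts in the two results tables (2334 vs. 584 steps per epoch) change together with the batch size (8 vs. 32). The sketch below checks that both runs are consistent with the same training set. It assumes one optimizer step per batch (single device, no gradient accumulation) and that the last partial batch is kept; the README diff states none of this, and it does not give the training set size. The helper names are hypothetical.

```python
import math


def steps_per_epoch(num_examples: int, batch_size: int) -> int:
    """Optimizer steps per epoch when the final partial batch is kept."""
    return math.ceil(num_examples / batch_size)


def example_count_range(steps: int, batch_size: int) -> range:
    """All dataset sizes N for which ceil(N / batch_size) == steps."""
    return range((steps - 1) * batch_size + 1, steps * batch_size + 1)


# Values taken from the diff: steps at epoch 1.0 and the per-run batch size.
old_run = example_count_range(steps=2334, batch_size=8)
new_run = example_count_range(steps=584, batch_size=32)

# Dataset sizes compatible with both runs (assumption: same training split).
lo = max(old_run.start, new_run.start)
hi = min(old_run.stop, new_run.stop) - 1
print(f"compatible training set sizes: {lo}..{hi}")
```

Under these assumptions the two ranges overlap, so the drop from 2334 to 584 steps per epoch is explained by the fourfold larger batch alone, without any change in the data.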

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:8fe83537b8e8436195f1f20e3988aeb74213e8325c659f2f2ea1bc9c785ae93b
+oid sha256:1a4e11bfbfd6d2e3e3763b3907afd5d9996c6a648ea8e6abae5594263a1af823
 size 327657928
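
Only the `oid` line changes here: the file size is identical, as expected when weights of the same shape are retrained. A downloaded file can be checked against a Git LFS pointer like the one above with a short script. This is a minimal sketch, assuming the pointer has the three `key value` lines shown in the diff; `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names, not part of any Git LFS tooling.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,          # "sha256" in the pointers shown in this diff
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: str, pointer: dict) -> bool:
    """Return True if the file at `path` has the pointer's size and hash."""
    data_path = Path(path)
    if data_path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with data_path.open("rb") as handle:
        # Read in 1 MiB chunks so large weight files are not loaded at once.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:1a4e11bfbfd6d2e3e3763b3907afd5d9996c6a648ea8e6abae5594263a1af823\n"
    "size 327657928\n"
)
print(pointer["algo"], pointer["size"])
```

Calling `matches_pointer("model.safetensors", pointer)` on a local copy would then confirm that it is the post-commit version rather than the earlier one.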

runs/Aug12_06-17-58_default/events.out.tfevents.1723443486.default.1690.0 CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:acfb70d9176307ace9c718232f24bf5eb61c51bedb2c930265a81303cb2ecaad
-size 6230
+oid sha256:c3a94ac85162f220aa06ec81a856152b1ee02a48de80cb2d81eb3c02db185f17
+size 6855

runs/Aug12_06-17-58_default/events.out.tfevents.1723444297.default.1690.1 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:f6582acb7f746e76fb5173fcb2f3df313ce139653ed8c035767f195e96a8ce55
+size 359