End of training

Files changed (4)

README.md +14 -23
model.safetensors +1 -1
runs/Jul07_14-38-54_Noah-Desktop/events.out.tfevents.1720381135.Noah-Desktop.4396.0 +2 -2
runs/Jul07_14-38-54_Noah-Desktop/events.out.tfevents.1720386030.Noah-Desktop.4396.1 +3 -0

README.md CHANGED

@@ -1,6 +1,6 @@
 ---
-base_model: NowaBwagel0/llama-68m-oasst
 license: other
+base_model: NowaBwagel0/llama-68m-oasst
 tags:
 - generated_from_trainer
 model-index:
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [NowaBwagel0/llama-68m-oasst](https://huggingface.co/NowaBwagel0/llama-68m-oasst) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.8987
+- Loss: 4.1198
 ## Model description
@@ -42,30 +42,21 @@ The following hyperparameters were used during training:
 - total_train_batch_size: 8
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 18
+- num_epochs: 9
 ### Training results
-| Training Loss | Epoch   | Step | Validation Loss |
-|:-------------:|:-------:|:----:|:---------------:|
-| 0.97          | 0.9987  | 382  | 3.4996          |
-| 0.9273        | 2.0     | 765  | 3.5370          |
-| 0.9176        | 2.9987  | 1147 | 3.5715          |
-| 0.9004        | 4.0     | 1530 | 3.6086          |
-| 0.8736        | 4.9987  | 1912 | 3.6379          |
-| 0.8599        | 6.0     | 2295 | 3.6761          |
-| 0.7955        | 6.9987  | 2677 | 3.7044          |
-| 0.7741        | 8.0     | 3060 | 3.7346          |
-| 0.7364        | 8.9987  | 3442 | 3.7615          |
-| 0.7605        | 10.0    | 3825 | 3.7855          |
-| 0.695         | 10.9987 | 4207 | 3.8088          |
-| 0.7111        | 12.0    | 4590 | 3.8332          |
-| 0.6849        | 12.9987 | 4972 | 3.8490          |
-| 0.6862        | 14.0    | 5355 | 3.8659          |
-| 0.6834        | 14.9987 | 5737 | 3.8785          |
-| 0.6541        | 16.0    | 6120 | 3.8898          |
-| 0.646         | 16.9987 | 6502 | 3.8961          |
-| 0.6777        | 17.9765 | 6876 | 3.8987          |
+| Training Loss | Epoch  | Step | Validation Loss |
+|:-------------:|:------:|:----:|:---------------:|
+| 0.7089        | 0.9987 | 382  | 3.9090          |
+| 0.6716        | 2.0    | 765  | 3.9535          |
+| 0.6583        | 2.9987 | 1147 | 3.9890          |
+| 0.6402        | 4.0    | 1530 | 4.0211          |
+| 0.6224        | 4.9987 | 1912 | 4.0493          |
+| 0.6119        | 6.0    | 2295 | 4.0758          |
+| 0.558         | 6.9987 | 2677 | 4.0987          |
+| 0.5383        | 8.0    | 3060 | 4.1135          |
+| 0.5506        | 8.9882 | 3438 | 4.1198          |
 ### Framework versions
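
The evaluation losses in the README diff are easier to compare as perplexities. The following is a minimal sketch, assuming the reported loss is a mean per-token cross-entropy in nats (the README does not state this); the helper name `perplexity` is illustrative and not part of the repository.

```python
import math


def perplexity(cross_entropy_loss: float) -> float:
    """Convert a mean per-token cross-entropy loss (assumed to be in nats) to perplexity."""
    return math.exp(cross_entropy_loss)


# Evaluation losses quoted in the README diff (previous run vs. this commit).
before = perplexity(3.8987)
after = perplexity(4.1198)

print(f"before: {before:.1f}  after: {after:.1f}  ratio: {after / before:.3f}")
```

Under that assumption, the ratio printed by the sketch is the factor by which evaluation perplexity changed between the two README versions.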

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:caa2c758e2e8c517efa7bb716beeafac8422348ac4abd51c9a1ee28cd9407b81
+oid sha256:a76d2a5d06056bf73fbf5696f6c36edd2fcbc4adcce5d888f6663f19ee959b98
 size 272123144

runs/Jul07_14-38-54_Noah-Desktop/events.out.tfevents.1720381135.Noah-Desktop.4396.0 CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:27b9f780ea0d87ec22930c494d4a7d226c6111cf798a2b25ebe22fc93a6302e6
-size 85861
+oid sha256:dbe8a2ebe0bf5d91fb5774740a44c73167e6653881c06ca25bc21476dcac8674
+size 98151

runs/Jul07_14-38-54_Noah-Desktop/events.out.tfevents.1720386030.Noah-Desktop.4396.1 ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:46a1a5860d3dc168564b4033fd9d067c6d6b8f16eccb6bec9bfbfae05fc3b333
+size 359
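
The `version` / `oid` / `size` triples in this diff are Git LFS pointer files: the `oid` is the SHA-256 of the real file's contents and `size` is its length in bytes. The following is a minimal sketch of checking a downloaded file against such a pointer; the helper names `parse_lfs_pointer` and `matches_pointer` are illustrative and not part of any library.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer: dict) -> bool:
    """Return True if the file at `path` has the size and hash recorded in the pointer."""
    data_path = Path(path)
    if data_path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with data_path.open("rb") as handle:
        # Read in 1 MiB chunks so large weight files are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# Pointer for the newly added event file, copied from the diff above.
pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:46a1a5860d3dc168564b4033fd9d067c6d6b8f16eccb6bec9bfbfae05fc3b333\n"
    "size 359\n"
)
```

Calling `matches_pointer(local_path, pointer)` on a locally downloaded copy (where `local_path` is wherever that copy lives) would confirm that it is the file this commit refers to.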