End of training

Browse files

Files changed (4) hide show

README.md +34 -34
adapter_model.bin +1 -1
model.safetensors +1 -1
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [microsoft/Phi-3-mini-4k-instruct](https://huggingface.co/microsoft/Phi-3-mini-4k-instruct) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.0759
 ## Model description
@@ -50,39 +50,39 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 7.2811        | 0.09  | 10   | 2.3558          |
-| 1.5273        | 0.18  | 20   | 0.2947          |
-| 0.3667        | 0.27  | 30   | 0.3365          |
-| 1.5013        | 0.36  | 40   | 0.2249          |
-| 0.2882        | 0.45  | 50   | 0.1783          |
-| 0.3577        | 0.54  | 60   | 0.1499          |
-| 0.2395        | 0.63  | 70   | 0.1642          |
-| 0.1912        | 0.73  | 80   | 0.1321          |
-| 0.132         | 0.82  | 90   | 0.1249          |
-| 0.1411        | 0.91  | 100  | 0.0899          |
-| 0.1139        | 1.0   | 110  | 0.1044          |
-| 0.0781        | 1.09  | 120  | 0.0766          |
-| 0.0962        | 1.18  | 130  | 0.0736          |
-| 0.1109        | 1.27  | 140  | 0.0722          |
-| 0.1124        | 1.36  | 150  | 0.0697          |
-| 0.0764        | 1.45  | 160  | 0.0694          |
-| 0.0821        | 1.54  | 170  | 0.0685          |
-| 0.0711        | 1.63  | 180  | 0.0637          |
-| 0.0799        | 1.72  | 190  | 0.0680          |
-| 0.0826        | 1.81  | 200  | 0.0636          |
-| 0.0592        | 1.9   | 210  | 0.0650          |
-| 0.0584        | 1.99  | 220  | 0.0677          |
-| 0.0388        | 2.08  | 230  | 0.0705          |
-| 0.0501        | 2.18  | 240  | 0.0793          |
-| 0.0323        | 2.27  | 250  | 0.0846          |
-| 0.034         | 2.36  | 260  | 0.0803          |
-| 0.0413        | 2.45  | 270  | 0.0758          |
-| 0.0321        | 2.54  | 280  | 0.0769          |
-| 0.0315        | 2.63  | 290  | 0.0788          |
-| 0.0343        | 2.72  | 300  | 0.0777          |
-| 0.0404        | 2.81  | 310  | 0.0763          |
-| 0.0452        | 2.9   | 320  | 0.0758          |
-| 0.0369        | 2.99  | 330  | 0.0759          |
 ### Framework versions

 This model is a fine-tuned version of [microsoft/Phi-3-mini-4k-instruct](https://huggingface.co/microsoft/Phi-3-mini-4k-instruct) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.0747
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 7.2832        | 0.09  | 10   | 2.7337          |
+| 1.7648        | 0.18  | 20   | 0.3745          |
+| 0.3839        | 0.27  | 30   | 0.2589          |
+| 0.3285        | 0.36  | 40   | 0.2520          |
+| 0.3202        | 0.45  | 50   | 0.2229          |
+| 0.6502        | 0.54  | 60   | 0.2693          |
+| 0.3048        | 0.63  | 70   | 0.1647          |
+| 0.2068        | 0.73  | 80   | 0.1318          |
+| 0.1411        | 0.82  | 90   | 0.1621          |
+| 0.1775        | 0.91  | 100  | 0.0975          |
+| 0.1835        | 1.0   | 110  | 0.0954          |
+| 0.1014        | 1.09  | 120  | 0.0876          |
+| 0.1148        | 1.18  | 130  | 0.0976          |
+| 0.1506        | 1.27  | 140  | 0.0760          |
+| 0.128         | 1.36  | 150  | 0.0750          |
+| 0.0883        | 1.45  | 160  | 0.0736          |
+| 0.0913        | 1.54  | 170  | 0.0692          |
+| 0.0795        | 1.63  | 180  | 0.0681          |
+| 0.0927        | 1.72  | 190  | 0.0669          |
+| 0.087         | 1.81  | 200  | 0.0667          |
+| 0.0606        | 1.9   | 210  | 0.0682          |
+| 0.0627        | 1.99  | 220  | 0.0679          |
+| 0.0441        | 2.08  | 230  | 0.0705          |
+| 0.0543        | 2.18  | 240  | 0.0813          |
+| 0.0413        | 2.27  | 250  | 0.0839          |
+| 0.0414        | 2.36  | 260  | 0.0775          |
+| 0.0462        | 2.45  | 270  | 0.0756          |
+| 0.0411        | 2.54  | 280  | 0.0763          |
+| 0.0392        | 2.63  | 290  | 0.0768          |
+| 0.0407        | 2.72  | 300  | 0.0771          |
+| 0.0508        | 2.81  | 310  | 0.0755          |
+| 0.0577        | 2.9   | 320  | 0.0746          |
+| 0.0431        | 2.99  | 330  | 0.0747          |
 ### Framework versions

adapter_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:5ccd9fb9812c99c7334e454e8f5f0de9d593df0439edba9ea6d16c888fb7cae6
 size 326956822

 version https://git-lfs.github.com/spec/v1
+oid sha256:11c48e4becd0e809b2b198f9f2c25c495c339949ed7fa2fc8c53c802265bcbe7
 size 326956822

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7d9f9d31f611c81ef4d9fa49b7fb53550b9f65aa035d5218c62438488b37fd7c
 size 7868160832

 version https://git-lfs.github.com/spec/v1
+oid sha256:986b47a25cbf72a96e4414563aede6b73c9cd1a3918bc2b3452a7ddfb05cc411
 size 7868160832

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7cd229c65c2ea912677b65273c6a6fc251c1d1ec22a190a14833dc9724657aeb
 size 5240

 version https://git-lfs.github.com/spec/v1
+oid sha256:e99e13116415c2e825a20d5d2f3e511e242dbd61c36b485f4a662a7da6e045ee
 size 5240