End of training

Browse files

Files changed (4) hide show

README.md +34 -34
adapter_model.bin +1 -1
model.safetensors +1 -1
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [microsoft/Phi-3-mini-4k-instruct](https://huggingface.co/microsoft/Phi-3-mini-4k-instruct) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.0706
 ## Model description
@@ -50,39 +50,39 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 7.1779        | 0.09  | 10   | 2.1285          |
-| 1.3773        | 0.18  | 20   | 0.3216          |
-| 0.3479        | 0.27  | 30   | 0.2281          |
-| 0.9413        | 0.36  | 40   | 0.3169          |
-| 0.2771        | 0.45  | 50   | 0.1485          |
-| 0.332         | 0.54  | 60   | 0.1472          |
-| 0.2391        | 0.63  | 70   | 0.1359          |
-| 0.1792        | 0.73  | 80   | 0.1238          |
-| 0.1223        | 0.82  | 90   | 0.1178          |
-| 0.1279        | 0.91  | 100  | 0.0860          |
-| 0.109         | 1.0   | 110  | 0.0776          |
-| 0.0986        | 1.09  | 120  | 0.0769          |
-| 0.0989        | 1.18  | 130  | 0.0739          |
-| 0.117         | 1.27  | 140  | 0.0712          |
-| 0.1134        | 1.36  | 150  | 0.0686          |
-| 0.0768        | 1.45  | 160  | 0.0661          |
-| 0.0932        | 1.54  | 170  | 0.1176          |
-| 0.0865        | 1.63  | 180  | 0.0759          |
-| 0.0974        | 1.72  | 190  | 0.0680          |
-| 0.0831        | 1.81  | 200  | 0.0715          |
-| 0.0732        | 1.9   | 210  | 0.1637          |
-| 0.0756        | 1.99  | 220  | 0.0676          |
-| 0.0457        | 2.08  | 230  | 0.0696          |
-| 0.0551        | 2.18  | 240  | 0.0779          |
-| 0.0391        | 2.27  | 250  | 0.0772          |
-| 0.0401        | 2.36  | 260  | 0.0749          |
-| 0.0448        | 2.45  | 270  | 0.0707          |
-| 0.0422        | 2.54  | 280  | 0.0731          |
-| 0.037         | 2.63  | 290  | 0.0732          |
-| 0.039         | 2.72  | 300  | 0.0727          |
-| 0.0465        | 2.81  | 310  | 0.0718          |
-| 0.051         | 2.9   | 320  | 0.0707          |
-| 0.0416        | 2.99  | 330  | 0.0706          |
 ### Framework versions

 This model is a fine-tuned version of [microsoft/Phi-3-mini-4k-instruct](https://huggingface.co/microsoft/Phi-3-mini-4k-instruct) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.0643
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 7.2249        | 0.09  | 10   | 2.2001          |
+| 1.4719        | 0.18  | 20   | 0.3359          |
+| 0.3692        | 0.27  | 30   | 0.2930          |
+| 0.7802        | 0.36  | 40   | 0.2417          |
+| 0.3078        | 0.45  | 50   | 0.2185          |
+| 0.4702        | 0.54  | 60   | 0.2195          |
+| 0.272         | 0.63  | 70   | 0.1992          |
+| 0.2656        | 0.73  | 80   | 0.1711          |
+| 0.1386        | 0.82  | 90   | 0.1117          |
+| 0.2291        | 0.91  | 100  | 0.1116          |
+| 0.1424        | 1.0   | 110  | 0.0853          |
+| 0.099         | 1.09  | 120  | 0.1146          |
+| 0.1629        | 1.18  | 130  | 0.1753          |
+| 0.6955        | 1.27  | 140  | 0.1667          |
+| 0.226         | 1.36  | 150  | 0.1119          |
+| 0.1085        | 1.45  | 160  | 0.0805          |
+| 0.1083        | 1.54  | 170  | 0.0743          |
+| 0.2197        | 1.63  | 180  | 0.9735          |
+| 0.4915        | 1.72  | 190  | 0.0757          |
+| 0.0954        | 1.81  | 200  | 0.0794          |
+| 0.0696        | 1.9   | 210  | 0.0698          |
+| 0.068         | 1.99  | 220  | 0.0711          |
+| 0.0602        | 2.08  | 230  | 0.0702          |
+| 0.0896        | 2.18  | 240  | 0.0871          |
+| 0.0724        | 2.27  | 250  | 0.0720          |
+| 0.0679        | 2.36  | 260  | 0.0688          |
+| 0.0764        | 2.45  | 270  | 0.0683          |
+| 0.0642        | 2.54  | 280  | 0.0665          |
+| 0.058         | 2.63  | 290  | 0.0659          |
+| 0.0554        | 2.72  | 300  | 0.0665          |
+| 0.0699        | 2.81  | 310  | 0.0654          |
+| 0.0752        | 2.9   | 320  | 0.0645          |
+| 0.0654        | 2.99  | 330  | 0.0643          |
 ### Framework versions

adapter_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4ea50a1b36f3ba5792be01a47fde95b626c05d3a7528146f3d63f734977a8b4f
 size 326956822

 version https://git-lfs.github.com/spec/v1
+oid sha256:7bec9dd95d20d9633b28e4bbe25a2a4bcf5b138dcd809fc795b5ec7affd84a01
 size 326956822

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:612ab6fb9e247280ca22d1520df8062533285e13a36d888e899d821f3390def2
 size 7868160832

 version https://git-lfs.github.com/spec/v1
+oid sha256:6328be5636f8af6bf29653a2d4d4afa804fb85e85a715fb56c4081c7b4a30d0a
 size 7868160832

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:8d66d0ac97d7ef9aa8d21f01c25847ae7ce2774c44d4b6d55520ad29fa9d5e60
 size 5240

 version https://git-lfs.github.com/spec/v1
+oid sha256:d9af9923bfc6d0722fc65e110e118d6e3ac524543ce1554185795c27f00b8d87
 size 5240