End of training

Files changed (5)

README.md +34 -34
adapter_config.json +1 -1
adapter_model.bin +2 -2
model.safetensors +2 -2
training_args.bin +1 -1

README.md CHANGED

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [microsoft/Phi-3-mini-4k-instruct](https://huggingface.co/microsoft/Phi-3-mini-4k-instruct) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.0856
 ## Model description
@@ -50,39 +50,39 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 3.766         | 0.09  | 10   | 0.4613          |
-| 0.3024        | 0.18  | 20   | 0.2538          |
-| 0.2469        | 0.27  | 30   | 0.2178          |
-| 0.261         | 0.36  | 40   | 0.2338          |
-| 0.482         | 0.45  | 50   | 0.2783          |
-| 0.2062        | 0.54  | 60   | 0.1451          |
-| 0.1438        | 0.63  | 70   | 0.1608          |
-| 0.1457        | 0.73  | 80   | 0.1216          |
-| 0.103         | 0.82  | 90   | 0.1031          |
-| 0.1028        | 0.91  | 100  | 0.0730          |
-| 0.0779        | 1.0   | 110  | 0.0722          |
-| 0.0572        | 1.09  | 120  | 0.0660          |
-| 0.0543        | 1.18  | 130  | 0.0833          |
-| 0.0669        | 1.27  | 140  | 0.0697          |
-| 0.0573        | 1.36  | 150  | 0.0711          |
-| 0.0636        | 1.45  | 160  | 0.0659          |
-| 0.0579        | 1.54  | 170  | 0.0680          |
-| 0.0595        | 1.63  | 180  | 0.0651          |
-| 0.0541        | 1.72  | 190  | 0.0692          |
-| 0.0579        | 1.81  | 200  | 0.0626          |
-| 0.0487        | 1.9   | 210  | 0.0663          |
-| 0.0501        | 1.99  | 220  | 0.0655          |
-| 0.0269        | 2.08  | 230  | 0.0733          |
-| 0.0241        | 2.18  | 240  | 0.0912          |
-| 0.0172        | 2.27  | 250  | 0.1040          |
-| 0.0171        | 2.36  | 260  | 0.0972          |
-| 0.0242        | 2.45  | 270  | 0.0892          |
-| 0.0153        | 2.54  | 280  | 0.0907          |
-| 0.0193        | 2.63  | 290  | 0.0904          |
-| 0.0252        | 2.72  | 300  | 0.0876          |
-| 0.0245        | 2.81  | 310  | 0.0862          |
-| 0.0202        | 2.9   | 320  | 0.0857          |
-| 0.0207        | 2.99  | 330  | 0.0856          |
 ### Framework versions

 This model is a fine-tuned version of [microsoft/Phi-3-mini-4k-instruct](https://huggingface.co/microsoft/Phi-3-mini-4k-instruct) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.0803
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 3.9264        | 0.09  | 10   | 0.4060          |
+| 0.5196        | 0.18  | 20   | 2.6047          |
+| 0.5297        | 0.27  | 30   | 0.2263          |
+| 0.2143        | 0.36  | 40   | 0.1990          |
+| 0.2823        | 0.45  | 50   | 0.2488          |
+| 0.2513        | 0.54  | 60   | 0.1911          |
+| 0.1606        | 0.63  | 70   | 0.1463          |
+| 0.1446        | 0.73  | 80   | 0.1406          |
+| 0.1202        | 0.82  | 90   | 0.1288          |
+| 0.1229        | 0.91  | 100  | 0.1081          |
+| 0.1123        | 1.0   | 110  | 0.1439          |
+| 0.123         | 1.09  | 120  | 0.1062          |
+| 0.0765        | 1.18  | 130  | 0.0812          |
+| 0.0736        | 1.27  | 140  | 0.0723          |
+| 0.0629        | 1.36  | 150  | 0.0730          |
+| 0.0554        | 1.45  | 160  | 0.0738          |
+| 0.0532        | 1.54  | 170  | 0.0671          |
+| 0.0595        | 1.63  | 180  | 0.0657          |
+| 0.0594        | 1.72  | 190  | 0.0681          |
+| 0.0613        | 1.81  | 200  | 0.0624          |
+| 0.0488        | 1.9   | 210  | 0.0623          |
+| 0.0576        | 1.99  | 220  | 0.0607          |
+| 0.0284        | 2.08  | 230  | 0.0712          |
+| 0.0171        | 2.18  | 240  | 0.1021          |
+| 0.0287        | 2.27  | 250  | 0.0831          |
+| 0.0209        | 2.36  | 260  | 0.0753          |
+| 0.0229        | 2.45  | 270  | 0.0752          |
+| 0.0209        | 2.54  | 280  | 0.0759          |
+| 0.0206        | 2.63  | 290  | 0.0773          |
+| 0.0199        | 2.72  | 300  | 0.0788          |
+| 0.0162        | 2.81  | 310  | 0.0796          |
+| 0.0181        | 2.9   | 320  | 0.0802          |
+| 0.0213        | 2.99  | 330  | 0.0803          |
 ### Framework versions
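
Note on the updated training log: validation loss bottoms out at 0.0607 (step 220) and ends at 0.0803 (step 330) while training loss keeps falling. This is consistent with overfitting in the third epoch, although the diff alone cannot confirm it. The snippet below is a minimal sketch for finding the lowest-validation-loss row in a table like the one above. The helper names `parse_rows` and `best_row` are hypothetical and not part of the repository, and the sample holds only a few rows copied from the new log.

```python
def parse_rows(table_text):
    """Return (step, train_loss, val_loss) tuples from markdown table rows.

    Leading diff markers (+/-) are tolerated; header and separator rows are skipped.
    """
    rows = []
    for line in table_text.splitlines():
        # Drop diff markers and padding, then the outer pipes, then split cells.
        cells = [c.strip() for c in line.lstrip("+- ").strip().strip("|").split("|")]
        if len(cells) != 4:
            continue
        try:
            train_loss = float(cells[0])
            step = int(cells[2])
            val_loss = float(cells[3])
        except ValueError:
            continue  # header or separator row
        rows.append((step, train_loss, val_loss))
    return rows


def best_row(rows):
    """Row with the lowest validation loss."""
    return min(rows, key=lambda r: r[2])


# A few rows copied from the updated README.md table (not the full log).
SAMPLE = """
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 0.0613        | 1.81  | 200  | 0.0624          |
+| 0.0488        | 1.9   | 210  | 0.0623          |
+| 0.0576        | 1.99  | 220  | 0.0607          |
+| 0.0284        | 2.08  | 230  | 0.0712          |
+| 0.0213        | 2.99  | 330  | 0.0803          |
"""

if __name__ == "__main__":
    step, train_loss, val_loss = best_row(parse_rows(SAMPLE))
    print(f"lowest validation loss {val_loss} at step {step}")
```

If intermediate checkpoints were saved, the step reported here would be a candidate to compare against the final one. Whether they were saved is not visible in this commit.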

adapter_config.json CHANGED

@@ -2,7 +2,7 @@
   "adaptive_ratio": 0.01,
   "adaptive_ratio_decay": 1.0,
   "additive_modeling": false,
-  "allow_empty_lora": true,
   "auto_mapping": null,
   "base_model_name_or_path": "microsoft/Phi-3-mini-4k-instruct",
   "bias": "none",

   "adaptive_ratio": 0.01,
   "adaptive_ratio_decay": 1.0,
   "additive_modeling": false,
+  "allow_empty_lora": false,
   "auto_mapping": null,
   "base_model_name_or_path": "microsoft/Phi-3-mini-4k-instruct",
   "bias": "none",
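
The only change in this hunk is `allow_empty_lora` flipping from `true` to `false`. As a rough sketch, the following compares two versions of an adapter config and lists the keys whose values differ. The function name `changed_keys` is hypothetical, and the two JSON strings contain only the keys visible in the hunk, not the full file.

```python
import json


def changed_keys(old, new):
    """Map each key whose value differs to an (old, new) pair; absent keys show as None."""
    return {
        key: (old.get(key), new.get(key))
        for key in sorted(old.keys() | new.keys())
        if old.get(key) != new.get(key)
    }


# Only the keys shown in the diff hunk; the real file has more entries.
OLD = json.loads("""{
  "adaptive_ratio": 0.01,
  "adaptive_ratio_decay": 1.0,
  "additive_modeling": false,
  "allow_empty_lora": true,
  "auto_mapping": null,
  "base_model_name_or_path": "microsoft/Phi-3-mini-4k-instruct",
  "bias": "none"
}""")
NEW = dict(OLD, allow_empty_lora=False)

if __name__ == "__main__":
    for key, (before, after) in changed_keys(OLD, NEW).items():
        print(f"{key}: {before!r} -> {after!r}")
```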

adapter_model.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e892ec140136e3a12b6bf26026ecff3be71ba9e5096788c1b8f9d2db4fdb49be
-size 431155958

 version https://git-lfs.github.com/spec/v1
+oid sha256:a0f6a4610b5a028ff2cfae5abaae4fc097d311e30dd493daf1c49d9891038189
+size 430750193

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4f09142b8d998137b8cfd69edd34d37ae1da6c20a900c302cc8d23a0ddffdc53
-size 7921726216

 version https://git-lfs.github.com/spec/v1
+oid sha256:89305dac159408b747ab5327111ed2c1a22eb279198fc59330108bb157dcfb70
+size 7921324520

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e1d4d0fbbd7b90aa8d90deb273884f196cc13ef3f26c3799da308e6cf9acf1c2
 size 5176

 version https://git-lfs.github.com/spec/v1
+oid sha256:cd1749e59c1e7250499bb4843cbdc6a89b6ce9679ba6d846786bb8e19c0f7d6b
 size 5176
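
The three binary files above are stored as Git LFS pointers: a `version` line, an `oid` holding a SHA-256 digest, and a `size` in bytes. The sketch below parses such a pointer and checks a downloaded file against it. The function names `parse_lfs_pointer` and `matches_pointer` are hypothetical, the file path in the usage lines is a placeholder for a local copy, and the pointer text is the new `training_args.bin` entry from this commit.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer file into a dict."""
    fields = dict(
        line.strip().split(" ", 1)
        for line in text.strip().splitlines()
        if line.strip()
    )
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer):
    """True if the file at `path` has the size and digest recorded in the pointer."""
    file_path = Path(path)
    if file_path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with file_path.open("rb") as handle:
        # Hash in 1 MiB chunks so multi-gigabyte files do not need to fit in memory.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


POINTER_TEXT = """
version https://git-lfs.github.com/spec/v1
oid sha256:cd1749e59c1e7250499bb4843cbdc6a89b6ce9679ba6d846786bb8e19c0f7d6b
size 5176
"""

if __name__ == "__main__":
    pointer = parse_lfs_pointer(POINTER_TEXT)
    local_copy = Path("training_args.bin")  # placeholder path to a downloaded copy
    if local_copy.exists():
        print("matches pointer:", matches_pointer(local_copy, pointer))
```

The size is checked before hashing because it is cheap and rules out truncated downloads without reading the file.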