markat1/mistral-7binstruct-summary-100s

Files changed (4)

README.md +37 -6
adapter_config.json +1 -1
adapter_model.safetensors +2 -2
training_args.bin +1 -1

README.md CHANGED

@@ -20,12 +20,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2) on the generator dataset.
 It achieves the following results on the evaluation set:
-- eval_loss: 2.0198
-- eval_runtime: 34.7344
-- eval_samples_per_second: 2.706
-- eval_steps_per_second: 0.345
-- epoch: 0.06
-- step: 15
+- Loss: 1.6323
 ## Model description
@@ -55,6 +50,42 @@ The following hyperparameters were used during training:
 - lr_scheduler_warmup_steps: 0.03
 - training_steps: 150
+### Training results
+| Training Loss | Epoch | Step | Validation Loss |
+|:-------------:|:-----:|:----:|:---------------:|
+| 2.5172        | 0.02  | 5    | 2.3926          |
+| 2.2822        | 0.04  | 10   | 2.1537          |
+| 2.1109        | 0.06  | 15   | 2.0087          |
+| 1.8571        | 0.08  | 20   | 1.9020          |
+| 1.8964        | 0.11  | 25   | 1.8310          |
+| 1.7335        | 0.13  | 30   | 1.7901          |
+| 1.7744        | 0.15  | 35   | 1.7607          |
+| 1.8654        | 0.17  | 40   | 1.7396          |
+| 1.7379        | 0.19  | 45   | 1.7235          |
+| 1.7442        | 0.21  | 50   | 1.7113          |
+| 1.6483        | 0.23  | 55   | 1.7011          |
+| 1.7006        | 0.25  | 60   | 1.6919          |
+| 1.6783        | 0.28  | 65   | 1.6833          |
+| 1.6468        | 0.3   | 70   | 1.6754          |
+| 1.6116        | 0.32  | 75   | 1.6678          |
+| 1.5899        | 0.34  | 80   | 1.6605          |
+| 1.7426        | 0.36  | 85   | 1.6538          |
+| 1.7244        | 0.38  | 90   | 1.6491          |
+| 1.6652        | 0.4   | 95   | 1.6457          |
+| 1.7859        | 0.42  | 100  | 1.6422          |
+| 1.5836        | 0.44  | 105  | 1.6395          |
+| 1.6265        | 0.47  | 110  | 1.6374          |
+| 1.5187        | 0.49  | 115  | 1.6358          |
+| 1.5989        | 0.51  | 120  | 1.6345          |
+| 1.684         | 0.53  | 125  | 1.6336          |
+| 1.6257        | 0.55  | 130  | 1.6329          |
+| 1.7211        | 0.57  | 135  | 1.6325          |
+| 1.6235        | 0.59  | 140  | 1.6324          |
+| 1.5885        | 0.61  | 145  | 1.6323          |
+| 1.5885        | 0.64  | 150  | 1.6323          |
 ### Framework versions
 - PEFT 0.9.0
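
To put the loss change in this README diff on a more intuitive scale, the values can be converted to perplexity. This is a minimal sketch that assumes the reported loss is a mean per-token cross-entropy in nats (the README does not say so explicitly); the helper name `perplexity` is illustrative and not part of this repository.

```python
import math


def perplexity(loss: float) -> float:
    """Convert a mean per-token cross-entropy (assumed to be in nats) to perplexity."""
    return math.exp(loss)


# Values copied from the README diff.
old_eval_loss = 2.0198  # removed line, reported at step 15
new_eval_loss = 1.6323  # added line, final validation loss at step 150

print(f"step 15:  perplexity ~ {perplexity(old_eval_loss):.2f}")
print(f"step 150: perplexity ~ {perplexity(new_eval_loss):.2f}")
```

Under that assumption, a lower loss maps to a lower perplexity, so the two printed numbers give a rough before/after comparison for this commit.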

adapter_config.json CHANGED

@@ -1,7 +1,7 @@
 {
   "alpha_pattern": {},
   "auto_mapping": null,
-  "base_model_name_or_path": "mistralai/Mistral-7B-Instruct-v0.2",
+  "base_model_name_or_path": null,
   "bias": "none",
   "fan_in_fan_out": false,
   "inference_mode": true,
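
With `base_model_name_or_path` now `null`, code that reads the base model from the adapter config needs another source for it. The sketch below shows one possible way to handle that; `resolve_base_model` is a hypothetical helper, and the fallback id is taken from the README in this commit, not from the config itself.

```python
import json


def resolve_base_model(adapter_config: dict, fallback: str) -> str:
    """Return the base model id from the adapter config, or `fallback` if it is null or missing."""
    name = adapter_config.get("base_model_name_or_path")
    return name if name else fallback


# Trimmed copy of the new adapter_config.json (only two of its keys shown).
new_config = json.loads('{"base_model_name_or_path": null, "bias": "none"}')

base_id = resolve_base_model(new_config, "mistralai/Mistral-7B-Instruct-v0.2")
print(base_id)

# With transformers and peft installed, the adapter would then typically be
# attached to an explicitly loaded base model, along these lines:
#   base = AutoModelForCausalLM.from_pretrained(base_id)
#   model = PeftModel.from_pretrained(base, "<adapter repo id or local path>")
```

Passing the base model explicitly avoids depending on the config field at all, which is why the fallback is kept as a plain argument here.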

adapter_model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c60b8684e4fc174c64ee65169402174c73475b89d27b094a13d5b595591bc30c
-size 54543184
+oid sha256:080c32c9d89e81893201d215084cb22ab2f799026d585afbd68f47a470b902f4
+size 54545360
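
The diff above is over a Git LFS pointer file, not the weights themselves: each pointer holds a spec version, a SHA-256 object id, and a byte size. As a rough illustration, the pointers can be parsed and compared like this; `parse_lfs_pointer` is a hypothetical helper written for this example, and the pointer text is copied from the diff.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


old_pointer = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:c60b8684e4fc174c64ee65169402174c73475b89d27b094a13d5b595591bc30c
size 54543184
""")
new_pointer = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:080c32c9d89e81893201d215084cb22ab2f799026d585afbd68f47a470b902f4
size 54545360
""")

# Size difference between the two adapter files, in bytes.
size_delta = new_pointer["size"] - old_pointer["size"]
print(size_delta)
```

A changed `oid` means the file contents differ; the size delta only says by how many bytes the stored file grew or shrank.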

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e328bed096c7f01176f4ccaa21df959a73c03b0f481e67f80ac001bf2d078c94
+oid sha256:cf7a5483a6ddb2a94c9643206dd99b771e6991fc0b125c048d43ce7d87461ad4
 size 4920