baptistecolle (HF staff) committed on
Commit 8de2310
1 Parent(s): cbffeef

End of training

README.md CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [Open-Orca/Mistral-7B-OpenOrca](https://huggingface.co/Open-Orca/Mistral-7B-OpenOrca) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1075
+- Loss: 0.1123
 
 ## Model description
 
@@ -42,17 +42,16 @@ The following hyperparameters were used during training:
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 1
 - training_steps: 500
-- mixed_precision_training: Native AMP
 
 ### Training results
 
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 0.097 | 0.48 | 100 | 0.1216 |
-| 0.0795 | 0.95 | 200 | 0.1155 |
-| 0.0448 | 1.43 | 300 | 0.1142 |
-| 0.0516 | 1.9 | 400 | 0.1057 |
-| 0.0365 | 2.38 | 500 | 0.1075 |
+| 0.2728 | 0.48 | 100 | 0.1881 |
+| 0.1854 | 0.95 | 200 | 0.1352 |
+| 0.09 | 1.43 | 300 | 0.1251 |
+| 0.0902 | 1.9 | 400 | 0.1121 |
+| 0.062 | 2.38 | 500 | 0.1123 |
 
 
 ### Framework versions
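Both training-results tables log the same five evaluation steps, so the two runs can be compared row by row. The following is a minimal Python sketch of that comparison, with the validation losses copied from the diff above; the helper name `loss_deltas` is illustrative and not part of the repository.

```python
# Validation loss at each logged step, copied from the README diff.
steps = [100, 200, 300, 400, 500]
old_val_loss = [0.1216, 0.1155, 0.1142, 0.1057, 0.1075]  # removed rows
new_val_loss = [0.1881, 0.1352, 0.1251, 0.1121, 0.1123]  # added rows


def loss_deltas(old, new):
    """Return new - old for each pair, rounded to 4 decimal places."""
    return [round(n - o, 4) for o, n in zip(old, new)]


deltas = loss_deltas(old_val_loss, new_val_loss)
for step, delta in zip(steps, deltas):
    # A positive delta means the new run has the higher validation loss.
    print(f"step {step}: {delta:+.4f}")
```

In this data the new run's validation loss is higher at every logged step, and the gap narrows as training proceeds. The diff alone does not say what caused the difference.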
adapter_config.json CHANGED
@@ -17,13 +17,13 @@
   "revision": null,
   "target_modules": [
     "o_proj",
-    "gate_proj",
-    "v_proj",
     "down_proj",
-    "k_proj",
-    "up_proj",
+    "q_proj",
     "lm_head",
-    "q_proj"
+    "v_proj",
+    "gate_proj",
+    "up_proj",
+    "k_proj"
   ],
   "task_type": "CAUSAL_LM"
 }
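The `target_modules` hunk looks large, but it may only be a reordering. A short Python sketch that checks this, with both lists transcribed from the old and new sides of the diff (the variable names are illustrative):

```python
# target_modules as listed on the old and new sides of the diff.
old_target_modules = [
    "o_proj", "gate_proj", "v_proj", "down_proj",
    "k_proj", "up_proj", "lm_head", "q_proj",
]
new_target_modules = [
    "o_proj", "down_proj", "q_proj", "lm_head",
    "v_proj", "gate_proj", "up_proj", "k_proj",
]

# Compare as sets (membership only) and as lists (membership plus order).
same_members = set(old_target_modules) == set(new_target_modules)
same_order = old_target_modules == new_target_modules

print(f"same members: {same_members}, same order: {same_order}")
```

Both sides name the same eight modules; only their order in the file differs.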
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:cc598f3e72c17e2104519e2a845ec3f137c235d6842cc279feae394654374724
+oid sha256:bf8d98e8df7f05494677f12cbe67111ef7eb52de0013cbeead0cf52b57f07bfd
 size 340225480
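The file tracked here is a Git LFS pointer: three `key value` lines giving the spec version, the content hash, and the size in bytes. Only the `oid` changed, so the adapter weights differ while the file size stays at 340225480 bytes. The following is a hedged Python sketch of how such a pointer could be parsed and checked against downloaded content; `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names, not part of any library.

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer into a dict."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data, pointer):
    """Check that raw bytes have the size and hash recorded in the pointer."""
    if len(data) != pointer["size"]:
        return False
    return hashlib.new(pointer["algo"], data).hexdigest() == pointer["digest"]


# New-side pointer for adapter_model.safetensors, copied from the diff.
new_pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:bf8d98e8df7f05494677f12cbe67111ef7eb52de0013cbeead0cf52b57f07bfd\n"
    "size 340225480\n"
)
print(new_pointer["algo"], new_pointer["size"])
```

For a file of this size it would be better to hash in chunks from disk rather than hold the whole content in memory; the in-memory version is used here only to keep the sketch short.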
training_args.bin CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ddac5123a1f7ef3bb7ded41862e2bd168332ef032599656ec5ad785fcc46f053
-size 4664
+oid sha256:339cb40a33201baae08801e23f366086440bd7c0036cfe4e37d1e9457c001aeb
+size 4600