NowaBwagel0 commited on
Commit
dfe19ae
1 Parent(s): 8329d7f

End of training

Browse files
README.md CHANGED
@@ -1,6 +1,6 @@
1
  ---
2
- base_model: NowaBwagel0/llama-68m-oasst
3
  license: other
 
4
  tags:
5
  - generated_from_trainer
6
  model-index:
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  This model is a fine-tuned version of [NowaBwagel0/llama-68m-oasst](https://huggingface.co/NowaBwagel0/llama-68m-oasst) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
- - Loss: 3.8987
19
 
20
  ## Model description
21
 
@@ -42,30 +42,21 @@ The following hyperparameters were used during training:
42
  - total_train_batch_size: 8
43
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
44
  - lr_scheduler_type: linear
45
- - num_epochs: 18
46
 
47
  ### Training results
48
 
49
- | Training Loss | Epoch | Step | Validation Loss |
50
- |:-------------:|:-------:|:----:|:---------------:|
51
- | 0.97 | 0.9987 | 382 | 3.4996 |
52
- | 0.9273 | 2.0 | 765 | 3.5370 |
53
- | 0.9176 | 2.9987 | 1147 | 3.5715 |
54
- | 0.9004 | 4.0 | 1530 | 3.6086 |
55
- | 0.8736 | 4.9987 | 1912 | 3.6379 |
56
- | 0.8599 | 6.0 | 2295 | 3.6761 |
57
- | 0.7955 | 6.9987 | 2677 | 3.7044 |
58
- | 0.7741 | 8.0 | 3060 | 3.7346 |
59
- | 0.7364 | 8.9987 | 3442 | 3.7615 |
60
- | 0.7605 | 10.0 | 3825 | 3.7855 |
61
- | 0.695 | 10.9987 | 4207 | 3.8088 |
62
- | 0.7111 | 12.0 | 4590 | 3.8332 |
63
- | 0.6849 | 12.9987 | 4972 | 3.8490 |
64
- | 0.6862 | 14.0 | 5355 | 3.8659 |
65
- | 0.6834 | 14.9987 | 5737 | 3.8785 |
66
- | 0.6541 | 16.0 | 6120 | 3.8898 |
67
- | 0.646 | 16.9987 | 6502 | 3.8961 |
68
- | 0.6777 | 17.9765 | 6876 | 3.8987 |
69
 
70
 
71
  ### Framework versions
 
1
  ---
 
2
  license: other
3
+ base_model: NowaBwagel0/llama-68m-oasst
4
  tags:
5
  - generated_from_trainer
6
  model-index:
 
15
 
16
  This model is a fine-tuned version of [NowaBwagel0/llama-68m-oasst](https://huggingface.co/NowaBwagel0/llama-68m-oasst) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 4.1198
19
 
20
  ## Model description
21
 
 
42
  - total_train_batch_size: 8
43
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
44
  - lr_scheduler_type: linear
45
+ - num_epochs: 9
46
 
47
  ### Training results
48
 
49
+ | Training Loss | Epoch | Step | Validation Loss |
50
+ |:-------------:|:------:|:----:|:---------------:|
51
+ | 0.7089 | 0.9987 | 382 | 3.9090 |
52
+ | 0.6716 | 2.0 | 765 | 3.9535 |
53
+ | 0.6583 | 2.9987 | 1147 | 3.9890 |
54
+ | 0.6402 | 4.0 | 1530 | 4.0211 |
55
+ | 0.6224 | 4.9987 | 1912 | 4.0493 |
56
+ | 0.6119 | 6.0 | 2295 | 4.0758 |
57
+ | 0.558 | 6.9987 | 2677 | 4.0987 |
58
+ | 0.5383 | 8.0 | 3060 | 4.1135 |
59
+ | 0.5506 | 8.9882 | 3438 | 4.1198 |
 
 
 
 
 
 
 
 
 
60
 
61
 
62
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:caa2c758e2e8c517efa7bb716beeafac8422348ac4abd51c9a1ee28cd9407b81
3
  size 272123144
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a76d2a5d06056bf73fbf5696f6c36edd2fcbc4adcce5d888f6663f19ee959b98
3
  size 272123144
runs/Jul07_14-38-54_Noah-Desktop/events.out.tfevents.1720381135.Noah-Desktop.4396.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:27b9f780ea0d87ec22930c494d4a7d226c6111cf798a2b25ebe22fc93a6302e6
3
- size 85861
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:dbe8a2ebe0bf5d91fb5774740a44c73167e6653881c06ca25bc21476dcac8674
3
+ size 98151
runs/Jul07_14-38-54_Noah-Desktop/events.out.tfevents.1720386030.Noah-Desktop.4396.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:46a1a5860d3dc168564b4033fd9d067c6d6b8f16eccb6bec9bfbfae05fc3b333
3
+ size 359