End of training
Browse files- README.md +9 -9
- model.safetensors +1 -1
- runs/Jun10_06-00-51_c19863619500/events.out.tfevents.1717999302.c19863619500.985.0 +3 -0
- runs/Jun10_06-00-51_c19863619500/events.out.tfevents.1717999368.c19863619500.985.1 +3 -0
- runs/Jun10_06-03-28_c19863619500/events.out.tfevents.1717999416.c19863619500.985.2 +3 -0
- runs/Jun10_06-03-28_c19863619500/events.out.tfevents.1717999517.c19863619500.985.3 +3 -0
- runs/Jun10_06-05-33_c19863619500/events.out.tfevents.1717999538.c19863619500.985.4 +3 -0
- runs/Jun10_06-05-33_c19863619500/events.out.tfevents.1717999628.c19863619500.985.5 +3 -0
- runs/Jun10_06-08-08_c19863619500/events.out.tfevents.1717999698.c19863619500.985.6 +3 -0
- runs/Jun10_06-10-07_c19863619500/events.out.tfevents.1717999813.c19863619500.985.7 +3 -0
- runs/Jun10_06-10-07_c19863619500/events.out.tfevents.1717999907.c19863619500.985.8 +3 -0
- training_args.bin +1 -1
README.md
CHANGED
@@ -14,12 +14,12 @@ should probably proofread and complete it, then remove this comment. -->
|
|
14 |
|
15 |
This model is a fine-tuned version of [airesearch/wangchanberta-base-att-spm-uncased](https://huggingface.co/airesearch/wangchanberta-base-att-spm-uncased) on an unknown dataset.
|
16 |
It achieves the following results on the evaluation set:
|
17 |
-
- eval_loss: 0.
|
18 |
-
- eval_accuracy: {'accuracy': 0.
|
19 |
-
- eval_f1score: {'f1': 0.
|
20 |
-
- eval_runtime:
|
21 |
-
- eval_samples_per_second:
|
22 |
-
- eval_steps_per_second: 16.
|
23 |
- step: 0
|
24 |
|
25 |
## Model description
|
@@ -39,14 +39,14 @@ More information needed
|
|
39 |
### Training hyperparameters
|
40 |
|
41 |
The following hyperparameters were used during training:
|
42 |
-
- learning_rate:
|
43 |
- train_batch_size: 8
|
44 |
- eval_batch_size: 8
|
45 |
- seed: 42
|
46 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
47 |
- lr_scheduler_type: linear
|
48 |
-
- lr_scheduler_warmup_steps:
|
49 |
-
- num_epochs:
|
50 |
|
51 |
### Framework versions
|
52 |
|
|
|
14 |
|
15 |
This model is a fine-tuned version of [airesearch/wangchanberta-base-att-spm-uncased](https://huggingface.co/airesearch/wangchanberta-base-att-spm-uncased) on an unknown dataset.
|
16 |
It achieves the following results on the evaluation set:
|
17 |
+
- eval_loss: 0.8371
|
18 |
+
- eval_accuracy: {'accuracy': 0.6541353383458647}
|
19 |
+
- eval_f1score: {'f1': 0.5173615857826385}
|
20 |
+
- eval_runtime: 1.0057
|
21 |
+
- eval_samples_per_second: 132.25
|
22 |
+
- eval_steps_per_second: 16.904
|
23 |
- step: 0
|
24 |
|
25 |
## Model description
|
|
|
39 |
### Training hyperparameters
|
40 |
|
41 |
The following hyperparameters were used during training:
|
42 |
+
- learning_rate: 1e-05
|
43 |
- train_batch_size: 8
|
44 |
- eval_batch_size: 8
|
45 |
- seed: 42
|
46 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
47 |
- lr_scheduler_type: linear
|
48 |
+
- lr_scheduler_warmup_steps: 66
|
49 |
+
- num_epochs: 10
|
50 |
|
51 |
### Framework versions
|
52 |
|
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 421011004
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:2c675cc24e9398bf67e2585873dceb081aec26f6523bba80f13e7cf729eae869
|
3 |
size 421011004
|
runs/Jun10_06-00-51_c19863619500/events.out.tfevents.1717999302.c19863619500.985.0
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:3ca84ad97ba4a1292cdb272fb04ff1d32367e73978084c130b6e57c3ce067dc1
|
3 |
+
size 8363
|
runs/Jun10_06-00-51_c19863619500/events.out.tfevents.1717999368.c19863619500.985.1
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:cbdd694202990b6b811debcbf5d36108729d8341cb0fa97b879d47fb68d28152
|
3 |
+
size 297
|
runs/Jun10_06-03-28_c19863619500/events.out.tfevents.1717999416.c19863619500.985.2
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:17164eecb66cf479b8bac5d34044f12bec53b0776b00f449d0767b55403dd488
|
3 |
+
size 9679
|
runs/Jun10_06-03-28_c19863619500/events.out.tfevents.1717999517.c19863619500.985.3
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:d3af0238834a113716b731b4cb4608ce76fd12118957f348178103cef40189be
|
3 |
+
size 297
|
runs/Jun10_06-05-33_c19863619500/events.out.tfevents.1717999538.c19863619500.985.4
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:f894e8c6dd5df8e975d0bb93ba9e2bffba80d6238d26b359e4dcf5f4655f84c7
|
3 |
+
size 9679
|
runs/Jun10_06-05-33_c19863619500/events.out.tfevents.1717999628.c19863619500.985.5
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:8511d2541aa7c07a8298d44e07b2a4a72bbd10d4a87eb89472fa8fdbe135d578
|
3 |
+
size 297
|
runs/Jun10_06-08-08_c19863619500/events.out.tfevents.1717999698.c19863619500.985.6
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:ebb6387aa4e7897f2bdcb7e820a248d8aca759fdeef01eda254445cc7b4e0ef4
|
3 |
+
size 9680
|
runs/Jun10_06-10-07_c19863619500/events.out.tfevents.1717999813.c19863619500.985.7
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:8cf2b5634c5363f60820895f3e846d699dea4f35cb70aa03f4ab4f608d37a230
|
3 |
+
size 9679
|
runs/Jun10_06-10-07_c19863619500/events.out.tfevents.1717999907.c19863619500.985.8
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:d2767150b44e14ba07c8720993e1677e3aae95a9f5898067660fc46c768299bf
|
3 |
+
size 297
|
training_args.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 5112
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:eba686fc4b5f28decb8034f1e7905e9d777e1ffd1bcf1c9f5e5d6f093892e0a6
|
3 |
size 5112
|