CoCoRooXin committed
Commit 7348013
1 Parent(s): 3bd729c

./lora_adapter_bart_on_eu

Files changed (2)
  1. README.md +40 -12
  2. adapter_model.safetensors +1 -1
README.md CHANGED
@@ -4,6 +4,11 @@ library_name: peft
4
  tags:
5
  - generated_from_trainer
6
  base_model: facebook/bart-large-mnli
7
  model-index:
8
  - name: finetuned_bart
9
  results: []
@@ -16,16 +21,11 @@ should probably proofread and complete it, then remove this comment. -->
16
 
17
  This model is a fine-tuned version of [facebook/bart-large-mnli](https://huggingface.co/facebook/bart-large-mnli) on an unknown dataset.
18
  It achieves the following results on the evaluation set:
19
- - eval_loss: 0.1971
20
- - eval_f1: 0.9284
21
- - eval_precision: 0.9114
22
- - eval_recall: 0.9461
23
- - eval_accuracy: 0.9271
24
- - eval_runtime: 45.0597
25
- - eval_samples_per_second: 163.095
26
- - eval_steps_per_second: 2.552
27
- - epoch: 1.9403
28
- - step: 1040
29
 
30
  ## Model description
31
 
@@ -44,13 +44,41 @@ More information needed
44
  ### Training hyperparameters
45
 
46
  The following hyperparameters were used during training:
47
- - learning_rate: 0.0002
48
  - train_batch_size: 64
49
  - eval_batch_size: 64
50
  - seed: 42
51
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
52
  - lr_scheduler_type: linear
53
- - num_epochs: 5
54
 
55
  ### Framework versions
56
 
 
4
  tags:
5
  - generated_from_trainer
6
  base_model: facebook/bart-large-mnli
7
+ metrics:
8
+ - f1
9
+ - precision
10
+ - recall
11
+ - accuracy
12
  model-index:
13
  - name: finetuned_bart
14
  results: []
 
21
 
22
  This model is a fine-tuned version of [facebook/bart-large-mnli](https://huggingface.co/facebook/bart-large-mnli) on an unknown dataset.
23
  It achieves the following results on the evaluation set:
24
+ - Loss: 0.0620
25
+ - F1: 0.9236
26
+ - Precision: 0.9000
27
+ - Recall: 0.9485
28
+ - Accuracy: 0.9216
29
 
30
  ## Model description
31
 
 
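Reviewer note: the new evaluation block reports F1 alongside precision and recall, so the three values can be cross-checked. The sketch below assumes the card reports a plain binary F1 (the harmonic mean of precision and recall); the averaging mode is not stated in the README, and the function name `f1_from_precision_recall` is made up for illustration.

```python
def f1_from_precision_recall(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (binary F1, an assumption here)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Final evaluation values copied from the updated README.md in this commit.
reported = {"precision": 0.9000, "recall": 0.9485, "f1": 0.9236}

recomputed = f1_from_precision_recall(reported["precision"], reported["recall"])
print(f"recomputed F1 = {recomputed:.4f}, reported F1 = {reported['f1']:.4f}")
```

If the recomputed value disagreed with the reported one beyond rounding, that would suggest a different averaging mode (for example macro or weighted) rather than an error in the card.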
44
  ### Training hyperparameters
45
 
46
  The following hyperparameters were used during training:
47
+ - learning_rate: 2e-05
48
  - train_batch_size: 64
49
  - eval_batch_size: 64
50
  - seed: 42
51
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
52
  - lr_scheduler_type: linear
53
+ - lr_scheduler_warmup_steps: 20
54
+ - num_epochs: 2
55
+
56
+ ### Training results
57
+
58
+ | Training Loss | Epoch | Step | Validation Loss | F1 | Precision | Recall | Accuracy |
59
+ |:-------------:|:------:|:----:|:---------------:|:------:|:---------:|:------:|:--------:|
60
+ | 0.0856 | 0.0933 | 50 | 0.0695 | 0.9122 | 0.9010 | 0.9238 | 0.9111 |
61
+ | 0.0593 | 0.1866 | 100 | 0.0685 | 0.9152 | 0.8970 | 0.9341 | 0.9135 |
62
+ | 0.0572 | 0.2799 | 150 | 0.0681 | 0.9149 | 0.8997 | 0.9306 | 0.9135 |
63
+ | 0.0549 | 0.3731 | 200 | 0.0679 | 0.9150 | 0.9054 | 0.9249 | 0.9141 |
64
+ | 0.0529 | 0.4664 | 250 | 0.0678 | 0.9174 | 0.9043 | 0.9308 | 0.9162 |
65
+ | 0.0776 | 0.5597 | 300 | 0.0673 | 0.9158 | 0.9079 | 0.9238 | 0.9151 |
66
+ | 0.0799 | 0.6530 | 350 | 0.0647 | 0.9201 | 0.8964 | 0.9450 | 0.9179 |
67
+ | 0.0806 | 0.7463 | 400 | 0.0647 | 0.9196 | 0.8968 | 0.9436 | 0.9175 |
68
+ | 0.0781 | 0.8396 | 450 | 0.0635 | 0.9193 | 0.8982 | 0.9415 | 0.9174 |
69
+ | 0.0771 | 0.9328 | 500 | 0.0633 | 0.9189 | 0.9019 | 0.9366 | 0.9174 |
70
+ | 0.0787 | 1.0261 | 550 | 0.0629 | 0.9202 | 0.8994 | 0.9420 | 0.9184 |
71
+ | 0.0737 | 1.1194 | 600 | 0.0627 | 0.9210 | 0.8989 | 0.9442 | 0.9190 |
72
+ | 0.0722 | 1.2127 | 650 | 0.0634 | 0.9212 | 0.8981 | 0.9455 | 0.9192 |
73
+ | 0.0684 | 1.3060 | 700 | 0.0630 | 0.9217 | 0.9065 | 0.9374 | 0.9204 |
74
+ | 0.0655 | 1.3993 | 750 | 0.0629 | 0.9228 | 0.8974 | 0.9496 | 0.9205 |
75
+ | 0.0739 | 1.4925 | 800 | 0.0625 | 0.9229 | 0.8993 | 0.9477 | 0.9208 |
76
+ | 0.0666 | 1.5858 | 850 | 0.0625 | 0.9233 | 0.8962 | 0.9521 | 0.9209 |
77
+ | 0.0703 | 1.6791 | 900 | 0.0621 | 0.9238 | 0.9001 | 0.9488 | 0.9218 |
78
+ | 0.0738 | 1.7724 | 950 | 0.0617 | 0.9227 | 0.9007 | 0.9458 | 0.9208 |
79
+ | 0.068 | 1.8657 | 1000 | 0.0620 | 0.9233 | 0.9002 | 0.9477 | 0.9213 |
80
+ | 0.069 | 1.9590 | 1050 | 0.0620 | 0.9236 | 0.9000 | 0.9485 | 0.9216 |
81
+
82
 
83
  ### Framework versions
84
 
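Reviewer note: this commit lowers `learning_rate` from 0.0002 to 2e-05 and adds `lr_scheduler_warmup_steps: 20` under a `linear` scheduler. The sketch below shows the learning-rate curve those settings would produce under the usual linear warmup followed by linear decay. It is written in plain Python and is not the training code behind this commit. The `total_steps=1072` default is an estimate, not a value from the README: step 1050 is logged at epoch 1.9590, which implies about 536 steps per epoch, or about 1072 steps for 2 epochs.

```python
def linear_schedule_lr(
    step: int,
    base_lr: float = 2e-05,     # learning_rate from the updated README
    warmup_steps: int = 20,     # lr_scheduler_warmup_steps from the updated README
    total_steps: int = 1072,    # ASSUMPTION: estimated from the Epoch/Step columns
) -> float:
    """Learning rate at `step` for linear warmup followed by linear decay to zero."""
    if step < warmup_steps:
        # Warmup: ramp linearly from 0 up to base_lr.
        return base_lr * step / max(1, warmup_steps)
    # Decay: fall linearly from base_lr at the end of warmup to 0 at total_steps.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


# Print the rate at a few of the steps logged in the training-results table.
for logged_step in (0, 10, 20, 50, 500, 1050):
    print(f"step {logged_step:>4}: lr = {linear_schedule_lr(logged_step):.3e}")
```

With only 20 warmup steps out of roughly a thousand, the rate reaches its peak before the first logged evaluation at step 50 and decays for the rest of the run.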
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:cb3c0a44af6c1a485ef526d9b5b64f842f48be3b995bf9d4388848b65936a717
3
  size 29711564
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f9fa92d4f92262b0ece2b5f81fe59005879bf0a650bf57023318f0c93609151e
3
  size 29711564