bart-finetuned-kwsylgen-64-simple_input_BARTlarge

Browse files

Files changed (3) hide show

README.md +73 -0
generation_config.json +24 -0
runs/Apr15_02-18-11_c00207873b47/events.out.tfevents.1713147493.c00207873b47.5037.0 +2 -2

README.md ADDED Viewed

	@@ -0,0 +1,73 @@

+---
+license: apache-2.0
+base_model: facebook/bart-large
+tags:
+- generated_from_trainer
+model-index:
+- name: bart-finetuned-kwsylgen-64-simple_input_BARTlarge
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# bart-finetuned-kwsylgen-64-simple_input_BARTlarge
+This model is a fine-tuned version of [facebook/bart-large](https://huggingface.co/facebook/bart-large) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.1785
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 2e-05
+- train_batch_size: 64
+- eval_batch_size: 64
+- seed: 42
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- num_epochs: 3
+- mixed_precision_training: Native AMP
+### Training results
+| Training Loss | Epoch | Step | Validation Loss |
+|:-------------:|:-----:|:----:|:---------------:|
+| 2.0641        | 0.18  | 500  | 0.2451          |
+| 0.2194        | 0.36  | 1000 | 0.2228          |
+| 0.1989        | 0.54  | 1500 | 0.2086          |
+| 0.1888        | 0.72  | 2000 | 0.2027          |
+| 0.177         | 0.9   | 2500 | 0.1976          |
+| 0.1703        | 1.08  | 3000 | 0.1933          |
+| 0.1647        | 1.26  | 3500 | 0.1928          |
+| 0.159         | 1.44  | 4000 | 0.1890          |
+| 0.1538        | 1.61  | 4500 | 0.1864          |
+| 0.151         | 1.79  | 5000 | 0.1857          |
+| 0.1471        | 1.97  | 5500 | 0.1828          |
+| 0.1436        | 2.15  | 6000 | 0.1814          |
+| 0.1435        | 2.33  | 6500 | 0.1806          |
+| 0.141         | 2.51  | 7000 | 0.1799          |
+| 0.1393        | 2.69  | 7500 | 0.1790          |
+| 0.1388        | 2.87  | 8000 | 0.1785          |
+### Framework versions
+- Transformers 4.38.2
+- Pytorch 2.2.1+cu121
+- Datasets 2.18.0
+- Tokenizers 0.15.2

generation_config.json ADDED Viewed

	@@ -0,0 +1,24 @@

+{
+  "bos_token_id": 0,
+  "clean_up_tokenization_spaces": true,
+  "decoder_start_token_id": 2,
+  "do_sample": true,
+  "early_stopping": true,
+  "eos_token_id": 2,
+  "forced_bos_token_id": 0,
+  "forced_eos_token_id": 2,
+  "max_new_tokens": 64,
+  "n_examples": null,
+  "no_repeat_ngram_size": 2,
+  "num_beams": 4,
+  "pad_to_max_length": true,
+  "pad_token_id": 2,
+  "padding": "max_length",
+  "renormalize_logits": true,
+  "skip_special_tokens": true,
+  "temperature": 0.85,
+  "top_k": 0,
+  "top_p": 0.9,
+  "transformers_version": "4.38.2",
+  "truncation": true
+}

runs/Apr15_02-18-11_c00207873b47/events.out.tfevents.1713147493.c00207873b47.5037.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a6dcafe8f36b1f950161e5a055133eeaf9d74217dd39c742ce3bc89a83de3184
-size 13937

 version https://git-lfs.github.com/spec/v1
+oid sha256:af36b817bf78134f72d3c2a1a3e8ca23dd9978d10ff36a74807824dfd1ca3f5c
+size 14291