rb05751 committed on
Commit
5ea6815
1 Parent(s): 3501244

End of training

Files changed (5)
  1. README.md +4 -4
  2. config.json +1 -1
  3. pytorch_model.bin +1 -1
  4. tokenizer_config.json +4 -0
  5. training_args.bin +1 -1
README.md CHANGED
@@ -1,6 +1,6 @@
 ---
 license: mit
-base_model: gpt2
+base_model: rb05751/reuters-gpt2-text-gen
 tags:
 - generated_from_trainer
 model-index:
@@ -13,9 +13,9 @@ should probably proofread and complete it, then remove this comment. -->
 
 # reuters-gpt2-text-gen
 
-This model is a trained from-scratch [gpt2](https://huggingface.co/gpt2) model on the Reuters dataset.
+This model is a fine-tuned version of [rb05751/reuters-gpt2-text-gen](https://huggingface.co/rb05751/reuters-gpt2-text-gen) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 4.8502
+- Loss: 4.2531
 
 ## Model description
 
@@ -48,7 +48,7 @@ The following hyperparameters were used during training:
 
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 5.2539 | 1.0 | 234 | 4.8502 |
+| 4.8282 | 1.0 | 234 | 4.2531 |
 
 
 ### Framework versions
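
The validation losses in the README.md change (4.8502 before, 4.2531 after) are easier to compare as perplexities. The sketch below assumes the reported loss is the mean per-token cross-entropy in nats, which is the usual convention for causal language model training but is not stated in this commit. The function name `perplexity` is introduced here for illustration only.

```python
import math


def perplexity(cross_entropy_loss: float) -> float:
    """Convert a mean per-token cross-entropy loss (assumed to be in nats) to perplexity."""
    return math.exp(cross_entropy_loss)


# Validation losses taken from the README.md diff in this commit.
before = perplexity(4.8502)
after = perplexity(4.2531)

print(f"perplexity before: {before:.1f}")
print(f"perplexity after:  {after:.1f}")
```

Under that assumption, a lower loss always maps to a lower perplexity, so the second epoch recorded here improves on the first.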
config.json CHANGED
@@ -1,5 +1,5 @@
 {
-  "_name_or_path": "gpt2",
+  "_name_or_path": "rb05751/reuters-gpt2-text-gen",
   "activation_function": "gelu_new",
   "architectures": [
     "GPT2LMHeadModel"
pytorch_model.bin CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:b4e47d511360a17e3d5306971ae46b2c86a5b9ddb3ac28a874bdccb89c58183e
+oid sha256:987c09c4c12cbfb6a5bb05487a31abfd5cdc99b72534a1746dd98228402729f7
 size 503161693
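
The pytorch_model.bin entry above is a Git LFS pointer, not the weights themselves: the `oid` is the SHA-256 digest of the real file and `size` is its length in bytes. A minimal sketch of checking a downloaded file against such a pointer follows. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the local file path is whatever the reader has downloaded.

```python
import hashlib
import os


def parse_lfs_pointer(text: str) -> dict:
    """Split a Git LFS pointer file into its 'key value' fields."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    return fields


def matches_pointer(path: str, pointer_text: str) -> bool:
    """Return True if the file at `path` has the size and SHA-256 named in the pointer."""
    fields = parse_lfs_pointer(pointer_text)
    algo, _, expected_hex = fields["oid"].partition(":")
    if algo != "sha256":
        raise ValueError(f"unsupported oid algorithm: {algo!r}")
    if os.path.getsize(path) != int(fields["size"]):
        return False
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # Read in 1 MiB chunks so large weight files are not loaded into memory at once.
        for chunk in iter(lambda: f.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected_hex
```

Because the size is unchanged (503161693 bytes) while the digest differs, the diff shows the weights were overwritten in place with new values of the same shape.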
tokenizer_config.json CHANGED
@@ -3,7 +3,11 @@
   "bos_token": "<|endoftext|>",
   "clean_up_tokenization_spaces": true,
   "eos_token": "<|endoftext|>",
+  "max_length": 512,
   "model_max_length": 1024,
+  "stride": 0,
   "tokenizer_class": "GPT2Tokenizer",
+  "truncation_side": "right",
+  "truncation_strategy": "longest_first",
   "unk_token": "<|endoftext|>"
 }
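
The keys added to tokenizer_config.json record the truncation settings that were active when the tokenizer was saved: sequences cut to 512 tokens, with tokens dropped from the right end. The sketch below illustrates that behavior on a plain list of token ids. It is a standalone illustration rather than the tokenizer library's own code, and the function name `truncate` is hypothetical.

```python
def truncate(token_ids, max_length, truncation_side="right"):
    """Keep at most `max_length` ids, dropping the excess from the given side."""
    if max_length < 0:
        raise ValueError("max_length must be non-negative")
    if len(token_ids) <= max_length:
        return list(token_ids)
    if truncation_side == "right":
        # Drop tokens from the end, keeping the start of the sequence.
        return list(token_ids[:max_length])
    if truncation_side == "left":
        # Drop tokens from the start, keeping the end of the sequence.
        return list(token_ids[len(token_ids) - max_length:])
    raise ValueError(f"unknown truncation_side: {truncation_side!r}")


# A 600-token input cut with the settings recorded in this commit.
ids = list(range(600))
kept = truncate(ids, max_length=512, truncation_side="right")
print(len(kept), kept[0], kept[-1])
```

Note that `max_length` (512) is smaller than `model_max_length` (1024), so inputs were cut to half of what the model's context window allows.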
training_args.bin CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4ad31608d9225a3bb630a2290fb394a7c999f1de8bf81357e08ed8248bc7af1b
+oid sha256:ffcc96379ddfc61ed6f164dfcaf5517209aaa6715a43b21be6de652dba080891
 size 4091