Chan-Y committed on
Commit
b1b9213
1 Parent(s): dba2abb

Update README.md

Files changed (1)
  1. README.md +1 -40
README.md CHANGED
@@ -5,6 +5,7 @@ tags:
  - trl
  - sft
  - generated_from_trainer
+ - math
  base_model: meta-llama/Meta-Llama-3.1-8B-Instruct
  datasets: TIGER-Lab/MathInstruct
  model-index:
@@ -13,46 +14,6 @@ model-index:
  pipeline_tag: text-generation
  ---
 
- <!-- This model card has been generated automatically according to the information the Trainer had access to. You
- should probably proofread and complete it, then remove this comment. -->
-
  # llama3.1-8B-finetuned-Math
 
  This model is a fine-tuned version of [meta-llama/Meta-Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct) on the [TIGER-Lab/MathInstruct](https://huggingface.co/TIGER-Lab/MathInstruct) dataset.
-
- ## Model description
-
- More information needed
-
- ## Intended uses & limitations
-
- More information needed
-
- ## Training and evaluation data
-
- More information needed
-
- ## Training procedure
-
- ### Training hyperparameters
-
- The following hyperparameters were used during training:
- - learning_rate: 0.0002
- - train_batch_size: 4
- - eval_batch_size: 8
- - seed: 42
- - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- - lr_scheduler_type: cosine
- - training_steps: 3
-
- ### Training results
-
-
-
- ### Framework versions
-
- - PEFT 0.12.0
- - Transformers 4.43.3
- - Pytorch 2.3.0+cu121
- - Datasets 2.19.1
- - Tokenizers 0.19.1