chansung commited on
Commit
52dca63
1 Parent(s): b645907

End of training

Browse files
README.md CHANGED
@@ -1,10 +1,11 @@
1
  ---
2
  base_model: mistralai/Mistral-7B-v0.3
3
  datasets:
4
- - generator
5
  library_name: peft
6
  license: apache-2.0
7
  tags:
 
8
  - trl
9
  - sft
10
  - generated_from_trainer
@@ -18,7 +19,7 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  # mistral-7b-0.3-gpt4o_100k_summarize-lora
20
 
21
- This model is a fine-tuned version of [mistralai/Mistral-7B-v0.3](https://huggingface.co/mistralai/Mistral-7B-v0.3) on the generator dataset.
22
  It achieves the following results on the evaluation set:
23
  - Loss: 1.8165
24
 
 
1
  ---
2
  base_model: mistralai/Mistral-7B-v0.3
3
  datasets:
4
+ - llama-duo/synth_summarize_dataset_dedup
5
  library_name: peft
6
  license: apache-2.0
7
  tags:
8
+ - alignment-handbook
9
  - trl
10
  - sft
11
  - generated_from_trainer
 
19
 
20
  # mistral-7b-0.3-gpt4o_100k_summarize-lora
21
 
22
+ This model is a fine-tuned version of [mistralai/Mistral-7B-v0.3](https://huggingface.co/mistralai/Mistral-7B-v0.3) on the llama-duo/synth_summarize_dataset_dedup dataset.
23
  It achieves the following results on the evaluation set:
24
  - Loss: 1.8165
25
 
all_results.json CHANGED
@@ -1,5 +1,10 @@
1
  {
2
  "epoch": 1.0,
 
 
 
 
 
3
  "total_flos": 6.128945862132367e+17,
4
  "train_loss": 0.7845394540595138,
5
  "train_runtime": 3064.8006,
 
1
  {
2
  "epoch": 1.0,
3
+ "eval_loss": 1.8165334463119507,
4
+ "eval_runtime": 0.9325,
5
+ "eval_samples": 25,
6
+ "eval_samples_per_second": 12.868,
7
+ "eval_steps_per_second": 1.072,
8
  "total_flos": 6.128945862132367e+17,
9
  "train_loss": 0.7845394540595138,
10
  "train_runtime": 3064.8006,
eval_results.json CHANGED
@@ -1,8 +1,8 @@
1
  {
2
  "epoch": 1.0,
3
- "eval_loss": 1.8227200508117676,
4
- "eval_runtime": 1.5837,
5
  "eval_samples": 25,
6
- "eval_samples_per_second": 7.577,
7
- "eval_steps_per_second": 0.631
8
  }
 
1
  {
2
  "epoch": 1.0,
3
+ "eval_loss": 1.8165334463119507,
4
+ "eval_runtime": 0.9325,
5
  "eval_samples": 25,
6
+ "eval_samples_per_second": 12.868,
7
+ "eval_steps_per_second": 1.072
8
  }
runs/Sep29_04-57-25_637888cb4368/events.out.tfevents.1727589018.637888cb4368.915504.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:309c55be4158c591356778e8dac7b716400bb6fadf6d9c8cf3638ae28964a29b
3
+ size 359