dkshjn committed on
Commit
a03262f
1 Parent(s): d8c2e18

Update README.md

Files changed (1)
  1. README.md +17 -2
README.md CHANGED
@@ -12,7 +12,15 @@ library_name: adapter-transformers
 
  # Technoculture/MT7Bi-alpha-dpo-v-0.2
 
- ## DPO Training Dataset Details
+ ## Training Details
+
+ - **GPU:** Nvidia A100 Tensor Core GPU
+ - **Total Batches:** 4266
+ - **Epochs:** 3
+ - **Duration:** 3 hours, 59 minutes, and 55 seconds
+
+
+ ## DPO Training Dataset Mixture
  | Dataset Name | Original Size(Rows) | Ratio | Size After Ratio(Rows) |
  |----------------------------------------------------|---------------|-------|------------------|
  | argilla/distilabel-math-preference-dpo | 2.4k | 1.0 | 2.4k |
@@ -21,4 +29,11 @@ library_name: adapter-transformers
  | argilla/distilabel-capybara-dpo-7k-binarized | 7.5k | 0.2 | 1.5k |
  Total Size: 11.38k
 
- For full details of this dpo-training please read our notebook [MT7Bi-alpha-dpo-v0.2.ipynb](https://colab.research.google.com/drive/1nA0kHK8yGuz0IRlZD9pzsYYZnlL7LSeI?usp=sharing).
+ ## Training Loss Plot
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/658bed1c8ff537204fbd92a3/CKi7ArBnCyuidJPHo3M5T.png)
+
+ ## Training Loss Smoothed Plot
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/658bed1c8ff537204fbd92a3/tFyGJLw3Vj3m2jaaWk66E.png)
+
+ ### For full details of this dpo-training please read our notebook [MT7Bi-alpha-dpo-v0.2.ipynb](https://colab.research.google.com/drive/1nA0kHK8yGuz0IRlZD9pzsYYZnlL7LSeI?usp=sharing).
+
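
The figures this commit adds under Training Details can be cross-checked against the dataset mixture table with some quick arithmetic. The sketch below is not taken from the linked notebook. It uses only numbers stated in the README (4266 batches, 3 epochs, 3 hours 59 minutes 55 seconds, 11.38k rows) and assumes that "Size After Ratio" is simply the original size times the ratio, and that each epoch passes over the full mixture once. The implied batch size is an inference from those assumptions, not a value the README states.

```python
# Rough sanity check of the README's training figures.
# Inputs are copied from the README; every derived value is an estimate.

total_batches = 4266
epochs = 3
duration_s = 3 * 3600 + 59 * 60 + 55   # 3 h 59 min 55 s
total_rows = 11_380                     # "Total Size: 11.38k"

# Assumption: sampled size = original size * ratio.
# Only the two table rows visible in this diff are listed here.
mixture = {
    "argilla/distilabel-math-preference-dpo": (2_400, 1.0),
    "argilla/distilabel-capybara-dpo-7k-binarized": (7_500, 0.2),
}
sampled = {name: round(size * ratio) for name, (size, ratio) in mixture.items()}

batches_per_epoch = total_batches / epochs            # 1422.0
seconds_per_batch = duration_s / total_batches        # roughly 3.37
implied_batch_size = total_rows / batches_per_epoch   # roughly 8.0 (inferred)

for name, rows in sampled.items():
    print(f"{name}: {rows} rows after ratio")
print(f"batches per epoch:  {batches_per_epoch:.0f}")
print(f"seconds per batch:  {seconds_per_batch:.2f}")
print(f"implied batch size: {implied_batch_size:.2f}")
```

Under these assumptions the numbers are mutually consistent: about 1422 batches per epoch over 11.38k rows points to an effective batch size near 8, at a little over three seconds per batch.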