sagorsarker committed
Commit e9e1766
1 parent: 54dbf4f

Update README.md

Files changed (1)
  1. README.md +7 -3
README.md CHANGED
@@ -22,13 +22,17 @@ Notable training configs:
  - vocab_size: 72000
  - attn_impl: flash

- __Training status__
+ __Training evaluation status__

  - Evaluation CrossEntropy Loss
- <img src="https://cdn-uploads.huggingface.co/production/uploads/5f40b34279c1ba4c353d0c7a/Mr0yAg9AfXTm15GATgSTN.png" alt="alt text" width="620" height="620">
+
+ Final loss: 3.11
+ <img src="https://cdn-uploads.huggingface.co/production/uploads/5f40b34279c1ba4c353d0c7a/Mr0yAg9AfXTm15GATgSTN.png" alt="alt text" width="620" height="620">

  - Language Perplexity
- <img src="https://cdn-uploads.huggingface.co/production/uploads/5f40b34279c1ba4c353d0c7a/B-ZC1LfFZdCTO25Twcyth.png" alt="alt text" width="620" height="620">
+
+ Final Perplexity: 22.562
+ <img src="https://cdn-uploads.huggingface.co/production/uploads/5f40b34279c1ba4c353d0c7a/B-ZC1LfFZdCTO25Twcyth.png" alt="alt text" width="620" height="620">

  ## Datasets
  We add Bangla text datasets from several sources including
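
The two figures this commit adds to the README (`Final loss: 3.11` and `Final Perplexity: 22.562`) can be sanity-checked against each other. The sketch below assumes that perplexity is the exponential of the mean cross-entropy loss measured in nats, which is the usual convention but is not stated in the README. The function and variable names are illustrative and do not come from the repository.

```python
import math


def perplexity_from_loss(mean_cross_entropy_nats: float) -> float:
    """Perplexity under the assumption PPL = exp(mean cross-entropy in nats)."""
    return math.exp(mean_cross_entropy_nats)


def loss_from_perplexity(perplexity: float) -> float:
    """Inverse of the assumed relation: loss = ln(perplexity)."""
    return math.log(perplexity)


# Values reported in the README change (rounded as shown there).
reported_loss = 3.11
reported_perplexity = 22.562

# What each reported value implies about the other under the assumption.
implied_perplexity = perplexity_from_loss(reported_loss)
implied_loss = loss_from_perplexity(reported_perplexity)

print(f"perplexity implied by loss {reported_loss}: {implied_perplexity:.3f}")
print(f"loss implied by perplexity {reported_perplexity}: {implied_loss:.4f}")
```

Under this assumption the reported loss implies a perplexity of about 22.42, and the reported perplexity implies a loss of about 3.116. The two values are therefore close but not identical. The small gap could come from how the loss was rounded or from how the evaluation averages over batches, neither of which the README specifies.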