sagorsarker
commited on
Commit
•
ca65849
1
Parent(s):
a4f214e
Update README.md
Browse files
README.md
CHANGED
@@ -21,6 +21,7 @@ Notable training configs:
|
|
21 |
- max_sequence_length: 2048
|
22 |
- vocab_size: 72000
|
23 |
- attn_impl: flash
|
|
|
24 |
|
25 |
__Training evaluation status__
|
26 |
|
|
|
21 |
- max_sequence_length: 2048
|
22 |
- vocab_size: 72000
|
23 |
- attn_impl: flash
|
24 |
+
- Trained on 8 H100 GPU on GCP
|
25 |
|
26 |
__Training evaluation status__
|
27 |
|