OwenArli committed on
Commit
30de888
1 Parent(s): 9e5f739

Update README.md

Files changed (1)
  1. README.md +35 -3
README.md CHANGED
@@ -1,3 +1,35 @@
- ---
- license: llama3.1
- ---
+ ---
+ license: llama3.1
+ ---
+ Based on Meta-Llama-3.1-8B-Instruct and governed by the Meta Llama 3.1 License Agreement:
+ https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct
+
+ A model with continued pre-training on Indonesian text and Formax instruct tuning. It excels in Bahasa Indonesia while retaining Formax instruct characteristics.
+
+ A Llama 3.1 model that has been further trained on Indonesian and on an Indonesian Formax-style dataset. Suited to Indonesian-language text tasks.
+
+ Training:
+ - 8192 sequence length
+ - Training took around 6 days on 2x RTX 3090 Ti GPUs
+ - 1 epoch of training on a massive dataset.
+ - LoRA with rank 64 and alpha 128, resulting in ~2% trainable weights.
+
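As a rough check on the "~2% trainable weights" figure in the training notes, the arithmetic can be sketched in plain Python. The layer shapes below are those of the published Llama 3.1 8B config. Which projections received LoRA adapters is not stated in the README, so the sketch assumes all seven linear projections in every decoder layer were targeted; under that assumption the ratio comes out near 2%.

```python
# Llama 3.1 8B shapes, taken from the published model config.
HIDDEN = 4096          # hidden_size
INTERMEDIATE = 14336   # intermediate_size (MLP width)
KV_DIM = 1024          # 8 key/value heads x 128 head_dim
LAYERS = 32            # num_hidden_layers
VOCAB = 128256         # vocab_size

RANK = 64              # LoRA rank stated in the training notes

# (in_features, out_features) for each linear projection in one decoder layer.
# ASSUMPTION: every one of these was a LoRA target; the README does not say.
PROJECTIONS = {
    "q_proj": (HIDDEN, HIDDEN),
    "k_proj": (HIDDEN, KV_DIM),
    "v_proj": (HIDDEN, KV_DIM),
    "o_proj": (HIDDEN, HIDDEN),
    "gate_proj": (HIDDEN, INTERMEDIATE),
    "up_proj": (HIDDEN, INTERMEDIATE),
    "down_proj": (INTERMEDIATE, HIDDEN),
}


def lora_param_count(rank: int) -> int:
    """Adapter weights: each target adds A (rank x in) and B (out x rank)."""
    per_layer = sum(
        rank * (fan_in + fan_out) for fan_in, fan_out in PROJECTIONS.values()
    )
    return LAYERS * per_layer


def base_param_count() -> int:
    """Frozen base weights: decoder layers, untied embeddings, and norms."""
    per_layer = sum(fan_in * fan_out for fan_in, fan_out in PROJECTIONS.values())
    per_layer += 2 * HIDDEN           # two RMSNorm weight vectors per layer
    embeddings = 2 * VOCAB * HIDDEN   # input embedding + output head
    return LAYERS * per_layer + embeddings + HIDDEN  # + final RMSNorm


adapter = lora_param_count(RANK)
base = base_param_count()
fraction = adapter / (base + adapter)
print(f"adapter: {adapter:,}  base: {base:,}  trainable: {fraction:.2%}")
```

Alpha does not change the parameter count. In the standard LoRA formulation it only scales the adapter update by alpha / rank, which would be 128 / 64 = 2 here.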
+ Quants:
+
+ BF16: https://huggingface.co/ArliAI/Llama-3.1-8B-ArliAI-Indo-Formax-v1.0
+
+ GGUF: https://huggingface.co/ArliAI/Llama-3.1-8B-ArliAI-Indo-Formax-v1.0-GGUF
+
+
+ Suggested prompting strategy:
+ ```
+ <|begin_of_text|><|start_header_id|>system<|end_header_id|>
+
+ You are a [give it a role]. You are tasked with [give it a task]. Reply in the following format: [requested format of reply]<|eot_id|><|start_header_id|>user<|end_header_id|>
+
+ {{ user_message_1 }}<|eot_id|><|start_header_id|>assistant<|end_header_id|>
+
+ {{ model_answer_1 }}<|eot_id|><|start_header_id|>user<|end_header_id|>
+
+ {{ user_message_2 }}<|eot_id|><|start_header_id|>assistant<|end_header_id|>
+ ```
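The template above can be assembled programmatically. The following is a minimal sketch, not part of the README: `build_prompt`, `history`, and the example strings are hypothetical names chosen for illustration, and the special tokens are copied from the template as written.

```python
from typing import List, Tuple


def _block(role: str, content: str) -> str:
    # One complete turn: role header, blank line, content, end-of-turn token.
    return f"<|start_header_id|>{role}<|end_header_id|>\n\n{content}<|eot_id|>"


def build_prompt(
    system: str,
    history: List[Tuple[str, str]],
    user_message: str,
) -> str:
    """Build a Llama 3.1 style prompt string.

    history holds completed (user_message, model_answer) pairs;
    user_message is the new turn the model should answer.
    """
    parts = ["<|begin_of_text|>", _block("system", system)]
    for past_user, past_answer in history:
        parts.append(_block("user", past_user))
        parts.append(_block("assistant", past_answer))
    parts.append(_block("user", user_message))
    # Leave the assistant header open so generation continues from here.
    parts.append("<|start_header_id|>assistant<|end_header_id|>\n\n")
    return "".join(parts)


prompt = build_prompt(
    system=(
        "You are a translator. You are tasked with translating English "
        "to Indonesian. Reply in the following format: one line of text."
    ),
    history=[("Good morning", "Selamat pagi")],
    user_message="Thank you",
)
print(prompt)
```

If the string is later passed through a tokenizer that already prepends a beginning-of-text token, the leading `<|begin_of_text|>` may need to be dropped to avoid a duplicate.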