Afrizal Hasbi Azizy committed
Commit 2b4cb5e • Parent(s): a39e3c8
Update README.md

README.md CHANGED
````diff
@@ -13,7 +13,7 @@ language:
 <center>
 <img src="https://imgur.com/9nG5J1T.png" alt="Kancil" width="600" height="300">
 <p><em>Kancil is a fine-tuned version of Llama 3 8B using synthetic QA dataset generated with Llama 3 70B. Version zero of Kancil is the first generative Indonesian LLM gain functional instruction performance using solely synthetic data.</em></p>
-
+<p><em><a href="https://colab.research.google.com/drive/1526QJYfk32X1CqYKX7IA_FFcIHLXbOkx?usp=sharing" style="color: blue;">Go straight to the colab demo</a></em></p>
 </center>
 
 ### Introducing the Kancil family of open models
@@ -88,14 +88,13 @@ pass
 FastLanguageModel.for_inference(model)
 inputs = tokenizer(
 [
-
-    prompt="
-    response="",
-)
+    prompt_template.format(
+    prompt="Bagaimana canting dan malam digunakan untuk menggambar pola batik?",
+    response="",)
 ], return_tensors = "pt").to("cuda")
 
-outputs = model.generate(**inputs, max_new_tokens = 
-print(tokenizer.batch_decode(outputs)[0])
+outputs = model.generate(**inputs, max_new_tokens = 600, temperature=.8, use_cache = True)
+print(tokenizer.batch_decode(outputs)[0].replace('\\n', '\n'))
 ```
 
 **Note:** There was an issue with the dataset such that newline characters are printed as string literals. Sorry about that!
````
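The two behavioral changes in this commit (filling the prompt template before tokenization, and un-escaping literal `\n` in the decoded output) can be exercised without a GPU. A minimal sketch, assuming a hypothetical Alpaca-style `prompt_template`; the real template is defined earlier in the README, outside this diff:

```python
# Hypothetical stand-in for the prompt_template defined earlier in the
# README (not shown in this diff).
prompt_template = "### Pertanyaan:\n{prompt}\n\n### Jawaban:\n{response}"

# Fill the template the same way the updated inference snippet does,
# leaving the response slot empty for the model to complete.
text = prompt_template.format(
    prompt="Bagaimana canting dan malam digunakan untuk menggambar pola batik?",
    response="",
)

# The dataset issue mentioned in the note: "\n" was stored as two literal
# characters (backslash + n). The added .replace('\\n', '\n') converts
# them back into real newlines after decoding.
raw_output = "Canting adalah alat...\\nMalam adalah lilin..."
fixed_output = raw_output.replace('\\n', '\n')
```

The same `.replace('\\n', '\n')` call is what the commit appends to `tokenizer.batch_decode(outputs)[0]`, so the printed completion shows actual line breaks instead of literal `\n` sequences.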