mhhmm
/

typescript-instruct-20k-v2

@@ -2,36 +2,33 @@
 license: llama2
 library_name: peft
 tags:
-- generated_from_trainer
 base_model: codellama/CodeLlama-13b-hf
 model-index:
 - name: lora-out
   results: []
 ---
-<!-- This model card has been generated automatically according to the information the Trainer had access to. You
-should probably proofread and complete it, then remove this comment. -->
-[<img src="https://raw.githubusercontent.com/OpenAccess-AI-Collective/axolotl/main/image/axolotl-badge-web.png" alt="Built with Axolotl" width="200" height="32"/>](https://github.com/OpenAccess-AI-Collective/axolotl)
-# lora-out
-This model is a fine-tuned version of [codellama/CodeLlama-13b-hf](https://huggingface.co/codellama/CodeLlama-13b-hf) on the None dataset.
-It achieves the following results on the evaluation set:
-- Loss: 0.4263
-## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
-## Training procedure
 ### Training hyperparameters
@@ -53,26 +50,27 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 0.7628        | 0.01  | 1    | 0.7296          |
-| 0.7101        | 0.05  | 7    | 0.6906          |
-| 0.5395        | 0.1   | 14   | 0.5214          |
-| 0.5303        | 0.15  | 21   | 0.4871          |
-| 0.4821        | 0.2   | 28   | 0.4676          |
-| 0.5643        | 0.25  | 35   | 0.4563          |
-| 0.5307        | 0.3   | 42   | 0.4484          |
-| 0.5103        | 0.35  | 49   | 0.4445          |
-| 0.5515        | 0.4   | 56   | 0.4415          |
-| 0.4983        | 0.45  | 63   | 0.4386          |
-| 0.4919        | 0.5   | 70   | 0.4351          |
-| 0.4674        | 0.55  | 77   | 0.4316          |
-| 0.5193        | 0.6   | 84   | 0.4295          |
-| 0.4461        | 0.65  | 91   | 0.4295          |
-| 0.4541        | 0.71  | 98   | 0.4280          |
-| 0.486         | 0.76  | 105  | 0.4280          |
-| 0.4875        | 0.81  | 112  | 0.4269          |
-| 0.5553        | 0.86  | 119  | 0.4266          |
-| 0.4605        | 0.91  | 126  | 0.4260          |
-| 0.4767        | 0.96  | 133  | 0.4263          |
 ### Framework versions
@@ -81,10 +79,27 @@ The following hyperparameters were used during training:
 - Pytorch 2.0.1+cu118
 - Datasets 2.15.0
 - Tokenizers 0.15.0
-## Training procedure
-### Framework versions
-- PEFT 0.6.0

 license: llama2
 library_name: peft
 tags:
+- typescript
+- instruction-tuning
+- code-generation
+- lora
+- peft
 base_model: codellama/CodeLlama-13b-hf
 model-index:
 - name: lora-out
   results: []
+datasets:
+- mhhmm/typescript-instruct-20k
+language:
+- en
+metrics:
+- code_eval
+pipeline_tag: text-generation
 ---
+## Architecture
+![The Architecture](https://github.com/LeVuMinhHuy/brocode/blob/master/.pics/about-the-model.png?raw=true)
+## About
+This model is a fine-tuned version of [codellama/CodeLlama-13b-hf](https://huggingface.co/codellama/CodeLlama-13b-hf).
+It achieves the following results on the evaluation set:
+- Loss: 0.4268
 ### Training hyperparameters
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 0.7555        | 0.01  | 1    | 0.7062          |
+| 0.7036        | 0.05  | 7    | 0.6673          |
+| 0.5422        | 0.1   | 14   | 0.5152          |
+| 0.5351        | 0.15  | 21   | 0.4866          |
+| 0.495         | 0.2   | 28   | 0.4688          |
+| 0.5651        | 0.25  | 35   | 0.4587          |
+| 0.5146        | 0.3   | 42   | 0.4486          |
+| 0.4955        | 0.35  | 49   | 0.4469          |
+| 0.5117        | 0.4   | 56   | 0.4432          |
+| 0.5245        | 0.45  | 63   | 0.4410          |
+| 0.5003        | 0.5   | 70   | 0.4371          |
+| 0.4502        | 0.55  | 77   | 0.4340          |
+| 0.527         | 0.6   | 84   | 0.4315          |
+| 0.48          | 0.65  | 91   | 0.4305          |
+| 0.448         | 0.7   | 98   | 0.4289          |
+| 0.5427        | 0.75  | 105  | 0.4289          |
+| 0.4715        | 0.8   | 112  | 0.4279          |
+| 0.5584        | 0.85  | 119  | 0.4276          |
+| 0.4936        | 0.9   | 126  | 0.4267          |
+| 0.4788        | 0.95  | 133  | 0.4268          |
+| 0.476         | 1.0   | 140  | 0.4268          |
 ### Framework versions
 - Pytorch 2.0.1+cu118
 - Datasets 2.15.0
 - Tokenizers 0.15.0
+- PEFT 0.6.0
+### Evaluation
+I'm using MultiPL-E benchmark, the same as Code Llmama using in their paper
+How to reproduce my evaluation? Just run like the offical document of MultiPL-E: https://nuprl.github.io/MultiPL-E/tutorial.html, change the modal name by my model here: `mhhmm/typescript-instruct-20k`
+This is the code that I ran with Google Colab (using A100 40GB, yes, it requires that much GPU RAM)
+If you even have a stronger GPU, increase the --batch-size, or --completion-limit
+```
+!pip install --upgrade pip
+!pip install aiohttp numpy tqdm pytest datasets torch transformers sentencepiece
+!git clone https://github.com/nuprl/MultiPL-E
+%cd MultiPL-E
+!mkdir typescript
+!python3 automodel.py --name mhhmm/typescript-instruct-20k --root-dataset humaneval --lang ts --temperature 0.2 --batch-size 10 --completion-limit 20 --output-dir-prefix typescript
+%cd evaluation/src
+!python3 main.py --dir ../../typescript --output-dir ../../typescript --recursive
+!python3 pass_k.py ./typescript/*
+```