Update README.md
README.md CHANGED
@@ -7,16 +7,10 @@ tags:
 datasets:
 - giulio98/xlcost-single-prompt
 widget:
-- text: "
-  example_title: "
-- text: "def print_hello_world():\n\t"
-  example_title: "Hello World!"
-- text: "def get_file_size(filepath):"
-  example_title: "File size"
-- text: "import numpy as"
-  example_title: "Numpy"
+- text: "'''\nfunction to add two numbers\n'''\n###\n"
+  example_title: "add two numbers"
 model-index:
 - name: codegen-350M-multi-xlcost
   results:
   - task:
       name: Code Generation
@@ -43,8 +37,8 @@ You can load the CodeGen-350M-multi-xlcost model and tokenizer directly in `transformers`:
 
 ```Python
 from transformers import AutoTokenizer, AutoModelForCausalLM
-tokenizer = AutoTokenizer.from_pretrained("giulio98/codegen-350M-multi-xlcost
-model = AutoModelForCausalLM.from_pretrained("giulio98/codegen-350M-multi-xlcost
+tokenizer = AutoTokenizer.from_pretrained("giulio98/codegen-350M-multi-xlcost")
+model = AutoModelForCausalLM.from_pretrained("giulio98/codegen-350M-multi-xlcost")
 
 text = tokenizer.eos_token + "\'\'\'\n" + "function to add two numbers" + "\n\'\'\'\n" + "###\n"
 input_ids = tokenizer(text, return_tensors="pt").input_ids
@@ -67,7 +61,7 @@ The model was finetuned on [XLCost-single-prompt](https://huggingface.co/dataset
 xlcost-text-to-code](https://huggingface.co/datasets/codeparrot/xlcost-text-to-code). The hyperparameters are listed below.
 
 
-
+|------|------------------|
 |Per device train batch size| 8 |
 |Context size| 1024 |
 |Training steps| 258|
@@ -84,8 +78,8 @@ The training was executed on 1 x V100 (16GB) GPU for 6h 42m
 
 We evaluated the model on the first 400 samples of XLCOST's [XLCost-single-prompt test split](https://huggingface.co/datasets/giulio98/xlcost-single-prompt/viewer/Python/test), comparing the outputs of the generated code with the expected outputs using the pass@k metric.
 
-| Metric | codegen-350M-multi-xlcost
-
+| Metric | codegen-350M-multi-xlcost |
+|--------|-----|
 |pass@1 | 3.70% |
 |pass@10 | 14.5% |
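
The new widget example and the README snippet both use the same prompt format: the tokenizer's EOS token, the task description wrapped in `'''` triple quotes, then a `###` separator. A minimal sketch of building such a prompt, without loading the model (the `build_prompt` helper is illustrative, and `"<|endoftext|>"` is an assumption standing in for `tokenizer.eos_token`):

```python
def build_prompt(description: str, eos_token: str = "<|endoftext|>") -> str:
    """Wrap a task description in the EOS + '''...''' + ### format
    shown in the model card's usage example."""
    return eos_token + "'''\n" + description + "\n'''\n" + "###\n"

# Reproduces the `text` variable from the README snippet:
prompt = build_prompt("function to add two numbers")
print(prompt)
```

In the actual usage example, this string is tokenized with `tokenizer(text, return_tensors="pt")` and fed to the model.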
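
The evaluation table reports pass@1 and pass@10. The model card does not show how these were computed, but the standard unbiased pass@k estimator from the Codex paper can be sketched as follows (the function name and the sample numbers in the usage lines are illustrative, not taken from this evaluation):

```python
import math

def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimator: probability that at least one of k
    samples, drawn without replacement from n generations of which c
    are correct, passes the unit tests."""
    if n - c < k:
        # Fewer than k incorrect samples exist, so any draw of k
        # samples must contain a correct one.
        return 1.0
    return 1.0 - math.comb(n - c, k) / math.comb(n, k)

# Hypothetical example: 10 generations per problem, 1 correct.
print(pass_at_k(n=10, c=1, k=1))   # pass@1 for this problem
print(pass_at_k(n=10, c=1, k=10))  # pass@10 for this problem
```

The per-problem estimates are then averaged over the evaluation set (here, the first 400 test samples) to obtain the reported percentages.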