ibm-granite/granite-3.0-2b-base

@@ -30,7 +30,7 @@ model-index:
     metrics:
     - name: pass@1
       type: pass@1
-      value: 23.64
       veriefied: false
   - task:
       type: text-generation
@@ -40,7 +40,7 @@ model-index:
     metrics:
     - name: pass@1
       type: pass@1
-      value: 21.75
       veriefied: false
   - task:
       type: text-generation
@@ -50,7 +50,7 @@ model-index:
     metrics:
     - name: pass@1
       type: pass@1
-      value: 71.59
       veriefied: false
   - task:
       type: text-generation
@@ -60,7 +60,7 @@ model-index:
     metrics:
     - name: pass@1
       type: pass@1
-      value: 42.80
       veriefied: false
   - task:
       type: text-generation
@@ -90,7 +90,7 @@ model-index:
     metrics:
     - name: pass@1
       type: pass@1
-      value: 75.76
       veriefied: false
   - task:
       type: text-generation
@@ -130,7 +130,7 @@ model-index:
     metrics:
     - name: pass@1
       type: pass@1
-      value: 47.61
       veriefied: false
   - task:
       type: text-generation
@@ -140,7 +140,7 @@ model-index:
     metrics:
     - name: pass@1
       type: pass@1
-      value: 29.19
       veriefied: false
   - task:
       type: text-generation
@@ -150,7 +150,17 @@ model-index:
     metrics:
     - name: pass@1
       type: pass@1
-      value: 46.89
       veriefied: false
   - task:
       type: text-generation
@@ -160,7 +170,7 @@ model-index:
     metrics:
     - name: pass@1
       type: pass@1
-      value: 31.71
       veriefied: false
   - task:
       type: text-generation
@@ -180,7 +190,7 @@ model-index:
     metrics:
     - name: pass@1
       type: pass@1
-      value: 51.48
       veriefied: false
   - task:
       type: text-generation
@@ -191,19 +201,11 @@ model-index:
     - name: pass@1
       type: pass@1
       value: 19.46
-      veriefied: false
-  - task:
-      type: text-generation
-    dataset:
-        type: multilingual
-        name: MGSM
-    metrics:
-    - name: pass@1
-      type: pass@1
-      value: 30.47
       veriefied: false
 ---
 <!-- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/62cd5057674cdb524450093d/1hzxoPwqkBJXshKVVe6_9.png) -->
 # Granite-3.0-2B-Base
@@ -211,14 +213,14 @@ model-index:
 **Granite-3.0-2B-Base** is an open-source decoder-only language model from IBM Research that supports a variety of text-to-text generation tasks (e.g., question-answering, text-completion). **Granite-3.0-2B-Base** is trained from scratch and follows a two-phase training strategy. In the first phase, it is trained on 10 trillion tokens sourced from diverse domains. During the second phase, it is further trained on 2 trillion tokens using a carefully curated mix of high-quality data, aiming to enhance its performance on specific tasks.
 - **Developers:** IBM Research
-- **GitHub Repository:** [ibm-granite/granite-language-models](https://github.com/ibm-granite/granite-language-models)
 - **Website**: [Granite Docs](https://www.ibm.com/granite/docs/)
-- **Paper:** [Granite Language Models](https://) <!--     TO DO: Update github repo link when it is ready -->
 - **Release Date**: October 21st, 2024
-- **License:** [Apache 2.0](https://www.apache.org/licenses/LICENSE-2.0).
 ## Supported Languages
-English, German, Spanish, French, Japanese, Portuguese, Arabic, Czech, Italian, Korean, Dutch, Chinese (Simplified)
 ## Usage
 ### Intended use
@@ -300,4 +302,4 @@ The use of Large Language Models involves risks and ethical considerations peopl
   year = {2024},
   url = {https://arxiv.org/abs/0000.00000},
 }
-```

     metrics:
     - name: pass@1
       type: pass@1
+      value: 23.79
       veriefied: false
   - task:
       type: text-generation
     metrics:
     - name: pass@1
       type: pass@1
+      value: 22.56
       veriefied: false
   - task:
       type: text-generation
     metrics:
     - name: pass@1
       type: pass@1
+      value: 74.90
       veriefied: false
   - task:
       type: text-generation
     metrics:
     - name: pass@1
       type: pass@1
+      value: 43.00
       veriefied: false
   - task:
       type: text-generation
     metrics:
     - name: pass@1
       type: pass@1
+      value: 77.65
       veriefied: false
   - task:
       type: text-generation
     metrics:
     - name: pass@1
       type: pass@1
+      value: 54.27
       veriefied: false
   - task:
       type: text-generation
     metrics:
     - name: pass@1
       type: pass@1
+      value: 30.58
       veriefied: false
   - task:
       type: text-generation
     metrics:
     - name: pass@1
       type: pass@1
+      value: 40.69
+      veriefied: false
+  - task:
+      type: text-generation
+    dataset:
+        type: reasoning
+        name: MUSR
+    metrics:
+    - name: pass@1
+      type: pass@1
+      value: 34.34
       veriefied: false
   - task:
       type: text-generation
     metrics:
     - name: pass@1
       type: pass@1
+      value: 38.41
       veriefied: false
   - task:
       type: text-generation
     metrics:
     - name: pass@1
       type: pass@1
+      value: 47.23
       veriefied: false
   - task:
       type: text-generation
     - name: pass@1
       type: pass@1
       value: 19.46
       veriefied: false
 ---
+> IMPORTANT: This model card is an early draft; the final version will be available on Hugging Face on Oct 21st, 2024.
 <!-- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/62cd5057674cdb524450093d/1hzxoPwqkBJXshKVVe6_9.png) -->
+![image/png](granite-3_0-language-models_Group_1.png)
 # Granite-3.0-2B-Base
 **Granite-3.0-2B-Base** is an open-source decoder-only language model from IBM Research that supports a variety of text-to-text generation tasks (e.g., question-answering, text-completion). **Granite-3.0-2B-Base** is trained from scratch and follows a two-phase training strategy. In the first phase, it is trained on 10 trillion tokens sourced from diverse domains. During the second phase, it is further trained on 2 trillion tokens using a carefully curated mix of high-quality data, aiming to enhance its performance on specific tasks.
 - **Developers:** IBM Research
+- **GitHub Repository:** [ibm-granite/granite-3.0-language-models](https://github.com/ibm-granite/granite-3.0-language-models)
 - **Website**: [Granite Docs](https://www.ibm.com/granite/docs/)
+- **Paper:** [Granite 3.0 Language Models]()
 - **Release Date**: October 21st, 2024
+- **License:** [Apache 2.0](https://www.apache.org/licenses/LICENSE-2.0)
 ## Supported Languages
+English, German, Spanish, French, Japanese, Portuguese, Arabic, Czech, Italian, Korean, Dutch, Chinese
 ## Usage
 ### Intended use
   year = {2024},
   url = {https://arxiv.org/abs/0000.00000},
 }
+```
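
To make the metric changes in this diff easier to scan, the following minimal sketch lists the old and new `pass@1` values and their differences. It assumes the removed and added `value:` lines pair up in hunk order, which the flattened diff suggests but does not state outright. The dataset names are not visible in the hunks, so entries are indexed by position only. The `UPDATES` list and the `deltas` helper are illustrative names, not part of the model card. The removed MGSM entry (30.47) and the newly added MUSR entry (34.34) have no counterpart and are left out.

```python
# (old, new) pass@1 values, paired by hunk order in the diff above.
# Assumption: the n-th removed value corresponds to the n-th added value.
UPDATES = [
    (23.64, 23.79),
    (21.75, 22.56),
    (71.59, 74.90),
    (42.80, 43.00),
    (75.76, 77.65),
    (47.61, 54.27),
    (29.19, 30.58),
    (46.89, 40.69),
    (31.71, 38.41),
    (51.48, 47.23),
]


def deltas(pairs):
    """Return new minus old for each pair, rounded to two decimals."""
    return [round(new - old, 2) for old, new in pairs]


if __name__ == "__main__":
    for i, ((old, new), d) in enumerate(zip(UPDATES, deltas(UPDATES)), start=1):
        print(f"hunk {i}: {old:.2f} -> {new:.2f} ({d:+.2f})")
```

Under this pairing, most reported values go up, while the eighth and tenth entries go down.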