lewtun HF staff committed on
Commit
8adbba9
1 Parent(s): a75ac21

Add evaluation results on the squad_v2 config of squad_v2

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the squad_v2 config of the [squad_v2](https://huggingface.co/datasets/squad_v2) dataset by @sjrlee, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-staging-eval-project-squad_v2-e06b4410-11855587).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=squad_v2).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=squad_v2).

Files changed (1)
  1. README.md +20 -0
README.md CHANGED
@@ -6,6 +6,26 @@ license: mit
 thumbnail: https://thumb.tildacdn.com/tild3433-3637-4830-a533-353833613061/-/resize/720x/-/format/webp/germanquad.jpg
 tags:
 - exbert
+model-index:
+- name: deepset/xlm-roberta-base-squad2-distilled
+  results:
+  - task:
+      type: question-answering
+      name: Question Answering
+    dataset:
+      name: squad_v2
+      type: squad_v2
+      config: squad_v2
+      split: validation
+    metrics:
+    - name: Exact Match
+      type: exact_match
+      value: 75.2485
+      verified: true
+    - name: F1
+      type: f1
+      value: 78.3094
+      verified: true
 ---
 
 ![bert_image](https://thumb.tildacdn.com/tild3433-3637-4830-a533-353833613061/-/resize/720x/-/format/webp/germanquad.jpg)
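
For readers unfamiliar with the two metrics recorded above, the following is a minimal sketch of how SQuAD-style Exact Match and F1 are usually computed for a single prediction/reference pair. It is an illustration only: the function names `normalize`, `exact_match`, and `f1` are hypothetical, and the evaluator that actually produced the values in this commit is not shown in the source.

```python
import re
import string
from collections import Counter


def normalize(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, reference: str) -> float:
    """1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize(prediction) == normalize(reference))


def f1(prediction: str, reference: str) -> float:
    """Token-level F1 between the normalized prediction and reference."""
    pred_tokens = normalize(prediction).split()
    ref_tokens = normalize(reference).split()
    # Unanswerable questions (as in squad_v2): an empty answer only
    # matches another empty answer.
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most as often
    # as it appears on both sides.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)
```

Dataset-level scores such as those in the `model-index` block are typically the mean of these per-example values over the validation split, expressed as percentages; when a question has several reference answers, the best score across references is usually taken.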