Update README.md
README.md
CHANGED
@@ -10,10 +10,14 @@ metrics:
 - accuracy
 pipeline_tag: text-classification
 widget:
-- text: "Would you like to join a major manufacturing company?"
 - text: "You must be proficient in Excel."
+- text: "Would you like to join a major manufacturing company?"
 ---
 
+_Nesta, the UK's innovation agency, has been scraping online job adverts since 2021 and have been building algorithms to extract and structure information as part of the [Open Jobs Observatory](https://www.nesta.org.uk/project/open-jobs-observatory/) project._
+
+_Although we are unable to share the raw data openly, we aim to open source **our models, algorithms and tools** so that anyone can use them for their own research and analysis._
+
 ## 🖊️ Model description
 
 This model is a fine-tuned version of [jjzha/jobbert-base-cased](https://huggingface.co/jjzha/jobbert-base-cased). JobBERT is a continuously pre-trained bert-base-cased checkpoint on ~3.2M sentences from job postings.
@@ -31,6 +35,8 @@ It achieves the following results on a held out test set 147 sentences:
 | not company description | 0.930693 |0.959184|0.944724|98|
 | company description | 0.913043 |0.857143|0.884211|49|
 
+The code for training the model is in our [ojd_daps_language_models repo](https://github.com/nestauk/ojd_daps_language_models), a central repository for fine-tuning transformer models on our database of scraped job adverts.
+
 ## 🖨️ Use
 
 To use the model: