nlpie
/

bio-distilbert-cased

Inference Endpoints

Model card Files Files and versions Community

omidrohanian commited on Nov 4, 2022

Commit

8fde8b1

•

1 Parent(s): 0a27449

Update README.md

Files changed (1) hide show

README.md +3 -3

README.md CHANGED Viewed

@@ -1,12 +1,12 @@
 # Model Description
-BioDistilBERT-cased is the result of training the [DistilBERT-cased](https://huggingface.co/distilbert-base-cased?text=The+goal+of+life+is+%5BMASK%5D.) model in a continual learning fashion for 200k training steps using a total batch size of 192 on the PubMed dataset.
 # Initialisation
-We initialise our model with the pre-trained checkpoints of the [DistilBERT-cased](https://huggingface.co/distilbert-base-cased?text=The+goal+of+life+is+%5BMASK%5D.) model available on the Huggingface.
 # Architecture
-In this model, the size of the hidden dimension and the embedding layer are both set to 768. The vocabulary size is 28996. The number of transformer layers is 6 and the expansion rate of the feed-forward layer is 4. Overall this model has around 65 million parameters.
 # Citation
 If you use this model, please consider citing the following paper:

 # Model Description
+BioDistilBERT-cased was developed by training the [DistilBERT-cased](https://huggingface.co/distilbert-base-cased?text=The+goal+of+life+is+%5BMASK%5D.) model in a continual learning fashion for 200k training steps using a total batch size of 192 on the PubMed dataset.
 # Initialisation
+We initialise our model with the pre-trained checkpoints of the [DistilBERT-cased](https://huggingface.co/distilbert-base-cased?text=The+goal+of+life+is+%5BMASK%5D.) model available on Huggingface.
 # Architecture
+In this model, the size of the hidden dimension and the embedding layer are both set to 768. The vocabulary size is 28996. The number of transformer layers is 6 and the expansion rate of the feed-forward layer is 4. Overall, this model has around 65 million parameters.
 # Citation
 If you use this model, please consider citing the following paper: