File size: 2,376 Bytes
55be50e 0f3b1d5 55be50e c320c58 55be50e 58c725f 55be50e 6a8f6eb 0f3b1d5 55be50e |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 |
---
license: apache-2.0
base_model: distilbert-base-uncased
tags:
- generated_from_keras_callback
model-index:
- name: hfnlpmodels/eos_prediction_distilbert_1
results: []
---
<!-- This model card has been generated automatically according to the information Keras had access to. You should
probably proofread and complete it, then remove this comment. -->
# hfnlpmodels/eos_prediction_distilbert_1
This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on an unknown dataset.
It achieves the following results on the evaluation set:
- Train Loss: 0.0879
- Train Accuracy: 0.9714
- Validation Loss: 0.2180
- Validation Accuracy: 0.9305
- Epoch: 2
## Model description
Predicts (or should predict) whether a given sentence is complete. Trained on sentences that were randomly truncate as '0' labels, hence some sentences which were grammatically correct may have been labelled as incomplete. Overall accuracy near 0.9 means that this was most likely a small factor.
## Intended uses & limitations
Found better performance on shorter sentences.
## Training and evaluation data
More information needed
## Training procedure
### Training hyperparameters
The following hyperparameters were used during training:
- optimizer: {'name': 'Adam', 'weight_decay': None, 'clipnorm': None, 'global_clipnorm': None, 'clipvalue': None, 'use_ema': False, 'ema_momentum': 0.99, 'ema_overwrite_frequency': None, 'jit_compile': True, 'is_legacy_optimizer': False, 'learning_rate': {'module': 'keras.optimizers.schedules', 'class_name': 'PolynomialDecay', 'config': {'initial_learning_rate': 2e-05, 'decay_steps': 4165, 'end_learning_rate': 0.0, 'power': 1.0, 'cycle': False, 'name': None}, 'registered_name': None}, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-08, 'amsgrad': False}
- training_precision: float32
### Training results
| Train Loss | Train Accuracy | Validation Loss | Validation Accuracy | Epoch |
|:----------:|:--------------:|:---------------:|:-------------------:|:-----:|
| 0.2709 | 0.8942 | 0.2015 | 0.9211 | 0 |
| 0.1421 | 0.9505 | 0.2055 | 0.9318 | 1 |
| 0.0879 | 0.9714 | 0.2180 | 0.9305 | 2 |
### Framework versions
- Transformers 4.38.2
- TensorFlow 2.15.0
- Tokenizers 0.15.2
|