This model is a fine-tuned version of openai-community/gpt2 on the mteb/amazon_reviews_multi dataset.
It is the result of the blog post "QLoRA".
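Its results on the evaluation set are listed per epoch in the table below; after the third and final epoch it reaches a validation loss of 0.8918 and an accuracy of 0.6168.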
This model classifies reviews written in English.
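A minimal sketch of how the classifier could be called through the transformers `pipeline` API. Several things here are assumptions, not facts from this card: `model_id` is a placeholder for the model's Hub id, which is not given here; the repository is assumed to contain weights that `pipeline` can load directly; and the mapping from the default `LABEL_k` output to `k + 1` stars assumes the five star classes of the dataset are indexed from 0. Check the model's `id2label` config before relying on it.

```python
def label_to_stars(label: str) -> int:
    """Map a default pipeline label such as 'LABEL_3' to a star rating.

    Assumption: class index k corresponds to k + 1 stars.
    """
    index = int(label.rsplit("_", 1)[-1])
    if not 0 <= index <= 4:
        raise ValueError(f"unexpected label: {label}")
    return index + 1


def classify_review(text: str, model_id: str) -> dict:
    """Classify one English review; `model_id` is a placeholder Hub id."""
    # Imported lazily so label_to_stars works without transformers installed.
    from transformers import pipeline

    classifier = pipeline("text-classification", model=model_id)
    prediction = classifier(text)[0]  # a dict with "label" and "score" keys
    return {
        "stars": label_to_stars(prediction["label"]),
        "score": prediction["score"],
    }
```

Calling `classify_review("Works as described.", model_id)` with the real Hub id would return the predicted star rating together with the model's confidence score.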
The hyperparameters used during training are not listed here. The training results per epoch were:
| Training Loss | Epoch | Step | Validation Loss | Accuracy |
|---|---|---|---|---|
| 2.088 | 1.0 | 893 | 0.9780 | 0.5848 |
| 0.9588 | 2.0 | 1786 | 0.8940 | 0.6156 |
| 0.9147 | 3.0 | 2679 | 0.8918 | 0.6168 |
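To make the trend in the table easier to read, the short sketch below recomputes the steps per epoch and the epoch-to-epoch change in accuracy and validation loss. It uses only the numbers from the table; the variable names are illustrative.

```python
# Rows copied from the training results table:
# (epoch, step, validation loss, accuracy).
rows = [
    (1, 893, 0.9780, 0.5848),
    (2, 1786, 0.8940, 0.6156),
    (3, 2679, 0.8918, 0.6168),
]

# Optimizer steps taken in one epoch, derived from the first row.
steps_per_epoch = rows[0][1] // rows[0][0]

# Change in accuracy and validation loss from each epoch to the next.
deltas = [
    (curr[0], round(curr[3] - prev[3], 4), round(curr[2] - prev[2], 4))
    for prev, curr in zip(rows, rows[1:])
]

print(f"steps per epoch: {steps_per_epoch}")
for epoch, acc_gain, loss_change in deltas:
    print(f"epoch {epoch}: accuracy {acc_gain:+.4f}, validation loss {loss_change:+.4f}")
```

Most of the gain comes in the second epoch (accuracy up by 0.0308); the third epoch adds only 0.0012, which suggests the run was close to converging.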
Base model: openai-community/gpt2