Edit model card

fnet-base-finetuned-stsb

This model is a fine-tuned version of google/fnet-base on the GLUE STSB dataset. It achieves the following results on the evaluation set:

  • Loss: 0.7894
  • Pearson: 0.8256
  • Spearmanr: 0.8219
  • Combined Score: 0.8238

The model was fine-tuned to compare google/fnet-base as introduced in this paper against bert-base-cased.

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

This model is trained using the run_glue script. The following command was used:

#!/usr/bin/bash

python ../run_glue.py \\n  --model_name_or_path google/fnet-base \\n  --task_name stsb \\n  --do_train \\n  --do_eval \\n  --max_seq_length 512 \\n  --per_device_train_batch_size 16 \\n  --learning_rate 2e-5 \\n  --num_train_epochs 3 \\n  --output_dir fnet-base-finetuned-stsb \\n  --push_to_hub \\n  --hub_strategy all_checkpoints \\n  --logging_strategy epoch \\n  --save_strategy epoch \\n  --evaluation_strategy epoch \\n```

### Training hyperparameters

The following hyperparameters were used during training:
- learning_rate: 2e-05
- train_batch_size: 16
- eval_batch_size: 8
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 3.0

### Training results

| Training Loss | Epoch | Step | Combined Score | Validation Loss | Pearson | Spearmanr |
|:-------------:|:-----:|:----:|:--------------:|:---------------:|:-------:|:---------:|
| 1.5473        | 1.0   | 360  | 0.8120         | 0.7751          | 0.8115  | 0.8125    |
| 0.6954        | 2.0   | 720  | 0.8145         | 0.8717          | 0.8160  | 0.8130    |
| 0.4828        | 3.0   | 1080 | 0.8238         | 0.7894          | 0.8256  | 0.8219    |


### Framework versions

- Transformers 4.11.0.dev0
- Pytorch 1.9.0
- Datasets 1.12.1
- Tokenizers 0.10.3
Downloads last month
18
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Dataset used to train gchhablani/fnet-base-finetuned-stsb

Evaluation results