B2_1000_1e-5_hp-mehrdad / all_results.json
lnxdx's picture
End of training
517e83c
raw
history blame
235 Bytes
{
"epoch": 6.25,
"total_flos": 4.798362622332561e+18,
"train_loss": 0.7318317260742188,
"train_runtime": 4802.1983,
"train_samples": 2554,
"train_samples_per_second": 3.332,
"train_steps_per_second": 0.208
}