train_qqp_1754652135 / train_results.json
Commit 22d2928 (verified) by rbelanec: End of training
{
    "epoch": 10.0,
    "num_input_tokens_seen": 250787112,
    "total_flos": 1.1292830305610564e+19,
    "train_loss": 0.25612748204947167,
    "train_runtime": 332664.6174,
    "train_samples_per_second": 9.844,
    "train_steps_per_second": 2.461
}
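
The raw fields above are easier to read once converted into derived quantities. The following is a minimal sketch, not part of the original run: it assumes that `train_runtime` is in seconds and that the two throughput fields are averages over the whole run. The helper name `derive_metrics` and the output keys are hypothetical, and the results are approximations because the throughput values are rounded to three decimals.

```python
import json

# Metrics copied verbatim from train_results.json above.
RESULTS_JSON = """
{
    "epoch": 10.0,
    "num_input_tokens_seen": 250787112,
    "total_flos": 1.1292830305610564e+19,
    "train_loss": 0.25612748204947167,
    "train_runtime": 332664.6174,
    "train_samples_per_second": 9.844,
    "train_steps_per_second": 2.461
}
"""


def derive_metrics(results: dict) -> dict:
    """Compute approximate summary figures from the reported metrics.

    Assumption: train_runtime is in seconds and the per-second rates are
    run-wide averages, so rate * runtime approximates the totals.
    """
    runtime = results["train_runtime"]
    total_steps = results["train_steps_per_second"] * runtime
    total_samples = results["train_samples_per_second"] * runtime
    return {
        "runtime_hours": runtime / 3600,
        # Samples consumed per optimizer step (effective batch size).
        "samples_per_step": results["train_samples_per_second"]
        / results["train_steps_per_second"],
        "approx_total_steps": total_steps,
        "approx_samples_per_epoch": total_samples / results["epoch"],
        "tokens_per_second": results["num_input_tokens_seen"] / runtime,
        "flos_per_second": results["total_flos"] / runtime,
    }


derived = derive_metrics(json.loads(RESULTS_JSON))
for name, value in derived.items():
    print(f"{name}: {value:,.2f}")
```

The ratio of samples per second to steps per second is the most robust of these figures, since the runtime cancels out; the step and sample totals inherit the rounding of the reported rates.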