train_wsc_42_1760620823 / train_results.json
Commit ddb2796 (verified): End of training
{
    "epoch": 30.0,
    "num_input_tokens_seen": 1308280,
    "total_flos": 5.891125709930496e+16,
    "train_loss": 0.48614711109940356,
    "train_runtime": 381.9899,
    "train_samples_per_second": 34.791,
    "train_steps_per_second": 8.718
}
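
The throughput fields above can be turned back into approximate totals, which is a quick sanity check on the run. The following is a minimal sketch, not part of the repository: the helper name `derive_totals` is hypothetical, and it assumes that `train_runtime` is in seconds and that the two `*_per_second` fields are totals divided by that runtime. The reported rates are rounded, so the derived values are estimates only.

```python
# Values copied from train_results.json above.
RESULTS = {
    "epoch": 30.0,
    "num_input_tokens_seen": 1308280,
    "total_flos": 5.891125709930496e+16,
    "train_loss": 0.48614711109940356,
    "train_runtime": 381.9899,
    "train_samples_per_second": 34.791,
    "train_steps_per_second": 8.718,
}


def derive_totals(results: dict) -> dict:
    """Estimate run totals from the reported rates (hypothetical helper).

    Assumes train_runtime is in seconds and that each *_per_second field
    is the corresponding total divided by train_runtime.
    """
    runtime = results["train_runtime"]
    epochs = results["epoch"]
    total_steps = results["train_steps_per_second"] * runtime
    total_samples = results["train_samples_per_second"] * runtime
    return {
        "total_steps": total_steps,
        "total_samples": total_samples,
        "steps_per_epoch": total_steps / epochs,
        "samples_per_epoch": total_samples / epochs,
        # Samples consumed per optimizer step; this folds in any gradient
        # accumulation or multi-device batching, which the file does not report.
        "samples_per_step": total_samples / total_steps,
        "tokens_per_second": results["num_input_tokens_seen"] / runtime,
    }


if __name__ == "__main__":
    for name, value in derive_totals(RESULTS).items():
        print(f"{name}: {value:.2f}")
```

Under these assumptions the figures are internally consistent: roughly 3330 optimizer steps over 30 epochs, about 111 steps and 443 samples per epoch, and close to 4 samples per step. The file itself does not state the batch size or dataset size, so treat these as inferred values rather than recorded ones.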