MMR1-3B-SFT / all_results.json
Sicong's picture
Add files using upload-large-folder tool
71b17e3 verified
raw
history blame contribute delete
224 Bytes
{
"epoch": 4.99952614120992,
"total_flos": 3.246606278526417e+20,
"train_loss": 0.08344055705064467,
"train_runtime": 26260.8831,
"train_samples_per_second": 308.537,
"train_steps_per_second": 0.301
}