Reasoning Analysis
When Does Reasoning Matter? A Controlled Study of Reasoning's Contribution to Model Performance • Paper • 2509.22193 • Published Sep 26 • 37
When-Does-Reasoning-Matter/general-reasoning-ift-pairs • Viewer • Updated Sep 29 • 2.97M • 168 • 3
When-Does-Reasoning-Matter/math-reasoning-ift-pairs • Viewer • Updated 16 days ago • 458k • 189 • 7
MLM vs CLM
Should We Still Pretrain Encoders with Masked Language Modeling? • Paper • 2507.00994 • Published Jul 1 • 78
MLMvsCLM/610m-mlm40-42k-10000 • Feature Extraction • Updated Jul 4 • 3
MLMvsCLM/610m-clm-40k-mlm20-42k • Feature Extraction • Updated Jul 4 • 7
MLMvsCLM/1b-mlm40-42k • Feature Extraction • Updated Jul 4 • 9