3 7 4

charliezhang

Clockz

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

allenai/Olmo-3.1-7B-RL-Zero-Math

new activity 1 day ago

Interplay-LM-Reasoning/extrapolation_midtrain:Add pipeline tag, GitHub link, and improved model description

new activity 1 day ago

Interplay-LM-Reasoning/extrapolation_rl:Improve model card: Add pipeline tag and GitHub link

View all activity

Organizations

liked a model 1 day ago

allenai/Olmo-3.1-7B-RL-Zero-Math

Text Generation • 528k • Updated 3 days ago • 61 • 7

New activity in Interplay-LM-Reasoning/extrapolation_midtrain 1 day ago

Add pipeline tag, GitHub link, and improved model description

#1 opened 2 days ago by

nielsr

New activity in Interplay-LM-Reasoning/extrapolation_rl 1 day ago

Improve model card: Add pipeline tag and GitHub link

#1 opened 2 days ago by

nielsr

updated 2 models 5 days ago

Interplay-LM-Reasoning/extrapolation_rl

Text Generation • Updated 1 day ago

Interplay-LM-Reasoning/extrapolation_midtrain

Text Generation • Updated 1 day ago

updated a dataset 5 days ago

Interplay-LM-Reasoning/context

Updated 5 days ago • 6

published 2 datasets 5 days ago

Interplay-LM-Reasoning/context

Updated 5 days ago • 6

Interplay-LM-Reasoning/extrapolation

Updated 5 days ago • 4

published 3 models 5 days ago

authored a paper 6 days ago

On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models

Paper • 2512.07783 • Published 7 days ago • 31

upvoted a paper 6 days ago

On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models

Paper • 2512.07783 • Published 7 days ago • 31

upvoted a paper 10 days ago

DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle

Paper • 2512.04324 • Published 11 days ago • 147

updated a model 24 days ago

goodevening/composition-10B-op-cpt-rl_fixed

Updated 24 days ago • 7

published a model 29 days ago

goodevening/composition-10B-op-cpt-rl_fixed

Updated 24 days ago • 7

upvoted 2 papers about 2 months ago

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Paper • 2510.25726 • Published Oct 29 • 45

Omni-Reward: Towards Generalist Omni-Modal Reward Modeling with Free-Form Preferences

Paper • 2510.23451 • Published Oct 27 • 26

updated a dataset 3 months ago

goodevening/context-10B

Viewer • Updated Sep 30 • 21M • 401

published a dataset 3 months ago