Tan's picture

3 1

Tan

RiccardTo

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies

upvoted a paper 7 months ago

The Climb Carves Wisdom Deeper Than the Summit: On the Noisy Rewards in Learning to Reason

upvoted a paper 11 months ago

Autonomy-of-Experts Models

View all activity

Organizations

None yet

upvoted a paper 4 days ago

Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies

Paper • 2512.19673 • Published 5 days ago • 59

upvoted a paper 7 months ago

The Climb Carves Wisdom Deeper Than the Summit: On the Noisy Rewards in Learning to Reason

Paper • 2505.22653 • Published May 28 • 66

upvoted a paper 11 months ago

Autonomy-of-Experts Models

Paper • 2501.13074 • Published Jan 22 • 44

liked a model over 1 year ago

Ori/llama-2-13b-peft-strategyqa-with-ret-at-1

Updated Sep 22, 2023 • 7 • 1