31

Kelvin

kh

AI & ML interests

None yet

Recent Activity

upvoted a paper about 8 hours ago

Alleviating Sparse Rewards by Modeling Step-Wise and Long-Term Sampling Effects in Flow-Based GRPO

upvoted a paper 8 days ago

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

upvoted a paper 15 days ago

Agent-as-a-Judge

Organizations

None yet

kh's datasets

None public yet