arxiv:2509.25300
Xiaohang Yu
xhyumiracle
ยท
AI & ML interests
Agentic RL
Recent Activity
authored
a paper
about 22 hours ago
Scaling Behaviors of LLM Reinforcement Learning Post-Training: An Empirical Study in Mathematical Reasoning
upvoted
a
paper
about 22 hours ago
Scaling Behaviors of LLM Reinforcement Learning Post-Training: An Empirical Study in Mathematical Reasoning
authored
a paper
4 months ago
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
Organizations
None yet