arXiv:2510.03222
Guanhua Huang
Carlanlarkk
AI & ML interests
None yet
Recent Activity
authored
a paper
about 1 month ago
Low-probability Tokens Sustain Exploration in Reinforcement Learning
with Verifiable Reward
upvoted
a
paper
about 1 month ago
Cogito, Ergo Ludo: An Agent that Learns to Play by Reasoning and
Planning
upvoted
a
paper
about 1 month ago
Low-probability Tokens Sustain Exploration in Reinforcement Learning
with Verifiable Reward
Organizations
None yet