Ximing Lu
Ximing
AI & ML interests
None yet
Recent Activity
submitted
a paper
3 days ago
Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text
authored
a paper
27 days ago
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization