Achieving Olympia-Level Geometry Large Language Model Agent via Complexity Boosting Reinforcement Learning Paper • 2512.10534 • Published 4 days ago • 30
OPV: Outcome-based Process Verifier for Efficient Long Chain-of-Thought Verification Paper • 2512.10756 • Published 4 days ago • 31
Long-horizon Reasoning Agent for Olympiad-Level Mathematical Problem Solving Paper • 2512.10739 • Published 4 days ago • 42
NavDP: Learning Sim-to-Real Navigation Diffusion Policy with Privileged Information Guidance Paper • 2505.08712 • Published May 13 • 6