Retrieval-Infused Reasoning Sandbox: A Benchmark for Decoupling Retrieval and Reasoning Capabilities Paper • 2601.21937 • Published 14 days ago • 19
OAgents: An Empirical Study of Building Effective Agents Paper • 2506.15741 • Published Jun 17, 2025 • 35
ACADREASON: Exploring the Limits of Reasoning Models with Academic Research Problems Paper • 2510.11652 • Published Oct 13, 2025 • 30