SVRL2/verl-scalable-1025_general-reasoner-deepscaler_general-reasoner-mid-fineweb-webinst-1014-Qwen3-4 Updated Oct 28, 2025
SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning Paper • 2509.02479 • Published Sep 2, 2025 • 84
BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent Paper • 2508.06600 • Published Aug 8, 2025 • 41
SWE-Perf: Can Language Models Optimize Code Performance on Real-World Repositories? Paper • 2507.12415 • Published Jul 16, 2025 • 43