DeepScientist: Advancing Frontier-Pushing Scientific Findings Progressively • Paper • 2509.26603 • Published Sep 30, 2025
REST: Stress Testing Large Reasoning Models by Asking Multiple Problems at Once • Paper • 2507.10541 • Published Jul 14, 2025