Runtime error 25 Gaia2 Agents Evaluation Leaderboard ๐ 25 Display and submit model evaluation results on a leaderboard
moonshotai/Kimi-K2-Instruct-0905 Text Generation โข 1T โข Updated Nov 7, 2025 โข 20.6k โข โข 657
MedReseacher-R1: Expert-Level Medical Deep Researcher via A Knowledge-Informed Trajectory Synthesis Framework Paper โข 2508.14880 โข Published Aug 20, 2025 โข 15