Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models Paper • 2508.10751 • Published Aug 14 • 28
YuLan-Mini Resources Collection Pre-Training & post-training resources for YuLan-Mini • 29 items • Updated May 7 • 2
WebThinker: Empowering Large Reasoning Models with Deep Research Capability Paper • 2504.21776 • Published Apr 30 • 59
YuLan-Mini Resources Collection Pre-Training & post-training resources for YuLan-Mini • 29 items • Updated May 7 • 2