FANformer: Improving Large Language Models Through Effective Periodicity Modeling Paper • 2502.21309 • Published Feb 28 • 1
RL-PLUS: Countering Capability Boundary Collapse of LLMs in Reinforcement Learning with Hybrid-policy Optimization Paper • 2508.00222 • Published Jul 31 • 6
Detecting Data Contamination from Reinforcement Learning Post-training for Large Language Models Paper • 2510.09259 • Published Oct 10 • 2
Detecting Data Contamination from Reinforcement Learning Post-training for Large Language Models Paper • 2510.09259 • Published Oct 10 • 2 • 2
Detecting Data Contamination from Reinforcement Learning Post-training for Large Language Models Paper • 2510.09259 • Published Oct 10 • 2