SR - an ItsRahulJyala Collection

SR

updated Aug 9

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7 • 178

R-Zero: Self-Evolving Reasoning LLM from Zero Data

Paper • 2508.05004 • Published Aug 7 • 127