view article Article Reinforcement Learning for Large Language Models: Beyond the Agent Paradigm Mar 19 • 8