Penalizing Infeasible Actions and Reward Scaling in Reinforcement Learning with Offline Data Paper • 2507.08761 • Published Jul 11 • 1
Agent Lightning: Train ANY AI Agents with Reinforcement Learning Paper • 2508.03680 • Published Aug 5 • 119
ReflAct: World-Grounded Decision Making in LLM Agents via Goal-State Reflection Paper • 2505.15182 • Published May 21 • 6