Variance Reduced Policy Gradient Method for Multi-Objective Reinforcement Learning Paper • 2508.10608 • Published Aug 14 • 1