No Prompt Left Behind: Exploiting Zero-Variance Prompts in LLM Reinforcement Learning via Entropy-Guided Advantage Shaping Paper • 2509.21880 • Published Sep 26, 2025