SPARK: Stepwise Process-Aware Rewards for Reference-Free Reinforcement Learning Paper โข 2512.03244 โข Published 8 days ago โข 14