Absolute Zero: Reinforced Self-play Reasoning with Zero Data Paper β’ 2505.03335 β’ Published May 6 β’ 186 β’ 9