Critique to Verify: Accurate and Honest Test-Time Scaling with RL-Trained Verifiers (https://arxiv.org/abs/2509.23152)
Zhicheng YANG
yangzhch6
AI & ML interests
reasoning with LLMs
Recent Activity
updated
a dataset
2 days ago
yangzhch6/Accordion-Thinking-Synthetic-Data
published
a dataset
2 days ago
yangzhch6/Accordion-Thinking-Synthetic-Data
Organizations
None yet