-
Dr. Zero: Self-Evolving Search Agents without Training Data
Paper • 2601.07055 • Published • 19 -
Self-Evolved Preference Optimization for Enhancing Mathematical Reasoning in Small Language Models
Paper • 2503.04813 • Published • 2 -
Absolute Zero: Reinforced Self-play Reasoning with Zero Data
Paper • 2505.03335 • Published • 189
tran minh thang
thangtm
·
AI & ML interests
None yet
Recent Activity
updated
a collection
2 days ago
data
updated
a collection
3 days ago
zero-data
upvoted
a
paper
3 days ago
Absolute Zero: Reinforced Self-play Reasoning with Zero Data
Organizations
None yet