EVOL-RL - a yujunzhou Collection


EVOL-RL

Updated Oct 3

Models trained with EVOL-RL

yujunzhou/EVOL-RL-MATH-Train-Qwen3-4B-Base
4B • Updated Sep 13 • 5

yujunzhou/EVOL-RL-MATH-500-Qwen3-4B-Base
4B • Updated Sep 13 • 16

yujunzhou/EVOL-RL-AIME24-Qwen3-4B-Base
4B • Updated Aug 17 • 5

yujunzhou/EVOL-RL-MATH-Train-Qwen3-8B-Base
8B • Updated Sep 18 • 4

yujunzhou/EVOL-RL-MATH-500-Qwen3-8B-Base
8B • Updated Aug 29 • 1

yujunzhou/EVOL-RL-AIME24-Qwen3-8B-Base
8B • Updated Aug 26 • 2

Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation
Paper • 2509.15194 • Published Sep 18 • 33
