yujunzhou's Collections
EVOL-RL

updated Oct 3

Models trained with EVOL-RL.

  • yujunzhou/EVOL-RL-MATH-Train-Qwen3-4B-Base

    4B • Updated Sep 13 • 5

  • yujunzhou/EVOL-RL-MATH-500-Qwen3-4B-Base

    4B • Updated Sep 13 • 16

  • yujunzhou/EVOL-RL-AIME24-Qwen3-4B-Base

    4B • Updated Aug 17 • 5

  • yujunzhou/EVOL-RL-MATH-Train-Qwen3-8B-Base

    8B • Updated Sep 18 • 4

  • yujunzhou/EVOL-RL-MATH-500-Qwen3-8B-Base

    8B • Updated Aug 29 • 1

  • yujunzhou/EVOL-RL-AIME24-Qwen3-8B-Base

    8B • Updated Aug 26 • 2

  • Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation

    Paper • 2509.15194 • Published Sep 18 • 33