Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
hamishivi 's Collections
RLVE
Large-Scale Data Selection for Instruction Tuning
TESS 2
Tulu 2 Llama 3 Update
7b tulu 2.5
Tulu V2 Suite
Tulu V1 Suite
LM Preference Datasets

RLVE

updated 6 days ago

Models for "RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments" - https://arxiv.org/abs/2511.07317

Upvote
4

  • RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments

    Paper • 2511.07317 • Published 7 days ago • 12

  • hamishivi/OpenThinker3-1.5B-RLVE

    Text Generation • 2B • Updated 7 days ago • 55 • 1

  • hamishivi/Nemotron-Research-Reasoning-Qwen-1.5B-v2-RLVE

    Text Generation • 2B • Updated 7 days ago • 42 • 1
Upvote
4
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs