SVRL2 (svrl2)

SivilTaram

authored a paper 3 months ago

Diffusion Language Models are Super Data Learners

Paper • 2511.03276 • Published Nov 5, 2025 • 129

MrLight

updated a dataset 3 months ago

SVRL2/general-reasoner-v2-data-fineweb-megamath-1014-top40

Viewer • Updated Nov 4, 2025 • 1.1M • 12

MrLight

published a dataset 3 months ago

SVRL2/general-reasoner-v2-data-fineweb-megamath-1014-top40

Viewer • Updated Nov 4, 2025 • 1.1M • 12

MrLight

updated a dataset 3 months ago

SVRL2/general-reasoner-v2-data-fineweb-megamath-1014

Viewer • Updated Oct 29, 2025 • 2.8M • 4

MrLight

published a dataset 3 months ago

SVRL2/general-reasoner-v2-data-fineweb-megamath-1014

Viewer • Updated Oct 29, 2025 • 2.8M • 4

MrLight

updated a model 3 months ago

SVRL2/general-reasoner-v2-data-fineweb-megamath-1014

Updated Oct 29, 2025

MrLight

published a model 3 months ago

SVRL2/general-reasoner-v2-data-fineweb-megamath-1014

Updated Oct 29, 2025

MrLight

updated a model 3 months ago

SVRL2/general-sharding-output-megamath-1014

Updated Oct 29, 2025

MrLight

updated a dataset 3 months ago

SVRL2/general-reasoner-v2-data-fineweb-megamath

Viewer • Updated Oct 29, 2025 • 2.2M • 13

MrLight

published a dataset 3 months ago

SVRL2/general-reasoner-v2-data-fineweb-megamath

Viewer • Updated Oct 29, 2025 • 2.2M • 13

MrLight

published a model 3 months ago

SVRL2/general-sharding-output-megamath-1014

Updated Oct 29, 2025

MrLight

published a dataset 3 months ago

SVRL2/general-sharding-output-megamath-1014

Updated Oct 29, 2025 • 1

MrLight

updated 2 models 3 months ago

SVRL2/verl-scalable-1025_general-reasoner-deepscaler_Qwen3-4B-Base

Updated Oct 28, 2025

SVRL2/verl-scalable-1025_general-reasoner-deepscaler_general-reasoner-mid-fineweb-webinst-1014-Qwen3-4

Updated Oct 28, 2025

MrLight

published 2 models 3 months ago

SVRL2/verl-scalable-1025_general-reasoner-deepscaler_Qwen3-4B-Base

Updated Oct 28, 2025

SVRL2/verl-scalable-1025_general-reasoner-deepscaler_general-reasoner-mid-fineweb-webinst-1014-Qwen3-4

Updated Oct 28, 2025

SivilTaram

authored a paper 5 months ago

SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

Paper • 2509.02479 • Published Sep 2, 2025 • 84

MrLight

authored a paper 6 months ago

BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent

Paper • 2508.06600 • Published Aug 8, 2025 • 41

SivilTaram

authored 2 papers 7 months ago

SWE-Perf: Can Language Models Optimize Code Performance on Real-World Repositories?

Paper • 2507.12415 • Published Jul 16, 2025 • 43

First Return, Entropy-Eliciting Explore

Paper • 2507.07017 • Published Jul 9, 2025 • 24

AI & ML interests

Team members 2

SVRL2's activity