Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

agentica-org/DeepScaleR-Preview-Dataset

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

132

Full-text search

Active filters: agentica-org/DeepScaleR-Preview-Dataset

hkust-nlp/Qwen-2.5-7B-Verifier-general-verifier

Reinforcement Learning • 8B • Updated May 28, 2025 • 3

mradermacher/E1-Math-7B-i1-GGUF

8B • Updated May 25, 2025 • 63

TingchenFu/coldrl_3k_qwen-2.5-1.5b_04232202

Text Generation • 2B • Updated May 26, 2025 • 2

TingchenFu/coldrl_3k_qwen-2.5-7b_04240151

Text Generation • 8B • Updated May 26, 2025 • 2

TingchenFu/coldrl_3k_qwen-2.5-math-1.5b_04201604

Text Generation • 2B • Updated May 26, 2025 • 4

TingchenFu/coldrl_qwen-2.5-math-7b_04252230

Text Generation • 8B • Updated May 26, 2025 • 4

TingchenFu/sft_8k_qwen-2.5-1.5b_05022300

Text Generation • 2B • Updated May 26, 2025 • 2

TingchenFu/sft_8k_qwen-2.5-7b_05021953

Text Generation • 8B • Updated May 26, 2025 • 6

TingchenFu/sft_8k_qwen-2.5-math-1.5b_05021751

Text Generation • 2B • Updated May 26, 2025 • 4

TingchenFu/sft_8k_qwen-2.5-math-7b_05021445

Text Generation • 8B • Updated May 26, 2025 • 4

TingchenFu/sftrl_7k_qwen-2.5-1.5b_05070032

Text Generation • 2B • Updated May 26, 2025 • 4

TingchenFu/sftrl_7k_qwen-2.5-math-1.5b_05052256

Text Generation • 2B • Updated May 26, 2025 • 3

TingchenFu/sftrl_7k_qwen-2.5-math-7b_05040001

Text Generation • 8B • Updated May 26, 2025 • 4

TingchenFu/sftrl_7k_qwen-2.5-7b_05042309

Text Generation • 8B • Updated May 26, 2025 • 4

Salesforce/E1-AceReason-14B

Text Generation • 15B • Updated Jun 1, 2025 • 14 • 12

mradermacher/E1-AceReason-14B-GGUF

15B • Updated Jun 1, 2025 • 27 • 2

mradermacher/E1-AceReason-14B-i1-GGUF

15B • Updated Jun 1, 2025 • 2.91k

sizzlebop/E1-AceReason-14B-Q8_0-GGUF

15B • Updated Jun 1, 2025 • 8 • 1

sizzlebop/AdaptThink-7B-delta0.05-Q8_0-GGUF

8B • Updated Jun 1, 2025 • 6

sizzlebop/AdaptThink-7B-delta0.05-IQ4_XS-GGUF

8B • Updated Jun 1, 2025 • 9

Khurram123/E1-Math-1.5B-Q4_K_M-GGUF

2B • Updated Jun 5, 2025 • 3

Xuerui2312/DeepSeek-R1-Distill-Qwen-7B-TRPA-DeepScaleR-verl0326

Text Generation • 8B • Updated Jun 20, 2025 • 18 • 1

hdong0/deepseek-Llama-8B-Open-R1-GRPO_deepscaler_1000steps_lr1e-6_kl1e-3_acc

Text Generation • 8B • Updated Jun 15, 2025 • 3

tensorblock/Vinnnf_Thinkless-1.5B-RL-DeepScaleR-GGUF

Text Generation • 2B • Updated Jul 9, 2025 • 117

hdong0/deepseek-Qwen2.5-1.5B-baseline-Open-R1-GRPO_deepscaler_mu_8

Text Generation • 2B • Updated Jul 4, 2025 • 3

hdong0/deepseek-Qwen2.5-1.5B-Open-R1-GRPO_deepscaler_mu_8

Text Generation • 2B • Updated Jul 4, 2025 • 1

hdong0/Qwen2.5-Math-1.5B-Open-R1-GRPO_deepscaler_mu_8_constant_lr

Text Generation • 2B • Updated Jul 7, 2025 • 3

hdong0/deepseek-Qwen-1.5B-Open-R1-GRPO_deepscaler_mu_8_constant_lr

Text Generation • 2B • Updated Jul 7, 2025 • 2

hdong0/Qwen2.5-Math-1.5B-baseline-Open-R1-GRPO_deepscaler_mu_8_constant_lr

Text Generation • 2B • Updated Jul 8, 2025 • 2

ZhenghaiXue/Qwen2.5-7B-SimpleTIR

Reinforcement Learning • 8B • Updated Jul 8, 2025 • 92 • 1