Models: 3,174

Active filters: reasoning

mradermacher/Diver-Retriever-4B-1020-GGUF

4B • Updated Oct 21 • 167

mradermacher/Qwen3-RA-TNG-1220-6B-i1-GGUF

6B • Updated Oct 22 • 231

faresfawzi/Qwen3-8B-SCRIBE

Text Generation • 8B • Updated 25 days ago • 58

nightmedia/Qwen3-4B-Thinking-2507-Esper3.1-qx86-hi-mlx

Text Generation • 1B • Updated Oct 21 • 14

nightmedia/Qwen3-RA-TNG-1809-6B-qx86-hi-mlx

Text Generation • 6B • Updated 29 days ago • 23

asdwvv/Smoothie-Qwen3-32B-Q4_K_M-GGUF

Text Generation • 33B • Updated Oct 22 • 31

mradermacher/Qwen3-6B-Almost-Human-XMEN-X4-X2-X1-Dare-e32-GGUF

6B • Updated Oct 22 • 92

mradermacher/Qwen3-6B-Almost-Human-XMEN-X4-X2-X1-Dare-e32-i1-GGUF

6B • Updated Oct 22 • 188

nightmedia/Qwen3-RA-b-TNG-320-6B-qx86-hi-mlx

Text Generation • 6B • Updated 29 days ago • 23

DavidAU/Qwen3-MOE-6Bx4-Almost-Human-XMEN-X3-X4-X2-X1-24B

Text Generation • 19B • Updated about 1 month ago • 9

AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_294_FlashRL_G4-L1024

Reinforcement Learning • 2B • Updated Oct 22 • 2

mradermacher/Qwen3-8B-SCRIBE-GGUF

8B • Updated 23 days ago • 208

AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_588_FlashRL_G4-L1024

Reinforcement Learning • 2B • Updated Oct 22 • 2

AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_882_FlashRL_G4-L1024

Reinforcement Learning • 2B • Updated Oct 22 • 1

AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_1176_FlashRL_G4-L1024

Reinforcement Learning • 2B • Updated Oct 22 • 6

mradermacher/Qwen3-6B-Almost-Human-XMEN-X3-X4-X2-X1-Dare-GGUF

6B • Updated about 1 month ago • 536 • 1

mradermacher/Qwen3-6B-Almost-Human-XMEN-X3-X4-X2-X1-Dare-Complex-GGUF

6B • Updated 30 days ago • 176

samhitha2601/llama3.2-3b-ppo

Reinforcement Learning • Updated about 1 month ago • 7

samhitha2601/llama3.2-3b-ppo-critic

Reinforcement Learning • Updated about 1 month ago • 5

mradermacher/Qwen3-MOE-6Bx4-Almost-Human-XMEN-X3-X4-X2-X1-24B-GGUF

19B • Updated 30 days ago • 626

remyxai/SpaceQwen3-VL-2B-Thinking

Image-Text-to-Text • 2B • Updated about 1 month ago • 44 • 2

ziadrone/airesupdated-v4

Text Generation • 4B • Updated about 1 month ago • 4

mradermacher/gpt-oss-20b-Esper3.1-GGUF

21B • Updated about 1 month ago • 788 • 1

AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_294_FlashRL_G4-L2048_new

Reinforcement Learning • 2B • Updated about 1 month ago • 499

mradermacher/gpt-oss-20b-Esper3.1-i1-GGUF

21B • Updated 30 days ago • 1.53k • 1

AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_588_FlashRL_G4-L2048_new

Reinforcement Learning • 2B • Updated 30 days ago • 360

mradermacher/Qwen3-6B-Almost-Human-XMEN-X3-X4-X2-X1-Dare-Complex-i1-GGUF

6B • Updated 30 days ago • 1.78k

AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_882_FlashRL_G4-L2048_new

Reinforcement Learning • 2B • Updated 30 days ago • 365

AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_1176_FlashRL_G4-L2048_new

Reinforcement Learning • 2B • Updated 30 days ago • 513

mradermacher/Almost-Human-X3-32bit-1839-6B-GGUF

6B • Updated 29 days ago • 695