DavidAU/Qwen3-MOE-6Bx4-Almost-Human-XMEN-X3-X4-X2-X1-24B Text Generation • 19B • Updated about 1 month ago • 9
AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_294_FlashRL_G4-L1024 Reinforcement Learning • 2B • Updated Oct 22 • 2
AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_588_FlashRL_G4-L1024 Reinforcement Learning • 2B • Updated Oct 22 • 2
AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_882_FlashRL_G4-L1024 Reinforcement Learning • 2B • Updated Oct 22 • 1
AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_1176_FlashRL_G4-L1024 Reinforcement Learning • 2B • Updated Oct 22 • 6
mradermacher/Qwen3-6B-Almost-Human-XMEN-X3-X4-X2-X1-Dare-GGUF 6B • Updated about 1 month ago • 536 • 1
AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_294_FlashRL_G4-L2048_new Reinforcement Learning • 2B • Updated about 1 month ago • 499
AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_588_FlashRL_G4-L2048_new Reinforcement Learning • 2B • Updated 30 days ago • 360
mradermacher/Qwen3-6B-Almost-Human-XMEN-X3-X4-X2-X1-Dare-Complex-i1-GGUF 6B • Updated 30 days ago • 1.78k
AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_882_FlashRL_G4-L2048_new Reinforcement Learning • 2B • Updated 30 days ago • 365
AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_1176_FlashRL_G4-L2048_new Reinforcement Learning • 2B • Updated 30 days ago • 513