AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_1176_FlashRL_G4-L2048_new Reinforcement Learning • 2B • Updated 25 days ago • 503
AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_882_FlashRL_G4-L2048_new Reinforcement Learning • 2B • Updated 25 days ago • 355
AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_588_FlashRL_G4-L2048_new Reinforcement Learning • 2B • Updated 25 days ago • 357
AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_294_FlashRL_G4-L2048_new Reinforcement Learning • 2B • Updated 25 days ago • 498
AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_1176_FlashRL_G4-L1024 Reinforcement Learning • 2B • Updated 27 days ago • 163
AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_882_FlashRL_G4-L1024 Reinforcement Learning • 2B • Updated 27 days ago • 25
AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_588_FlashRL_G4-L1024 Reinforcement Learning • 2B • Updated 27 days ago • 22
AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_294_FlashRL_G4-L1024 Reinforcement Learning • 2B • Updated 27 days ago • 13
AzalKhan/Qwen2.5-1.5B-Instruct_open-r1-DAPO-Math-17k-Processed_294 Reinforcement Learning • 2B • Updated Oct 10 • 21
AzalKhan/Qwen2.5-1.5B_open-r1-DAPO-Math-17k-Processed_882 Reinforcement Learning • 2B • Updated Oct 8 • 10