AzalKhan
/

Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_882_FlashRL_G4-L2048_new

Reinforcement Learning

text-generation

text-generation-inference

Model card Files Files and versions

Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_882_FlashRL_G4-L2048_new

6.19 GB

1 contributor

History: 4 commits

AzalKhan's picture

Upload folder using huggingface_hub

b83e337 verified 28 days ago