sroecker
/

Qwen-1.B-GRPO-gsm8k-1000

text-generation-inference

Model card Files Files and versions

Qwen-1.B-GRPO-gsm8k-1000

164 MB

1 contributor

History: 4 commits

sroecker's picture

Trained with Unsloth

40b1c5d verified 9 months ago