Qwen2.5-0.5B-GRPO / README.md
ZyKINvice's picture
Improve language tag (#1)
dae7332 verified
metadata
license: mit
datasets:
  - openai/gsm8k
language:
  - zho
  - eng
  - fra
  - spa
  - por
  - deu
  - ita
  - rus
  - jpn
  - kor
  - vie
  - tha
  - ara
base_model:
  - Qwen/Qwen2.5-0.5B-Instruct