Malaysian Reasoning
Collection
Full parameter post training using SFT warmup and GRPO.
•
10 items
•
Updated
•
1
Initial LoRA mesolitica/Malaysian-Qwen2.5-72B-Instruct on https://huggingface.co/datasets/mesolitica/Malaysian-Reasoning/commit/e1bb8a2141a1db351321d988687432d312495905 to introduce Malaysian reasoning.
This model been use to generate mesolitica/Malaysian-Reasoning by using few shots prompts.
Special thanks to https://www.sns.com.my and Nvidia for 8x H100 node!