Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
mesolitica
's Collections
Audio Language Model
Malaysian Reasoning
Malaysian Finetuned Instruct LoRA
Malaysian Speech-to-Text
Malaysian Text-to-Speech
Malaysian Translation
Malaysian pretraining dataset
Malaysian instruction dataset
MaLLaM 🌙
Malaysian CausalLM
Malaysian LLM2Vec
Malaysian Seq2Seq
Malaysian MaskLM
Malaysian Reasoning
updated
5 days ago
Full parameter post training using SFT warmup and GRPO.
Upvote
1
mesolitica/Malaysian-Qwen2.5-1.5B-Reasoning-SFT
2B
•
Updated
Jun 18
•
4
mesolitica/Malaysian-Qwen2.5-1.5B-Reasoning-GRPO
2B
•
Updated
Jun 18
•
5
mesolitica/Malaysian-Qwen2.5-7B-Reasoning-SFT
8B
•
Updated
Jun 18
•
60
•
1
mesolitica/Malaysian-Qwen2.5-7B-Dialect-Reasoning-GRPO
8B
•
Updated
Jun 4
•
1
•
3
mesolitica/Malaysian-Qwen2.5-14B-Reasoning-SFT
15B
•
Updated
Jun 18
•
4
mesolitica/Malaysian-Qwen2.5-14B-Reasoning-GRPO
15B
•
Updated
Jun 18
•
6
•
1
mesolitica/Malaysian-Qwen2.5-72B-Reasoning-SFT-v0.1
73B
•
Updated
May 27
mesolitica/Malaysian-Reasoning
Viewer
•
Updated
May 28
•
32.3k
•
24
mesolitica/Malaysian-Reasoning-Speech-Instructions
Viewer
•
Updated
Jun 2
•
25.2k
•
27
mesolitica/Malay-Dialect-Reasoning
Viewer
•
Updated
Jun 16
•
9.13k
•
17
•
1
Upvote
1
Share collection
View history
Collection guide
Browse collections