mlfoundations/Gelato-30B-A3B Zero-Shot Object Detection • 31B • Updated about 21 hours ago • 994 • 13
brabooObrabo/Kimi-Linear-48B-A3B-Instruct-MXFP4-GS32-MLX Text Generation • 49B • Updated 4 days ago • 352 • 1
Running on CPU Upgrade 2.07k 2.07k The Smol Training Playbook: The Secrets to Building World-Class LLMs 📝 Display loss curves for training LLMs