baidu/ERNIE-4.5-VL-28B-A3B-Thinking Image-Text-to-Text • 30B • Updated 4 days ago • 12.8k • 465
nvidia/NVIDIA-Nemotron-Nano-12B-v2-VL-BF16 Image-Text-to-Text • 13B • Updated 5 days ago • 18.4k • 50
moonshotai/Kimi-Linear-48B-A3B-Instruct Text Generation • 49B • Updated 1 day ago • 312k • 452
Running on CPU Upgrade 2.26k 2.26k The Smol Training Playbook 📚 The secrets to building world-class LLMs