MiniCPM-o & MiniCPM-V Collection Multimodal models with leading performance. • 29 items • Updated 9 days ago • 72
Nemotron v3 Pre-Training Collection Large scale pre-training datasets used in the Nemotron family of models. • 11 items • Updated 2 minutes ago • 9
Nemotron-Post-Training-v3 Collection Collection of datasets used in the post-training phase of Nemotron Nano and Super v3. • 27 items • Updated 2 minutes ago • 74
view article Article Mixture of Tunable Experts - Behavior Modification of DeepSeek-R1 at Inference Time Feb 18, 2025 • 35