Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B Paper • 2511.06221 • Published 4 days ago • 72
Article The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix By codelion • 10 days ago • 34
Article Why Did MiniMax M2 End Up as a Full Attention Model? By MiniMax-AI • 14 days ago • 59
Article ⚡ Power, Heat, and Intelligence ☁️ - AI Data Centers Explained 🏭 By sasha and 1 other • 8 days ago • 13
OlmoEarth Collection OlmoEarth pre-trained and fine-tuned foundation models for remote sensing • 10 items • Updated 10 days ago • 11
Article compar:IA Ranking: From User Votes to a Participatory Ranking of Models By comparIA • 10 days ago • 6
NemoGuard Collection Essential datasets and models for content safety, topic-following, and security guardrails • 11 items • Updated 4 days ago • 11
gpt-oss-safeguard Collection gpt-oss-safeguard-120b and gpt-oss-safeguard-20b are safety reasoning models built upon gpt-oss • 2 items • Updated 15 days ago • 56
Article 3+ Years of ML & Society at Hugging Face 🤗🤝🧑🤝🧑 By yjernite and 3 others • 15 days ago • 13
Article On the Shifting Global Compute Landscape By huggingface and 1 other • 15 days ago • 49
Article ☁️ When we pay for AI cloud compute, what are we really paying for? 💲 By sasha and 1 other • 16 days ago • 3
Article Granite 4.0 Nano: Just how small can you go? By ibm-granite and 1 other • 16 days ago • 112
Article huggingface_hub v1.0: Five Years of Building the Foundation of Open Machine Learning 18 days ago • 62