Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning Paper • 2512.20848 • Published 6 days ago • 28
Article nanoVLM: The simplest repository to train your VLM in pure PyTorch +5 May 21 • 245
Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular +4 12 days ago • 85
Achieving Olympia-Level Geometry Large Language Model Agent via Complexity Boosting Reinforcement Learning Paper • 2512.10534 • Published 18 days ago • 31
One Layer Is Enough: Adapting Pretrained Visual Encoders for Image Generation Paper • 2512.07829 • Published 21 days ago • 21
Article MiniGuard-v0.1: Prem's Guardrail Model Redefining the Pareto Frontier 17 days ago • 21
CLaRa: Bridging Retrieval and Generation with Continuous Latent Reasoning Paper • 2511.18659 • Published Nov 24 • 18
From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence Paper • 2511.18538 • Published Nov 23 • 278
Stabilizing Reinforcement Learning with LLMs: Formulation and Practices Paper • 2512.01374 • Published 28 days ago • 93
Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning Paper • 2512.07461 • Published 21 days ago • 74
Article Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance 20 days ago • 82
Tarka Embed V1 Collection Efficient DFKD embeddings for language understanding • 5 items • Updated 12 days ago • 6