DEER: Draft with Diffusion, Verify with Autoregressive Models Paper • 2512.15176 • Published 2 days ago • 39
Efficient-DLM: From Autoregressive to Diffusion Language Models, and Beyond in Speed Paper • 2512.14067 • Published 4 days ago • 9
Rethinking Expert Trajectory Utilization in LLM Post-training Paper • 2512.11470 • Published 7 days ago • 5
Nemotron-Cascade Collection • Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models • 16 items • Updated 2 days ago • 31
Nemotron-Cascade: Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models Paper • 2512.13607 • Published 4 days ago • 18
ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding Paper • 2512.13586 • Published 4 days ago • 85
Nemotron-Flash: Towards Latency-Optimal Hybrid Small Language Models Paper • 2511.18890 • Published 25 days ago • 31
REASONEDIT: Towards Reasoning-Enhanced Image Editing Models Paper • 2511.22625 • Published 22 days ago • 46
How Far Are We from Genuinely Useful Deep Research Agents? Paper • 2512.01948 • Published 18 days ago • 53
ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration Paper • 2511.21689 • Published 23 days ago • 107
DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning Paper • 2511.22570 • Published 22 days ago • 77
Scaling Spatial Intelligence with Multimodal Foundation Models Paper • 2511.13719 • Published Nov 17 • 45