LeJEPA: Provable and Scalable Self-Supervised Learning Without the Heuristics Paper • 2511.08544 • Published 9 days ago • 4
Simulating the Visual World with Artificial Intelligence: A Roadmap Paper • 2511.08585 • Published 9 days ago • 28
Stronger Together: On-Policy Reinforcement Learning for Collaborative LLMs Paper • 2510.11062 • Published Oct 13 • 28
Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents Paper • 2510.14967 • Published Oct 16 • 33
LaSeR: Reinforcement Learning with Last-Token Self-Rewarding Paper • 2510.14943 • Published Oct 16 • 38
Attention Illuminates LLM Reasoning: The Preplan-and-Anchor Rhythm Enables Fine-Grained Policy Optimization Paper • 2510.13554 • Published Oct 15 • 57
A^2FM: An Adaptive Agent Foundation Model for Tool-Aware Hybrid Reasoning Paper • 2510.12838 • Published Oct 13 • 23
Fantastic (small) Retrievers and How to Train Them: mxbai-edge-colbert-v0 Tech Report Paper • 2510.14880 • Published Oct 16 • 16
Kimi Linear: An Expressive, Efficient Attention Architecture Paper • 2510.26692 • Published 22 days ago • 108
Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents Paper • 2510.24702 • Published 23 days ago • 26
Open Character Training: Shaping the Persona of AI Assistants through Constitutional AI Paper • 2511.01689 • Published 18 days ago • 4
PokeeResearch: Effective Deep Research via Reinforcement Learning from AI Feedback and Robust Reasoning Scaffold Paper • 2510.15862 • Published Oct 17 • 9
UltraCUA: A Foundation Model for Computer Use Agents with Hybrid Action Paper • 2510.17790 • Published Oct 20 • 5
Context Engineering 2.0: The Context of Context Engineering Paper • 2510.26493 • Published 22 days ago • 7