SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning Paper • 2602.13515 • Published 7 days ago • 30
DR-LoRA: Dynamic Rank LoRA for Mixture-of-Experts Adaptation Paper • 2601.04823 • Published Jan 8 • 7
Lost in the Noise: How Reasoning Models Fail with Contextual Distractors Paper • 2601.07226 • Published Jan 12 • 33
From Spatial to Actions: Grounding Vision-Language-Action Model in Spatial Foundation Priors Paper • 2510.17439 • Published Oct 20, 2025 • 28
VLA-JEPA: Enhancing Vision-Language-Action Model with Latent World Model Paper • 2602.10098 • Published 11 days ago • 18
LIBMoE: A Library for comprehensive benchmarking Mixture of Experts in Large Language Models Paper • 2411.00918 • Published Nov 1, 2024 • 9
Biases in the Blind Spot: Detecting What LLMs Fail to Mention Paper • 2602.10117 • Published 11 days ago • 1
Learning on the Manifold: Unlocking Standard Diffusion Transformers with Representation Encoders Paper • 2602.10099 • Published 11 days ago • 3
ALIVE: Animate Your World with Lifelike Audio-Video Generation Paper • 2602.08682 • Published 12 days ago • 2
Multi-agent cooperation through in-context co-player inference Paper • 2602.16301 • Published 3 days ago • 13
On Surprising Effectiveness of Masking Updates in Adaptive Optimizers Paper • 2602.15322 • Published 4 days ago • 9
AnchorWeave: World-Consistent Video Generation with Retrieved Local Spatial Memories Paper • 2602.14941 • Published 5 days ago • 5
WebWorld: A Large-Scale World Model for Web Agent Training Paper • 2602.14721 • Published 5 days ago • 7
ThinkRouter: Efficient Reasoning via Routing Thinking between Latent and Discrete Spaces Paper • 2602.11683 • Published 9 days ago • 7
BitDance: Scaling Autoregressive Generative Models with Binary Tokens Paper • 2602.14041 • Published 6 days ago • 42
Does Socialization Emerge in AI Agent Society? A Case Study of Moltbook Paper • 2602.14299 • Published 6 days ago • 24
Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning Paper • 2602.10090 • Published 11 days ago • 49