Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning Paper • 2512.20605 • Published 6 days ago • 51
Spatia: Video Generation with Updatable Spatial Memory Paper • 2512.15716 • Published 12 days ago • 22
Infinite-Homography as Robust Conditioning for Camera-Controlled Video Generation Paper • 2512.17040 • Published 10 days ago • 27
4D-RGPT: Toward Region-level 4D Understanding via Perceptual Distillation Paper • 2512.17012 • Published 10 days ago • 42
Insight Miner: A Time Series Analysis Dataset for Cross-Domain Alignment with Natural Language Paper • 2512.11251 • Published 17 days ago • 6
StereoPilot: Learning Unified and Efficient Stereo Conversion via Generative Priors Paper • 2512.16915 • Published 11 days ago • 37
Depth Any Panoramas: A Foundation Model for Panoramic Depth Estimation Paper • 2512.16913 • Published 11 days ago • 33
LLaDA2.0: Scaling Up Diffusion Language Models to 100B Paper • 2512.15745 • Published 19 days ago • 77
DEER: Draft with Diffusion, Verify with Autoregressive Models Paper • 2512.15176 • Published 12 days ago • 41
Sparse-LaViDa: Sparse Multimodal Discrete Diffusion Language Models Paper • 2512.14008 • Published 13 days ago • 9
WorldPlay: Towards Long-Term Geometric Consistency for Real-Time Interactive World Modeling Paper • 2512.14614 • Published 13 days ago • 65
Fast-FoundationStereo: Real-Time Zero-Shot Stereo Matching Paper • 2512.11130 • Published 17 days ago • 4
Exploring MLLM-Diffusion Information Transfer with MetaCanvas Paper • 2512.11464 • Published 17 days ago • 12
V-RGBX: Video Editing with Accurate Controls over Intrinsic Properties Paper • 2512.11799 • Published 17 days ago • 29
EgoX: Egocentric Video Generation from a Single Exocentric Video Paper • 2512.08269 • Published 20 days ago • 112
Evaluating Gemini Robotics Policies in a Veo World Simulator Paper • 2512.10675 • Published 18 days ago • 16