FlashVGGT: Efficient and Scalable Visual Geometry Transformers with Compressed Descriptor Attention Paper • 2512.01540 • Published 11 days ago • 3
DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling Paper • 2512.03000 • Published 10 days ago • 35
Lotus-2: Advancing Geometric Dense Prediction with Powerful Image Generative Model Paper • 2512.01030 • Published 12 days ago • 17
Multi-view Pyramid Transformer: Look Coarser to See Broader Paper • 2512.07806 • Published 4 days ago • 20
LongSplat: Robust Unposed 3D Gaussian Splatting for Casual Long Videos Paper • 2508.14041 • Published Aug 19 • 59