VTP Collection Towards Scalable Pre-training of Visual Tokenizers for Generation • 4 items • Updated 1 day ago • 35
Towards Scalable Pre-training of Visual Tokenizers for Generation Paper • 2512.13687 • Published 2 days ago • 78
InfiniteVL: Synergizing Linear and Sparse Attention for Highly-Efficient, Unlimited-Input Vision-Language Models Paper • 2512.08829 • Published 8 days ago • 17
view article Article Assisted Generation: a new direction toward low-latency text generation May 11, 2023 • 74
RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning Paper • 2502.13144 • Published Feb 18 • 38
Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation Paper • 2502.13145 • Published Feb 18 • 38