Vision-as-Inverse-Graphics Agent via Interleaved Multimodal Reasoning Paper • 2601.11109 • Published 17 days ago • 2
Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens Paper • 2511.19418 • Published Nov 24, 2025 • 29