- Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm
  Paper • 2511.04570 • Published • 185
- V-Thinker: Interactive Thinking with Images
  Paper • 2511.04460 • Published • 91
- TIR-Bench: A Comprehensive Benchmark for Agentic Thinking-with-Images Reasoning
  Paper • 2511.01833 • Published • 15
- ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning
  Paper • 2510.27492 • Published • 78
ChenJing
CelesteChen
AI & ML interests
Computer Vision, Natural Language Generation
Recent Activity
- Updated a collection about 8 hours ago: visual thinker
- Updated a collection about 8 hours ago: others
- Updated a collection about 8 hours ago: multimodal
Organizations
None yet