Are Video Models Ready as Zero-Shot Reasoners? An Empirical Study with the MME-CoF Benchmark • Paper • 2510.26802 • Published 21 days ago • 32
Iterative Prompt Relabeling for diffusion model with RLDF • Paper • 2312.16204 • Published Dec 23, 2023
EmpathyAgent: Can Embodied Agents Conduct Empathetic Actions? • Paper • 2503.16545 • Published Mar 19 • 1
MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning • Paper • 2506.05331 • Published Jun 5 • 13
MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency • Paper • 2502.09621 • Published Feb 13 • 28