When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought Paper • 2511.02779 • Published 5 days ago • 52
Interpreting Object-level Foundation Models via Visual Precision Search Paper • 2411.16198 • Published Nov 25, 2024 • 2
Object Detectors in the Open Environment: Challenges, Solutions, and Outlook Paper • 2403.16271 • Published Mar 24, 2024 • 1
Where MLLMs Attend and What They Rely On: Explaining Autoregressive Token Generation Paper • 2509.22496 • Published Sep 26 • 3
Where MLLMs Attend and What They Rely On: Explaining Autoregressive Token Generation Paper • 2509.22496 • Published Sep 26 • 3
AHELM: A Holistic Evaluation of Audio-Language Models Paper • 2508.21376 • Published Aug 29 • 9
SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models Paper • 2504.11468 • Published Apr 10 • 30
Less is More: Fewer Interpretable Region via Submodular Subset Selection Paper • 2402.09164 • Published Feb 14, 2024 • 2
Less is More: Efficient Black-box Attribution via Minimal Interpretable Subset Selection Paper • 2504.00470 • Published Apr 1
Object Detectors in the Open Environment: Challenges, Solutions, and Outlook Paper • 2403.16271 • Published Mar 24, 2024 • 1
Less is More: Fewer Interpretable Region via Submodular Subset Selection Paper • 2402.09164 • Published Feb 14, 2024 • 2
Interpreting Object-level Foundation Models via Visual Precision Search Paper • 2411.16198 • Published Nov 25, 2024 • 2
GuardReasoner: Towards Reasoning-based LLM Safeguards Paper • 2501.18492 • Published Jan 30 • 88