Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging Paper • 2505.05464 • Published May 8 • 11
Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas Paper • 2503.01773 • Published Mar 3
FELM: Benchmarking Factuality Evaluation of Large Language Models Paper • 2310.00741 • Published Oct 1, 2023
C-Eval: A Multi-Level Multi-Discipline Chinese Evaluation Suite for Foundation Models Paper • 2305.08322 • Published May 15, 2023
Composing Parameter-Efficient Modules with Arithmetic Operations Paper • 2306.14870 • Published Jun 26, 2023 • 3