The Many Faces of Robustness: A Critical Analysis of Out-of-Distribution Generalization Paper • 2006.16241 • Published Jun 29, 2020
Dynamic Reflections: Probing Video Representations with Text Alignment Paper • 2511.02767 • Published 15 days ago • 3
Dynamic Reflections: Probing Video Representations with Text Alignment Paper • 2511.02767 • Published 15 days ago • 3
Dynamic Reflections: Probing Video Representations with Text Alignment Paper • 2511.02767 • Published 15 days ago • 3 • 2
COMPACT: COMPositional Atomic-to-Complex Visual Capability Tuning Paper • 2504.21850 • Published Apr 30 • 27
Tokenize Image Patches: Global Context Fusion for Effective Haze Removal in Large Images Paper • 2504.09621 • Published Apr 13 • 11
Attention IoU: Examining Biases in CelebA using Attention Maps Paper • 2503.19846 • Published Mar 25 • 7
Attention IoU: Examining Biases in CelebA using Attention Maps Paper • 2503.19846 • Published Mar 25 • 7
Unifying Specialized Visual Encoders for Video Language Models Paper • 2501.01426 • Published Jan 2 • 21
Unifying Specialized Visual Encoders for Video Language Models Paper • 2501.01426 • Published Jan 2 • 21 • 2
xT: Nested Tokenization for Larger Context in Large Images Paper • 2403.01915 • Published Mar 4, 2024 • 1
Unifying Specialized Visual Encoders for Video Language Models Paper • 2501.01426 • Published Jan 2 • 21