facebook/metaclip-h14-fullcc2.5b Zero-Shot Image Classification • 1.0B • Updated Jan 11, 2024 • 26.6k • 44
Audio-Reasoner: Improving Reasoning Capability in Large Audio Language Models Paper • 2503.02318 • Published Mar 4 • 1
Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities Paper • 2503.03983 • Published Mar 6 • 26