How to Connect Speech Foundation Models and Large Language Models? What Matters and What Does Not Paper • 2409.17044 • Published Sep 25, 2024 • 3
Meetween's Research Papers Collection Research papers published within the MEETWEEN project • 31 items • Updated 22 days ago • 4
On Speculative Decoding for Multimodal Large Language Models Paper • 2404.08856 • Published Apr 13, 2024 • 13
BLINK: Multimodal Large Language Models Can See but Not Perceive Paper • 2404.12390 • Published Apr 18, 2024 • 26
Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models Paper • 2404.13013 • Published Apr 19, 2024 • 31