InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency Paper • 2508.18265 • Published Aug 25, 2025 • 212
OpenGVLab/InternVL3_5-241B-A28B Image-Text-to-Text • 241B • Updated Aug 29, 2025 • 2.01k • 134
meituan-longcat/LongCat-Flash-Thinking Text Generation • 562B • Updated Sep 24, 2025 • 53 • 147
ibm-granite/granite-docling-258M Image-Text-to-Text • 0.3B • Updated Sep 23, 2025 • 215k • 1.11k
PaddlePaddle/PaddleOCR-VL Image-Text-to-Text • 1.0B • Updated about 20 hours ago • 15.9k • 1.54k