adhisetiawan's Collections
Multimodal Models
Updated May 27, 2024
microsoft/kosmos-2-patch14-224 • Image-to-Text • 2B • Updated Nov 28, 2023 • 219k • 179
Tyrannosaurus/TinyGPT-V • Updated Jan 19, 2024 • 50
naver-clova-ix/donut-base • Image-to-Text • Updated Aug 13, 2022 • 68.1k • 234
llava-hf/llava-v1.6-34b-hf • Image-Text-to-Text • 35B • Updated Jan 27 • 3.11k • 91
deepseek-ai/deepseek-vl-7b-base • 7B • Updated Mar 15, 2024 • 439 • 63
deepseek-ai/deepseek-vl-7b-chat • Image-Text-to-Text • 7B • Updated Mar 15, 2024 • 5.48k • 263
vikhyatk/moondream2 • Image-Text-to-Text • 2B • Updated Sep 23 • 1.72M • 1.34k
zai-org/cogvlm-chat-hf • Text Generation • 18B • Updated Dec 19, 2023 • 2.62k • 199
Qwen/Qwen-VL-Chat • Text Generation • Updated Jan 25, 2024 • 38.7k • 376
Qwen/Qwen-VL • Text Generation • Updated Jan 25, 2024 • 19k • 265
microsoft/git-base • Image-to-Text • 0.2B • Updated Apr 24, 2023 • 15.1k • 106