Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
adhisetiawan 's Collections
Papers
Multimodal Models
SLMs
LLMs
Audio
Multimodal Papers

Multimodal Models

updated May 27, 2024
Upvote
-

  • microsoft/kosmos-2-patch14-224

    Image-to-Text • 2B • Updated Nov 28, 2023 • 219k • 179

  • Tyrannosaurus/TinyGPT-V

    Updated Jan 19, 2024 • 50

  • naver-clova-ix/donut-base

    Image-to-Text • Updated Aug 13, 2022 • 68.1k • 234

  • llava-hf/llava-v1.6-34b-hf

    Image-Text-to-Text • 35B • Updated Jan 27 • 3.11k • 91

  • deepseek-ai/deepseek-vl-7b-base

    7B • Updated Mar 15, 2024 • 439 • 63

  • deepseek-ai/deepseek-vl-7b-chat

    Image-Text-to-Text • 7B • Updated Mar 15, 2024 • 5.48k • 263

  • vikhyatk/moondream2

    Image-Text-to-Text • 2B • Updated Sep 23 • 1.72M • 1.34k

  • zai-org/cogvlm-chat-hf

    Text Generation • 18B • Updated Dec 19, 2023 • 2.62k • 199

  • Qwen/Qwen-VL-Chat

    Text Generation • Updated Jan 25, 2024 • 38.7k • 376

  • Qwen/Qwen-VL

    Text Generation • Updated Jan 25, 2024 • 19k • 265

  • microsoft/git-base

    Image-to-Text • 0.2B • Updated Apr 24, 2023 • 15.1k • 106
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs