Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Tamazight-NLP 's Collections
Vision-Language Datasets
Image Classification Models
MT Models
Speech Datasets
Text Datasets
Bitext Datasets
OCR Datasets
Language Models
Encoders/Fill-Mask

Vision-Language Datasets

updated 17 days ago
Upvote
-

  • floschne/m5b_vlod

    Viewer • Updated 3 days ago • 1.42k • 8 • 1

  • floschne/m5b_vgr

    Viewer • Updated 3 days ago • 1.43k • 12 • 1

  • M5 -- A Diverse Benchmark to Assess the Performance of Large Multimodal Models Across Multilingual and Multicultural Vision-Language Tasks

    Paper • 2407.03791 • Published Jul 4, 2024 • 2

  • Centurio: On Drivers of Multilingual Ability of Large Vision-Language Model

    Paper • 2501.05122 • Published Jan 9 • 20
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs