Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2406.16852
Vision-Language
Collection by Aug 13, 2024
1
  • SILC: Improving Vision Language Pretraining with Self-Distillation

    Paper • 2310.13355 • Published Oct 20, 2023 • 9
  • Woodpecker: Hallucination Correction for Multimodal Large Language Models

    Paper • 2310.16045 • Published Oct 24, 2023 • 17
  • BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

    Paper • 2201.12086 • Published Jan 28, 2022 • 3
  • ImageNetVC: Zero-Shot Visual Commonsense Evaluation on 1000 ImageNet Categories

    Paper • 2305.15028 • Published May 24, 2023 • 1
Vision-Language
Collection by Aug 13, 2024
1
  • SILC: Improving Vision Language Pretraining with Self-Distillation

    Paper • 2310.13355 • Published Oct 20, 2023 • 9
  • Woodpecker: Hallucination Correction for Multimodal Large Language Models

    Paper • 2310.16045 • Published Oct 24, 2023 • 17
  • BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

    Paper • 2201.12086 • Published Jan 28, 2022 • 3
  • ImageNetVC: Zero-Shot Visual Commonsense Evaluation on 1000 ImageNet Categories

    Paper • 2305.15028 • Published May 24, 2023 • 1
  • Previous
  • 1
  • 2
  • Next
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs