PaddlePaddle's Collections
PaddleOCR-VL
PP-StructureV3
PP-OCRv5
PP-OCRv4
PP-OCRv3

PaddleOCR-VL

updated 11 days ago

Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model

Upvote
23

  • PaddlePaddle/PaddleOCR-VL

    Image-Text-to-Text • 1.0B • Updated 16 days ago • 17.9k • 1.43k

  • PaddleOCR-VL Online Demo

    Running • Featured • 210

    Parse and recognize text in images


  • PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model

    Paper • 2510.14528 • Published Oct 16 • 108