16
UGround
📱
Extract text from images using various OCR modes
Track points in a video
Describe image contents with prompts
Generate responses to video or image inputs
A data extraction tool to convert PDF to Markdown and JSON
Visual Retrieval with ColPali and Vespa
Generate clickable coordinates on a screenshot
Demo for https://github.com/Byaidu/PDFMathTranslate
Controlling Computers with Small Models
Generate code snippets with AnyCoder