OCR, VQA, Thinking and Object Detection.
Convert invoices to structured JSON
Extract text from images and PDFs
Ask questions about images or PDFs
Ask questions about images using Moondream2 or SmolVLM
llava sign model for sign/stamp detection
To detect document type