Generate financial insights from text
Extract text from images and PDFs
Transcribe audio to text using the Whisper model
Qwen3-VL / Qwen2.5-VL
Recognize text and elements in images
Generate speech from text using a reference audio sample