Self Forcing Wan 2.1
Real-time video generation
Real-time video generation
image2mesh
Generate audio from text prompts
Generate speech from text
Generate 3D models from text descriptions
Audio-Driven Multi-Person Conversational Video Generation
A Unified Framework for Image Customization
Try out Mistral's latest OCR with pdfs and images
Generate responses to text prompts in a chat interface
Demo for Nanonets-OCR
Chat with MedGemma 4B, a medical variant of Gemma 3
Generate medically-informed responses using prompts
Transcribe audio to text with timestamps
Conversational speech generation
Transcribe English audio to text
Clarity AI Upscaler Reproduction
Upscale low-resolution images to high resolution
Generate a custom song with lyrics and optional prompts
OmniGen2: Unified Image Understanding and Generation.