Uzair2108 (Uzair Ahmed)

Self Forcing Wan 2.1

🎥

322

Real-time video generation

Step1X 3D

🐨

248

image2mesh

Stable Audio Open Zero

🔥

450

Generate audio from text prompts

OpenAudio S1

🏆

680

Generate speech from text

Direct3D S2 V1.0 Demo

💻

424

Generate 3D models from text descriptions

Meigen MultiTalk

🎙

268

Audio-Driven Multi-Person Conversational Video Generation

DreamO

🐨

600

A Unified Framework for Image Customization

Mistral OCR 3

🌆

72

Try out Mistral's latest OCR with pdfs and images

Qwen3 Demo

📊

825

Generate responses to text prompts in a chat interface

Nanonets OCR

👁

82

Demo for Nanonets-OCR

MedGemma 4B IT

🩻

34

Chat with MedGemma 4B, a medical variant of Gemma 3

Medgemma 27b Text It

😻

12

Generate medically-informed responses using prompts

Parakeet-TDT-0.6b-V2

456

Transcribe audio to text with timestamps

Sesame CSM

🌱

856

Conversational speech generation

Kyutai STT 2.6B EN

😻

7

Transcribe English audio to text

Kyutai Tts Test

🐨

2

Finegrain Image Enhancer

🖼

1.95k

Clarity AI Upscaler Reproduction

Flux.1-dev Upscaler

🔎

1.62k

Upscale low-resolution images to high resolution

Song Generation

🎵

578

Generate a custom song with lyrics and optional prompts

OmniGen2

👀

427

OmniGen2: Unified Image Understanding and Generation.
