Chethan Kumar D A
chethan62
AI & ML interests
tech
Recent Activity
liked a model about 3 hours ago: prithivMLmods/Nanonets-OCR-s-AIO-GGUF
liked a model 2 days ago: ggerganov/whisper.cpp
liked a model 2 days ago: dx8152/Qwen-Edit-2509-Multiple-angles
Organizations
None yet
TTS
spaces
- XTTS • Runtime error • 2.77k
  🐸 Generate speech from text using a reference voice
- Moonshine ASR • Running on Zero • 35
  🌒 Fast & efficient ASR outperforming Whisper!
- Edge TTS Text To Speech • Running • 922
  👁 Generate speech from text using Microsoft Edge TTS
- Video Dubbing (SoniTranslate) • Paused • 845
  🌍 Video Dubbing with Open Source Projects
papers
- The Chosen One: Consistent Characters in Text-to-Image Diffusion Models
  Paper • 2311.10093 • Published • 59
- NeuroPrompts: An Adaptive Framework to Optimize Prompts for Text-to-Image Generation
  Paper • 2311.12229 • Published • 27
- Diffusion Model Alignment Using Direct Preference Optimization
  Paper • 2311.12908 • Published • 50
- VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models
  Paper • 2312.00845 • Published • 39
STT
TTS
Ai
webgpu
models