an open-vocabulary sound event detection model
Generate and edit audio from text prompts
Stylized TTS – design voice, accent, and emotion your way
Separate sounds from audio mixtures using text prompts
State-of-the-art target speech extractor