Speech to Speech - a alshell7 Collection

alshell7 's Collections

Medical

General

Speech to Speech

Small/Tiny Models

Speech to Speech

updated Sep 18, 2025

Qwen/Qwen2.5-Omni-3B

Any-to-Any • 6B • Updated Apr 30, 2025 • 220k • 325
Running on CPU Upgrade

Featured

1.2k

Open ASR Leaderboard

🏆

1.2k

View and request speech models benchmark data
fishaudio/openaudio-s1-mini

Text-to-Speech • Updated Jun 2, 2025 • 4.23k • 555
fluxions/vui

Text-to-Speech • Updated Jun 17, 2025 • 437 • 146
OpenMOSS-Team/MOSS-TTSD-v0

Text-to-Speech • 2B • Updated Jun 20, 2025 • 8 • 27
nvidia/audio-flamingo-3

Audio-Text-to-Text • Updated Nov 28, 2025 • 938 • 140
bosonai/higgs-audio-v2-generation-3B-base

Text-to-Speech • 6B • Updated Jul 28, 2025 • 194k • 652
Vyvo/VyvoTTS-v0-Qwen3-0.6B

Text-to-Speech • 0.8B • Updated Aug 9, 2025 • 189 • 25
nvidia/canary-1b-v2

Automatic Speech Recognition • Updated Dec 3, 2025 • 80.5k • 336
nvidia/diar_streaming_sortformer_4spk-v2

Automatic Speech Recognition • Updated 20 days ago • 9.54k • 95
microsoft/VibeVoice-1.5B

Text-to-Speech • 3B • Updated Sep 1, 2025 • 296k • 2.17k
stepfun-ai/Step-Audio-2-mini

Any-to-Any • 8B • Updated Sep 5, 2025 • 586 • 244
FireRedTeam/FireRedTTS2

Updated Sep 17, 2025 • 64