Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
microsoft
/
VibeVoice-Realtime-0.5B
like
888
Follow
Microsoft
17.2k
Text-to-Speech
Transformers
Safetensors
English
vibevoice_streaming
Realtime TTS
Streaming text input
Long-form speech generation
arxiv:
2508.19205
arxiv:
2412.08635
License:
mit
Model card
Files
Files and versions
xet
Community
18
Deploy
Use this model
refs/pr/5
VibeVoice-Realtime-0.5B
2.04 GB
6 contributors
History:
8 commits
gghfez
Update README.md
997cee2
verified
12 days ago
figures
add model overview
13 days ago
.gitattributes
Safe
1.57 kB
add model overview
13 days ago
README.md
9.46 kB
Update README.md
12 days ago
config.json
Safe
2.12 kB
add VibeVoice-Realtime-0.5B
13 days ago
model.safetensors
2.04 GB
xet
add VibeVoice-Realtime-0.5B
13 days ago
preprocessor_config.json
Safe
360 Bytes
add VibeVoice-Realtime-0.5B
13 days ago