AI & ML interests
None defined yet.
Multimodal model for better turn-taking
Ultravox is a multimodal Speech LLM built around different pretrained LLMs (frozen) and the whisper-large-v3-turbo (frozen) backbone.
-
fixie-ai/ultravox-v0_6-llama-3_3-70b
Audio-Text-to-Text • 0.7B • Updated • 1.07k • 9 -
fixie-ai/ultravox-v0_6-gemma-3-27b
Audio-Text-to-Text • 0.7B • Updated • 1.66k • 8 -
fixie-ai/ultravox-v0_6-qwen-3-32b
Audio-Text-to-Text • 0.7B • Updated • 2.71k • 11 -
fixie-ai/ultravox-v0_6-llama-3_1-8b
Audio-Text-to-Text • 0.7B • Updated • 6.1k • 6
Ultravox is a multimodal Speech LLM built around different pretrained LLMs (frozen) and the whisper-large-v3-turbo (fine-tuned) backbone.
-
fixie-ai/ultravox-v0_5-llama-3_3-70b
Audio-Text-to-Text • 0.7B • Updated • 23 • 32 -
fixie-ai/ultravox-v0_5-llama-3_1-8b
Audio-Text-to-Text • 0.7B • Updated • 1.64k • 34 -
fixie-ai/ultravox-v0_5-llama-3_2-1b
Audio-Text-to-Text • 0.7B • Updated • 334k • 68 -
fixie-ai/ultravox-v0_5-glm-4_5-355b
Audio-Text-to-Text • 0.7B • Updated • 1.41k • 2
-
fixie-ai/ultravox-v0_6-llama-3_3-70b
Audio-Text-to-Text • 0.7B • Updated • 1.07k • 9 -
fixie-ai/ultravox-v0_6-gemma-3-27b
Audio-Text-to-Text • 0.7B • Updated • 1.66k • 8 -
fixie-ai/ultravox-v0_6-qwen-3-32b
Audio-Text-to-Text • 0.7B • Updated • 2.71k • 11 -
fixie-ai/ultravox-v0_6-llama-3_1-8b
Audio-Text-to-Text • 0.7B • Updated • 6.1k • 6
Multimodal model for better turn-taking
Ultravox is a multimodal Speech LLM built around different pretrained LLMs (frozen) and the whisper-large-v3-turbo (fine-tuned) backbone.
-
fixie-ai/ultravox-v0_5-llama-3_3-70b
Audio-Text-to-Text • 0.7B • Updated • 23 • 32 -
fixie-ai/ultravox-v0_5-llama-3_1-8b
Audio-Text-to-Text • 0.7B • Updated • 1.64k • 34 -
fixie-ai/ultravox-v0_5-llama-3_2-1b
Audio-Text-to-Text • 0.7B • Updated • 334k • 68 -
fixie-ai/ultravox-v0_5-glm-4_5-355b
Audio-Text-to-Text • 0.7B • Updated • 1.41k • 2
Ultravox is a multimodal Speech LLM built around different pretrained LLMs (frozen) and the whisper-large-v3-turbo (frozen) backbone.