Projects I've worked on (or contributed to)
mrfakename PRO
AI & ML interests
LLMs, TTS, & Open Source
Recent Activity
Updated a model about 6 hours ago: mrfakename/versificator
Updated a model about 6 hours ago: mrfakename/ReverseBERT-EmbeddingGemma-300M
Published a model about 6 hours ago: mrfakename/ReverseBERT-EmbeddingGemma-300M
Organizations
SAM Audio
The SAM Audio model licenses allow for redistribution so long as the original license files are included
OpenF5 TTS
The OpenF5 TTS model series (currently OpenF5 TTS Base - more variants coming soon 👀)
Zero-Shot Voice Cloning
TTS models that support zero-shot voice cloning
-
MegaTTS 3: Sparse Alignment Enhanced Latent Diffusion Transformer for Zero-Shot Speech Synthesis
Paper • 2502.18924 • Published • 16
-
MaskGCT: Zero-Shot Text-to-Speech with Masked Generative Codec Transformer
Paper • 2409.00750 • Published • 5
-
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching
Paper • 2410.06885 • Published • 46
-
StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion
Paper • 2409.10058 • Published • 2
Spaces of the Week
My Spaces, or Spaces I worked on, that were featured on Spaces of the Week! The oldest are at the top and the newest at the bottom 🤗
-
Running on L4 • Featured • 714
StyleTTS 2
🗣 Efficient, fast, and natural text to speech with StyleTTS 2!
-
Running on Zero • Featured • 416
OpenDalle V1.1 GPU Demo
🖼 A demo of OpenDalle V1.1 on a ZERO GPU.
-
Running • Featured • 74
RWKV Music
🎵 Generate MIDI music using RWKV v4!
-
Running on CPU Upgrade • Featured • 914
TTS Arena V2
🏆 Vote on the latest TTS models!
Voice Acting Models
With LAION
Ministral 3 Llamafied
Ministral 3 models converted to the Llama format (without the vision encoder)
Podcast Pile
EmoAct
Llamafied Models
Models converted to the Llama format
-
mrfakename/Apriel-5B-Instruct-llamafied
Text Generation • 5B • Updated • 13 • 4
-
mrfakename/Apriel-5B-Base-llamafied
Text Generation • 5B • Updated • 13
-
llamafy/Qwen-Qwen2.5-1.5B-llamafied
Text Generation • 2B • Updated • 16
-
llamafy/Qwen-Qwen2.5-1.5B-Instruct-llamafied
Text Generation • 2B • Updated • 13
Failed Experiments
Experiments that didn't work out.
Projects
Projects I've worked on (or contributed to)
-
Running on CPU Upgrade • Featured • 914
TTS Arena V2
🏆 Vote on the latest TTS models!
-
laion/Emolia
Viewer • Updated • 71.8M • 13.4k • 8
-
mrfakename/OpenF5-TTS-Base
Text-to-Speech • Updated • 108 • 77
-
Running on Zero • Featured • 2.75k
F5-TTS
🗣 F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)