AI & ML interests

Unofficial MegaTTS 3 model mirrors. Unless otherwise specified, models should be under Apache license

mrfakenameย 
posted an update 28 days ago
view post
Post
4619
Excited to share that I've joined the Hugging Face Fellows program! ๐Ÿค—

Looking forward to contributing to & working more closely with the open-source ecosystem - huge thanks to everyone who's supported me on this journey! ๐Ÿš€
mrfakenameย 
posted an update 2 months ago
view post
Post
6093
Trained a model for emotion-controllable TTS based on MiMo audio on LAION's dataset.

Still very early and does have an issue with hallucinating but results seem pretty good so far, given that it is very early into the training run.

Will probably kick off a new run later with some settings tweaked.

Put up a demo here: https://huggingface.co/spaces/mrfakename/EmoAct-MiMo

(Turn ๐Ÿ”Š on to hear audio samples)
ยท
mrfakenameย 
in MegaTTS3/g2p 9 months ago

ๅคš้Ÿณๅญ—

1
#1 opened 9 months ago by
wangweifei
mrfakenameย 
posted an update 9 months ago
view post
Post
3672
Papla P1 from Papla Media is now available on the TTS Arena!

Try out Papla's new ultra-realistic TTS model + compare it with other leading models on the TTS Arena: TTS-AGI/TTS-Arena
mrfakenameย 
posted an update 10 months ago
mrfakenameย 
posted an update 10 months ago
mrfakenameย 
posted an update 11 months ago
view post
Post
2741
Iโ€™m excited to introduce a new leaderboard UI + keyboard shortcuts on the TTS Arena!

The refreshed UI for the leaderboard is smoother and (hopefully) more intuitive. You can now view models based on a simpler win-rate percentage and exclude closed models.

In addition, the TTS Arena now supports keyboard shortcuts. This should make voting much more efficient as you can now vote without clicking anything!

In both the normal Arena and Battle Mode, press "r" to select a random text, Cmd/Ctrl + Enter to synthesize, and "a"/"b" to vote! View more details about keyboard shortcuts by pressing "?" (Shift + /) on the Arena.

Check out all the new updates on the TTS Arena:

TTS-AGI/TTS-Arena
  • 1 reply
ยท
mrfakenameย 
posted an update about 1 year ago
view post
Post
7917
I just released an unofficial demo for Moonshine ASR!

Moonshine is a fast, efficient, & accurate ASR model released by Useful Sensors. It's designed for on-device inference and licensed under the MIT license!

HF Space (unofficial demo): mrfakename/Moonshine
GitHub repo for Moonshine: https://github.com/usefulsensors/moonshine