HiGNN-TTS: Hierarchical Prosody Modeling with Graph Neural Networks for Expressive Long-form TTS Paper • 2309.13907 • Published Sep 25, 2023
Vec-Tok-VC+: Residual-enhanced Robust Zero-shot Voice Conversion with Progressive Constraints in a Dual-mode Training Strategy Paper • 2406.09844 • Published Jun 14, 2024
Single-Codec: Single-Codebook Speech Codec towards High-Performance Speech Generation Paper • 2406.07422 • Published Jun 11, 2024
AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension Paper • 2402.07729 • Published Feb 12, 2024
Vec-Tok Speech: speech vectorization and tokenization for neural speech generation Paper • 2310.07246 • Published Oct 11, 2023
FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter Paper • 2406.08196 • Published Jun 12, 2024
SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation Paper • 2310.05051 • Published Oct 8, 2023