Garo TTS – Bootstrap VITS

This is a bootstrap multi-speaker Garo TTS model trained on noisy public speech data. It is intended as an acoustic prior, not a production-quality TTS.

Notes

  • Background noise may be present
  • Pronunciation may be inaccurate
  • Suitable for fine-tuning with clean, single-speaker studio data

Model

  • Architecture: VITS
  • Sample rate: 16 kHz
  • Speakers: 6
  • Phonemizer: eSpeak (English voice used as a proxy for Garo)
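
Because the model runs at 16 kHz, audio prepared for fine-tuning would presumably need to match that rate. The sketch below uses only the Python standard library to flag WAV files that do not. The helper name `check_wav` is hypothetical, and the mono requirement is an assumption that this card does not state.

```python
import wave

TARGET_SAMPLE_RATE = 16000  # 16 kHz, as listed under "Model"


def check_wav(path, target_rate=TARGET_SAMPLE_RATE):
    """Return a list of problems found in a PCM WAV file (empty if none)."""
    problems = []
    # The wave module reads the header without loading the audio samples.
    with wave.open(path, "rb") as wf:
        rate = wf.getframerate()
        channels = wf.getnchannels()
    if rate != target_rate:
        problems.append(f"sample rate is {rate} Hz, expected {target_rate} Hz")
    # Assumption: fine-tuning clips should be mono; the card does not say so.
    if channels != 1:
        problems.append(f"{channels} channels, expected mono")
    return problems
```

Calling `check_wav("clip.wav")` on each recording before fine-tuning gives a quick list of files that may need resampling or downmixing. The file name here is only a placeholder.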

Files

  • best_model.pth
  • config.json
  • speakers.json
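
The values accepted for `speaker_name` at inference time presumably come from speakers.json. The sketch below lists them, assuming the file is a JSON object keyed by speaker name, which is the layout Coqui TTS commonly writes for speaker-ID files. This card does not confirm that layout, and `list_speakers` is a hypothetical helper name.

```python
import json


def list_speakers(path="speakers.json"):
    """Return the sorted speaker names found in a speaker-ID mapping file."""
    with open(path, "r", encoding="utf-8") as f:
        data = json.load(f)
    # Assumption: the file is an object such as {"speaker_name": id, ...}.
    if not isinstance(data, dict):
        raise ValueError("expected a JSON object keyed by speaker name")
    return sorted(data)
```

Printing the result of `list_speakers()` from the directory that holds the model files should show the six speaker names, if the assumed layout holds.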

Inference (Python)

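# Requires the Coqui TTS package, which provides the TTS module.
from TTS.utils.synthesizer import Synthesizer

# Load the checkpoint and its config from the current directory.
synth = Synthesizer(
    tts_checkpoint="best_model.pth",
    tts_config_path="config.json",
    voice_dir=".",
    use_cuda=False,  # run on CPU
)

# Synthesize one Garo sentence with one of the model's speakers.
wav = synth.tts(
    text="ia nokni rongde rimit rong onga",
    speaker_name="southgarohills_male",
)

# Write the generated waveform to disk.
synth.save_wav(wav, "output.wav")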