VibeVoice_7B_hun_v2 / README.md

Cseti

Update README.md

fe0eca7 verified about 2 months ago

preview code

raw

history blame

2.12 kB

metadata

base_model:
  - aoi-ot/VibeVoice-Large
tags:
  - text-to-speech
  - tts
  - lora
  - sft
  - full-finetune
  - vibevice
language:
  - hu

VibeVoice_7B_Hun_v2

This is my newest finetuned VibeVoice 7B (Large) model tailored to Hungarian language. I made this by training LoRA for the LLM module, did a full-finetune on the Diffusion head modules, then merged each of them to the base model.

To finetune the model I used the following code.

Thank you for JPGallegoar for that amazing VibeVoice trainer!

Inference

For inference, you can use

this Comfyui node
Demo codes on VibeVoice Community's repository

Examples

These examples were made with 4bit inference. One can get even better results without quantization.

Voice without LoRA

Voice WITH LoRA

Important Notes: This model is created as part of a fan project for research purposes only and is not intended for commercial use. The dataset I used might contain material, which are protected by copyright. Users utilize the model at their own risk. Users are obligated to comply with copyright laws and applicable regulations. The model has been developed for research purposes, and it is not my intention to infringe on any copyright.