Cseti commited on
Commit
2dc30bc
·
verified ·
1 Parent(s): 8d10177

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +45 -1
README.md CHANGED
@@ -1 +1,45 @@
1
- # Vibevoice
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model:
3
+ - aoi-ot/VibeVoice-Large
4
+ tags:
5
+ - text-to-speech
6
+ - tts
7
+ - lora
8
+ - sft
9
+ - full-finetune
10
+ - vibevice
11
+ language:
12
+ - hu
13
+ ---
14
+ # VibeVoice_7B_Hun_v2
15
+ This is my newest finetuned VibeVoice 7B (Large) model tailored to Hungarian language.
16
+ I made this by training LoRA for the LLM module, did a full-finetune on the Diffusion head modules, then merged each of them to the base model.
17
+
18
+ To finetune the model I used the [following code](https://github.com/voicepowered-ai/VibeVoice-finetuning).
19
+
20
+ Thank you for [JPGallegoar](https://github.com/jpgallegoar-vpai) for that amazing VibeVoice trainer!
21
+
22
+ ## Inference
23
+ To use the LoRA model you can use [my modified fork](https://github.com/cseti007/VibeVoice)
24
+ until the [following PR](https://github.com/vibevoice-community/VibeVoice/pull/6)
25
+ will be merged into the main branch of [VibeVoice Community's repository](https://github.com/vibevoice-community/VibeVoice).
26
+
27
+ ## Examples
28
+
29
+ **Voice without LoRA**
30
+ <div style="display: flex; gap: 20px;">
31
+ <audio controls src="https://huggingface.co/Cseti/VibeVoice_7B_Diffusion-head-LoRA_Hungarian-CV17/resolve/main/assets/synth_s42_nolora-1.wav"></audio>
32
+ <audio controls src="https://huggingface.co/Cseti/VibeVoice_7B_Diffusion-head-LoRA_Hungarian-CV17/resolve/main/assets/synth_s98765_nolora-1.wav"></audio>
33
+ </div>
34
+
35
+
36
+ **Voice WITH LoRA**
37
+ <div style="display: flex; gap: 20px;">
38
+ <audio controls src="https://huggingface.co/Cseti/VibeVoice_7B_Diffusion-head-LoRA_Hungarian-CV17/resolve/main/assets/synth_hu-lora_srand3.wav"></audio>
39
+ <audio controls src="https://huggingface.co/Cseti/VibeVoice_7B_Diffusion-head-LoRA_Hungarian-CV17/resolve/main/assets/synth_s42_hu-lora-1.wav"></audio>
40
+ </div>
41
+
42
+ Important Notes: This model is created as part of a fan project for research purposes only and is not intended for commercial use.
43
+ The dataset I used might contain material, which are protected by copyright. Users utilize the model at their own risk.
44
+ Users are obligated to comply with copyright laws and applicable regulations.
45
+ The model has been developed for research purposes, and it is not my intention to infringe on any copyright.