zhao-kun / VibeVoiceFusionLinks

VibeVoiceFusion is a full-stack, multi-speaker voice generation web system featuring LoRA fine-tuning, batch generation, and VRAM optimization. Based on Microsoft's VibeVoice (AR + diffusion architecture)
402Updated this week

Alternatives and similar repositories for VibeVoiceFusion

Users that are interested in VibeVoiceFusion are comparing it to the libraries listed below

Sorting: