voicepowered-ai/VibeVoice-finetuning

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/voicepowered-ai/VibeVoice-finetuning)

voicepowered-ai / VibeVoice-finetuning

Unofficial WIP LoRa Finetuning repository for VibeVoice

☆371

Alternatives and similar repositories for VibeVoice-finetuning

Users that are interested in VibeVoice-finetuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

vibevoice-community / VibeVoice
View on GitHub
VibeVoice: Expressive, longform conversational speech synthesis. (Community fork)
☆1,151Jun 12, 2026Updated last month
fakerybakery / openvoicelab
View on GitHub
A beginner-friendly inference to finetune & run inference on open TTS models 🗣️
☆30Feb 4, 2026Updated 5 months ago
stlohrey / chatterbox-finetuning
View on GitHub
SoTA open-source TTS
☆136Jun 7, 2025Updated last year
Enemyx-net / VibeVoice-ComfyUI
View on GitHub
A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech model, enabling high-quality single and multi-speaker voice …
☆1,521Feb 18, 2026Updated 5 months ago
jordandare / echo-tts
View on GitHub
Echo-TTS inference codebase
☆212Dec 5, 2025Updated 7 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
Vyvo-Labs / VyvoTTS
View on GitHub
VyvoTTS: LLM-Based Text-to-Speech Training Framework
☆257Apr 8, 2026Updated 3 months ago
primepake / dac_vae
View on GitHub
Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder
☆38Aug 30, 2025Updated 11 months ago
Marvis-Labs / marvis-tts
View on GitHub
☆365Aug 28, 2025Updated 11 months ago
k2-fsa / ZipVoice
View on GitHub
Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching
☆1,023Dec 2, 2025Updated 7 months ago
ysharma3501 / MiraTTS
View on GitHub
A high quality and fast TTS repository
☆517Dec 22, 2025Updated 7 months ago
randombk / chatterbox-vllm
View on GitHub
VLLM Port of the Chatterbox TTS model
☆379Oct 18, 2025Updated 9 months ago
taresh18 / TTSizer
View on GitHub
🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets
☆142Aug 10, 2025Updated 11 months ago
yl4579 / DMOSpeech2
View on GitHub
☆302Jul 22, 2025Updated last year
meaningTeam / tidy-tunes
View on GitHub
Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …
☆23May 19, 2026Updated 2 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
wildminder / ComfyUI-VibeVoice
View on GitHub
ComfyUI custom node for the VibeVoice TTS. Expressive, long-form, multi-speaker conversational audio
☆588Sep 25, 2025Updated 10 months ago
JarodMica / chatterbox
View on GitHub
SoTA open-source TTS
☆26Jul 8, 2025Updated last year
diodiogod / TTS-Audio-Suite
View on GitHub
A ComfyUI custom node integration for local multi-engine multi-language Text-to-Speech and Voice Conversion. Supports: RVC, Echo-TTS, Qwe…
☆1,128Updated this week
smallbraineng / smalltts
View on GitHub
superfast text to speech in any voice
☆62Feb 16, 2026Updated 5 months ago
zeropointnine / tts-audiobook-tool
View on GitHub
Audiobook creation app supporting too many TTS models (Qwen3-TTS, OmniVoice, VibeVoice, etc), focused on high-quality output. Plus audio-…
☆176Updated this week
davidbrowne17 / chatterbox-streaming
View on GitHub
Streaming and Fine-tuning for Chatterbox TTS
☆292Jun 15, 2025Updated last year
LEMAS-Project / LEMAS-TTS
View on GitHub
LEMAS‑TTS is a multilingual zero‑shot text‑to‑speech system, supporting 10 languages: Chinese English Spanish Russian French German Ital…
☆101Mar 31, 2026Updated 3 months ago
Soul-AILab / SAC
View on GitHub
[ACL 2026 Main] Training, inference, and testing of the SAC speech codec model.
☆108Nov 1, 2025Updated 8 months ago
rishikksh20 / MiniMax-TTS-pytorch
View on GitHub
Try to replicate the architecture of MiniMaxTTS mentioned in it's technical report
☆47Sep 2, 2025Updated 10 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
nineninesix-ai / KaniTTS-Finetune-pipeline
View on GitHub
☆27Nov 3, 2025Updated 8 months ago
disco-speech / DisCo-Speech
View on GitHub
☆90Dec 31, 2025Updated 6 months ago
ysharma3501 / FastNeuTTS
View on GitHub
A highly optimized engine for neutts-air model to generate minutes of audio in seconds. Over 200x realtime on modern hardware!
☆118Nov 24, 2025Updated 8 months ago
psdwizzard / chatterbox-Audiobook
View on GitHub
SoTA open-source TTS for Audiobook and Podcast Generation
☆205Jun 19, 2025Updated last year
fluxions-ai / stftvae
View on GitHub
Inference for the STFT-VAE continuous audio codec (24kHz, 3.125Hz latent)
☆43Jul 12, 2026Updated 2 weeks ago
tuanh123789 / Spark-TTS-finetune
View on GitHub
finetune llm part for spark-tts model
☆126Mar 25, 2025Updated last year
rsxdalv / chatterbox
View on GitHub
SoTA open-source TTS
☆165Dec 16, 2025Updated 7 months ago
stlohrey / dia-finetuning
View on GitHub
A TTS model capable of generating ultra-realistic dialogue in one pass.
☆131Jul 25, 2025Updated last year
HeCheng0625 / Diffusion-Speech-Tokenizer
View on GitHub
This repository contains a series of works on diffusion-based speech tokenizers, including the official implementation of the paper: "TaD…
☆198Jan 25, 2026Updated 6 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
petermg / Chatterbox-TTS-Extended
View on GitHub
Modified version of Chatterbox that accepts text files as input and no character restrictions. I use it to make audiobooks, especially fo…
☆573Aug 23, 2025Updated 11 months ago
knottwill / sesame-finetune
View on GitHub
Finetune Sesame AI's conversational speech model on new languages and voices. Blog post: https://blog.speechmatics.com/sesame-finetune
☆113Sep 27, 2025Updated 10 months ago
shootthesound / lora-the-explorer
View on GitHub
Advanced FLUX LoRA manipulation toolkit with GUI interface
☆59Nov 5, 2025Updated 8 months ago
SWivid / AUV
View on GitHub
An All-in-One Speech, Sound, Music Codec with Single Nested Codebook
☆28Oct 11, 2025Updated 9 months ago
thomasgauthier / csm-hf
View on GitHub
Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers
☆58May 17, 2025Updated last year
kyutai-labs / delayed-streams-modeling
View on GitHub
Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.
☆2,993Jan 26, 2026Updated 6 months ago
o-l-l-i / simple-captioner
View on GitHub
Simple image and video captioning app with a Gradio UI, powered by Qwen2.5/3 VL Instruct.
☆25Apr 1, 2026Updated 3 months ago