vibevoice-community/VibeVoice

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/vibevoice-community/VibeVoice)

vibevoice-community / VibeVoice

VibeVoice: Expressive, longform conversational speech synthesis. (Community fork)

☆1,151

Alternatives and similar repositories for VibeVoice

Users that are interested in VibeVoice are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

voicepowered-ai / VibeVoice-finetuning
View on GitHub
Unofficial WIP LoRa Finetuning repository for VibeVoice
☆371Sep 24, 2025Updated 10 months ago
Enemyx-net / VibeVoice-ComfyUI
View on GitHub
A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech model, enabling high-quality single and multi-speaker voice …
☆1,521Feb 18, 2026Updated 5 months ago
vibevoice-community / VibeVoice-API
View on GitHub
API server for VibeVoice
☆29Sep 28, 2025Updated 10 months ago
k2-fsa / ZipVoice
View on GitHub
Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching
☆1,023Dec 2, 2025Updated 7 months ago
microsoft / VibeVoice
View on GitHub
Open-Source Frontier Voice AI
☆51,268Updated this week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
jordandare / echo-tts
View on GitHub
Echo-TTS inference codebase
☆212Dec 5, 2025Updated 7 months ago
stlohrey / chatterbox-finetuning
View on GitHub
SoTA open-source TTS
☆136Jun 7, 2025Updated last year
haoweilou / ParaStyleTTS
View on GitHub
This is the official code for ACM CIKM 2025 Paper: ParaStyleTTS: Toward Efficient and Robust Paralinguistic Style Control for Expressive …
☆59Dec 21, 2025Updated 7 months ago
Stylish-TTS / stylish-tts
View on GitHub
High quality text-to-speech based on StyleTTS 2.
☆78Apr 6, 2026Updated 3 months ago
rishikksh20 / MiniMax-TTS-pytorch
View on GitHub
Try to replicate the architecture of MiniMaxTTS mentioned in it's technical report
☆47Sep 2, 2025Updated 10 months ago
HeCheng0625 / Diffusion-Speech-Tokenizer
View on GitHub
This repository contains a series of works on diffusion-based speech tokenizers, including the official implementation of the paper: "TaD…
☆198Jan 25, 2026Updated 6 months ago
Soul-AILab / SAC
View on GitHub
[ACL 2026 Main] Training, inference, and testing of the SAC speech codec model.
☆108Nov 1, 2025Updated 8 months ago
fakerybakery / openvoicelab
View on GitHub
A beginner-friendly inference to finetune & run inference on open TTS models 🗣️
☆30Feb 4, 2026Updated 5 months ago
boson-ai / higgs-audio
View on GitHub
Text-audio foundation model from Boson AI
☆8,304Jun 5, 2026Updated last month
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
OpenMOSS / MOSS-TTSD
View on GitHub
MOSS-TTSD is a spoken dialogue generation model designed for expressive multi-speaker synthesis. It features long-context modeling, flex…
☆1,370Updated this week
xingchensong / S3Tokenizer
View on GitHub
Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice
☆521Dec 22, 2025Updated 7 months ago
kyutai-labs / delayed-streams-modeling
View on GitHub
Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.
☆2,993Jan 26, 2026Updated 6 months ago
flamed-tts / Flamed-TTS
View on GitHub
This repository implement a novel zero-shot TTS framework, named Flamed-TTS, focusing on the efficient generation and dynamic pacing in …
☆57Aug 9, 2025Updated 11 months ago
dontriskit / VibeVoice-FastAPI
View on GitHub
🎙️ VibeVoice FastAPI - Multi-Speaker TTS API
☆32Aug 27, 2025Updated 11 months ago
ysharma3501 / MiraTTS
View on GitHub
A high quality and fast TTS repository
☆517Dec 22, 2025Updated 7 months ago
resemble-ai / chatterbox
View on GitHub
SoTA open-source TTS
☆25,753Jul 21, 2026Updated last week
XiaomiMiMo / MiMo-Audio
View on GitHub
MiMo-Audio: Audio Language Models are Few-Shot Learners
☆1,070Jun 17, 2026Updated last month
k2-fsa / OmniVoice
View on GitHub
High-Quality Voice Cloning TTS for 600+ Languages
☆8,611Updated this week
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
wildminder / ComfyUI-VibeVoice
View on GitHub
ComfyUI custom node for the VibeVoice TTS. Expressive, long-form, multi-speaker conversational audio
☆588Sep 25, 2025Updated 10 months ago
ictnlp / SLED-TTS
View on GitHub
Streamable Text-to-Speech model using a language modeling approach, without vector quantization
☆108May 20, 2025Updated last year
Audio-Foundation-Models / ConversationTTS
View on GitHub
☆101Jan 19, 2026Updated 6 months ago
yl4579 / DMOSpeech2
View on GitHub
☆302Jul 22, 2025Updated last year
OpenMOSS / MOSS-TTS
View on GitHub
MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high‑fi…
☆3,923Updated this week
IDEA-Emdoor-Lab / UniTTS
View on GitHub
A TTS Trained on Universal Audio.
☆41Jun 6, 2025Updated last year
EZ-VC / EZ-VC
View on GitHub
[EMNLP 2025 Findings] Official code for EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion
☆43Sep 9, 2025Updated 10 months ago
stepfun-ai / Step-Audio-EditX
View on GitHub
A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing emotion, speaking style, and paralinguistics…
☆956Apr 9, 2026Updated 3 months ago
Vyvo-Labs / VyvoTTS
View on GitHub
VyvoTTS: LLM-Based Text-to-Speech Training Framework
☆257Apr 8, 2026Updated 3 months ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
smallbraineng / smalltts
View on GitHub
superfast text to speech in any voice
☆62Feb 16, 2026Updated 5 months ago
JarodMica / index-tts
View on GitHub
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
☆146Nov 15, 2025Updated 8 months ago
canopyai / Orpheus-TTS
View on GitHub
Towards Human-Sounding Speech
☆6,264Dec 5, 2025Updated 7 months ago
yrom / finetune-index-tts
View on GitHub
IndexTTS Fine-tuning notebooks
☆139Jun 17, 2025Updated last year
ysharma3501 / FlashSR
View on GitHub
Fast audio super resolution from 16khz to 48khz.
☆215Jan 3, 2026Updated 6 months ago
FireRedTeam / FireRedTTS2
View on GitHub
Long-form streaming TTS system for multi-speaker dialogue generation
☆1,417Oct 26, 2025Updated 9 months ago
ace-step / ACE-Step
View on GitHub
ACE-Step: A Step Towards Music Generation Foundation Model
☆4,701Feb 15, 2026Updated 5 months ago