Ksuriuri/index-tts-vllm

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Ksuriuri/index-tts-vllm)

Ksuriuri / index-tts-vllm

Added vLLM support to IndexTTS for faster inference.

☆1,214

Alternatives and similar repositories for index-tts-vllm

Users that are interested in index-tts-vllm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

index-tts / index-tts
View on GitHub
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
☆22,226Jul 14, 2026Updated 2 weeks ago
qi-hua / async_cosyvoice
View on GitHub
使用vllm加速cosyvoice2的推理
☆498Apr 26, 2025Updated last year
yrom / finetune-index-tts
View on GitHub
IndexTTS Fine-tuning notebooks
☆139Jun 17, 2025Updated last year
wzw773828204 / index-tts-vllm-stream
View on GitHub
在index-tts-vllm的基础上，实现了并提供了模拟流式合成音频的接口服务及客户端测试脚本
☆25Sep 2, 2025Updated 10 months ago
a710128 / nanovllm-voxcpm
View on GitHub
☆278Jul 16, 2026Updated last week
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
k2-fsa / ZipVoice
View on GitHub
Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching
☆1,021Dec 2, 2025Updated 7 months ago
QwenAudio / CosyVoice
View on GitHub
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
☆22,464May 25, 2026Updated 2 months ago
OpenMOSS / MOSS-TTSD
View on GitHub
MOSS-TTSD is a spoken dialogue generation model designed for expressive multi-speaker synthesis. It features long-context modeling, flex…
☆1,363Updated this week
xingchensong / FlashCosyVoice
View on GitHub
FlashCosyVoice: A lightweight vLLM implementation built from scratch for CosyVoice.
☆250Feb 25, 2026Updated 5 months ago
FireRedTeam / FireRedTTS
View on GitHub
An Open-Sourced LLM-empowered Foundation TTS System
☆909Sep 28, 2025Updated 10 months ago
HuiResearch / FlashTTS
View on GitHub
基于SparkTTS、OrpheusTTS等模型，提供高质量中文语音合成与声音克隆服务。
☆612May 18, 2025Updated last year
zai-org / GLM-TTS
View on GitHub
GLM-TTS: Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning
☆1,046Apr 10, 2026Updated 3 months ago
FireRedTeam / FireRedTTS2
View on GitHub
Long-form streaming TTS system for multi-speaker dialogue generation
☆1,416Oct 26, 2025Updated 9 months ago
Holasyb918 / HeyGem-Linux-Python-Hack
View on GitHub
A docker free offline version for HeyGem; Python and Linux is all you need!
☆484Jan 12, 2026Updated 6 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
GiantAILab / DiaMoE-TTS
View on GitHub
Official code for"DiaMoE-TTS: A Unified IPA-based Dialect TTS Framework with Mixture-of-Experts and Parameter-Efficient Zero-Shot Adaptat…
☆246Nov 28, 2025Updated 8 months ago
xcc-zach / xtalk
View on GitHub
X-Talk is an open-source full-duplex cascaded spoken dialogue system framework enabling low-latency, interruptible, and human-like speech…
☆233Updated this week
QwenAudio / SenseVoice
View on GitHub
Open-source SenseVoiceSmall model for Mandarin, Cantonese, English, Japanese, and Korean ASR, language ID, emotion recognition, and audio…
☆8,949Updated this week
FireRedTeam / FireRedASR
View on GitHub
Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR be…
☆1,944Feb 25, 2026Updated 5 months ago
tuanh123789 / Spark-TTS-finetune
View on GitHub
finetune llm part for spark-tts model
☆126Mar 25, 2025Updated last year
asr-pub / index-tts-lora
View on GitHub
High-quality speech synthesis with LoRA fine-tuning on index-tts, enhancing prosody and naturalness for single and multi-speaker voices.
☆312Mar 12, 2026Updated 4 months ago
csllpr / index-tts-fastapi
View on GitHub
FastAPI Server Implementation for Bilibili Index TTS
☆25Apr 13, 2025Updated last year
fishaudio / fish-speech
View on GitHub
SOTA Open Source TTS
☆31,398Updated this week
modelscope / FunASR
View on GitHub
Open-source speech recognition toolkit for training, inference, streaming ASR, VAD, punctuation, speaker diarization pipelines, and OpenA…
☆19,514Updated this week
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
modelscope / ClearerVoice-Studio
View on GitHub
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Spe…
☆4,343Aug 14, 2025Updated 11 months ago
ASLP-lab / VoiceSculptor
View on GitHub
An instruct text-to-speech solution based on LLaSA and CosyVoice2 developed by the ASLP lab and collaborators.
☆250Feb 26, 2026Updated 5 months ago
Soul-AILab / SoulX-Podcast
View on GitHub
SoulX-Podcast is an inference codebase by the Soul AI team for generating high-fidelity podcasts from text.
☆3,505Dec 11, 2025Updated 7 months ago
AMAPVOICE / PilotTTS
View on GitHub
☆212Jun 2, 2026Updated last month
ScottishFold007 / Cosyvoice_DPO_NOTES
View on GitHub
CosyVoice_DPO_NOTES: Supercharge Your Cosyvoice model with Cutting-Edge DPO Fine-Tuning!
☆126Aug 8, 2025Updated 11 months ago
QwenAudio / CV3-Eval
View on GitHub
☆188Aug 25, 2025Updated 11 months ago
the-bird-F / Expressive-Vectors
View on GitHub
[ICASSP 2026] Task Vector in TTS: Toward Emotionally Expressive Dialectal Speech Synthesis
☆40Dec 24, 2025Updated 7 months ago
XiaomiMiMo / MiMo-Audio
View on GitHub
MiMo-Audio: Audio Language Models are Few-Shot Learners
☆1,068Jun 17, 2026Updated last month
JarodMica / index-tts
View on GitHub
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
☆146Nov 15, 2025Updated 8 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
Jesse-x86 / indextts-api
View on GitHub
an API server for indextts, allowing simple access without need to integrate actual code into your project
☆18Oct 29, 2025Updated 9 months ago
stepfun-ai / Step-Audio2
View on GitHub
Step-Audio 2 is an end-to-end multi-modal large language model designed for industry-strength audio understanding and speech conversation…
☆1,485Mar 16, 2026Updated 4 months ago
bytedance / LatentSync
View on GitHub
Taming Stable Diffusion for Lip Sync!
☆5,931Jun 20, 2025Updated last year
stepfun-ai / Step-Audio-EditX
View on GitHub
A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing emotion, speaking style, and paralinguistics…
☆955Apr 9, 2026Updated 3 months ago
Plachtaa / seed-vc
View on GitHub
zero-shot voice conversion & singing voice conversion, with real-time support
☆3,891Apr 20, 2025Updated last year
VITA-MLLM / VITA-Audio
View on GitHub
✨✨[NeurIPS 2025] VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model
☆682May 24, 2025Updated last year
zai-org / GLM-4-Voice
View on GitHub
GLM-4-Voice | 端到端中英语音对话模型
☆3,210Dec 5, 2024Updated last year