Added vLLM support to IndexTTS for faster inference.
☆1,075Mar 5, 2026Updated this week
Alternatives and similar repositories for index-tts-vllm
Users that are interested in index-tts-vllm are comparing it to the libraries listed below
Sorting:
- 使用vllm加速cosyvoice2的推理☆486Apr 26, 2025Updated 10 months ago
- An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System☆19,177Dec 2, 2025Updated 3 months ago
- 基于SparkTTS、OrpheusTTS等模型,提供高质量中文语音合成与声音克隆服务。☆591May 18, 2025Updated 9 months ago
- An Open-Sourced LLM-empowered Foundation TTS System☆905Sep 28, 2025Updated 5 months ago
- ☆476May 19, 2025Updated 9 months ago
- IndexTTS Fine-tuning notebooks☆136Jun 17, 2025Updated 8 months ago
- Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching☆872Dec 2, 2025Updated 3 months ago
- This project is to train an RWKV LLM for TTS generation which compatible to other TTS engine(like fish/cosy/chattts).☆95Oct 8, 2025Updated 5 months ago
- FlashCosyVoice: A lightweight vLLM implementation built from scratch for CosyVoice.☆242Feb 25, 2026Updated last week
- ☆485May 6, 2025Updated 10 months ago
- The official repository for the paper “NonVerbalSpeech-38K: A Scalable Pipeline for Enabling Non-Verbal Speech Generation and Understandi…☆63Dec 26, 2025Updated 2 months ago
- A large-scale speech corpus introduced in Spark-TTS, built from diverse open-source datasets for training text-to-speech (TTS) systems.☆105May 5, 2025Updated 10 months ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- Text-To-Speech for NotebookLM☆39Jul 20, 2025Updated 7 months ago
- ✨✨[NeurIPS 2025] VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model☆675May 24, 2025Updated 9 months ago
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆19,913Feb 11, 2026Updated 3 weeks ago
- MOSS-TTSD is a spoken dialogue generation model designed for expressive multi-speaker synthesis. It features long-context modeling, flex…☆1,191Mar 2, 2026Updated last week
- ☆36Sep 6, 2025Updated 6 months ago
- FastAPI Server Implementation for Bilibili Index TTS☆25Apr 13, 2025Updated 10 months ago
- 5Hz Deep-Compression Speech VAE for AR-Diffusion and CALMs☆57Nov 19, 2025Updated 3 months ago
- finetune llm part for spark-tts model☆120Mar 25, 2025Updated 11 months ago
- An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Spe…☆3,951Aug 14, 2025Updated 6 months ago
- Multilingual Voice Understanding Model☆7,669Dec 30, 2025Updated 2 months ago
- Preprocess Audio for training☆378Mar 2, 2026Updated last week
- Spark-TTS Inference Code☆10,942Apr 9, 2025Updated 11 months ago
- [ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching☆1,253Mar 2, 2026Updated last week
- 一个用于CosyVoice的api接口项目☆336Aug 31, 2025Updated 6 months ago
- Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR be…☆1,788Feb 25, 2026Updated last week
- Try to replicate the architecture of MiniMaxTTS mentioned in it's technical report☆49Sep 2, 2025Updated 6 months ago
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆114Dec 2, 2025Updated 3 months ago
- F5-TTS 推理加速,速度提升约4倍!☆123Jan 6, 2025Updated last year
- GLM-4-Voice | 端到端中英语音对话模型☆3,144Dec 5, 2024Updated last year
- ☆6,070Aug 29, 2025Updated 6 months ago
- ☆4,619Feb 13, 2026Updated 3 weeks ago
- ☆100Jan 19, 2026Updated last month
- [NeurIPS' 25] Benchmark for evaluating TTS models on complex prosodic, expressiveness, and linguistic challenges.☆194Dec 9, 2025Updated 3 months ago
- Streamable Text-to-Speech model using a language modeling approach, without vector quantization☆110May 20, 2025Updated 9 months ago
- Production First and Production Ready End-to-End Text-to-Speech Toolkit☆416Nov 20, 2025Updated 3 months ago
- ☆82Jan 22, 2025Updated last year