a710128/nanovllm-voxcpm

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/a710128/nanovllm-voxcpm)

a710128 / nanovllm-voxcpm

☆278

Alternatives and similar repositories for nanovllm-voxcpm

Users that are interested in nanovllm-voxcpm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Ksuriuri / index-tts-vllm
View on GitHub
Added vLLM support to IndexTTS for faster inference.
☆1,214Apr 13, 2026Updated 3 months ago
xingchensong / FlashCosyVoice
View on GitHub
FlashCosyVoice: A lightweight vLLM implementation built from scratch for CosyVoice.
☆250Feb 25, 2026Updated 5 months ago
newgrit1004 / omnivoice-triton
View on GitHub
Triton kernel fusion & CUDA Graph optimization for OmniVoice inference — RMSNorm, SwiGLU, Norm+Residual, SageAttention
☆58Jul 20, 2026Updated last week
bluryar / VoxCPM-ONNX
View on GitHub
☆49Mar 18, 2026Updated 4 months ago
qi-hua / async_cosyvoice
View on GitHub
使用vllm加速cosyvoice2的推理
☆498Apr 26, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
AMAPVOICE / PilotTTS
View on GitHub
☆212Jun 2, 2026Updated last month
k2-fsa / Flow2GAN
View on GitHub
Hybrid Flow Matching and GAN with Multi-Resolution Network for Few-Step High-Fidelity Audio Generation
☆146Mar 8, 2026Updated 4 months ago
yrom / finetune-index-tts
View on GitHub
IndexTTS Fine-tuning notebooks
☆139Jun 17, 2025Updated last year
tsdocode / nano-qwen3tts-vllm
View on GitHub
Qwen3-TTS with nano vLLM-style optimizations for fast text-to-speech generation. Achieved 3x faster
☆135Mar 3, 2026Updated 4 months ago
OpenBMB / VoxCPM
View on GitHub
VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning
☆34,391Jul 8, 2026Updated 3 weeks ago
zai-org / GLM-TTS
View on GitHub
GLM-TTS: Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning
☆1,046Apr 10, 2026Updated 3 months ago
inclusionAI / MingTok-Audio
View on GitHub
☆88Feb 24, 2026Updated 5 months ago
ASLP-lab / FlashTTS
View on GitHub
Fast Streaming TTS with MTP Acceleration and X-pred Mean Flow Distillation
☆67Jun 16, 2026Updated last month
pengzhendong / wetext
View on GitHub
Python runtime for WeTextProcessing (does not depend on Pynini)
☆53Jun 11, 2026Updated last month
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ASLP-lab / VoiceSculptor
View on GitHub
An instruct text-to-speech solution based on LLaSA and CosyVoice2 developed by the ASLP lab and collaborators.
☆250Feb 26, 2026Updated 5 months ago
GiantAILab / DiaMoE-TTS
View on GitHub
Official code for"DiaMoE-TTS: A Unified IPA-based Dialect TTS Framework with Mixture-of-Experts and Parameter-Efficient Zero-Shot Adaptat…
☆246Nov 28, 2025Updated 8 months ago
OpenMOSS / MOSS-Speech
View on GitHub
MOSS-Speech is a true speech-to-speech large language model without text guidance.
☆139Feb 13, 2026Updated 5 months ago
k2-fsa / ZipVoice
View on GitHub
Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching
☆1,021Dec 2, 2025Updated 7 months ago
bluryar / VoxCPM.cpp
View on GitHub
Standalone C++ inference project for VoxCPM models built on top of ggml.
☆88Jul 14, 2026Updated 2 weeks ago
krafton-ai / Raon-OpenTTS
View on GitHub
Open-source text-to-speech model from KRAFTON trained exclusively on public speech data, with curated datasets and reproducible training …
☆75May 21, 2026Updated 2 months ago
xingchensong / S3Tokenizer
View on GitHub
Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice
☆521Dec 22, 2025Updated 7 months ago
QwenAudio / CV3-Eval
View on GitHub
☆188Aug 25, 2025Updated 11 months ago
ASLP-lab / MeanVC
View on GitHub
A Lightweight and Streaming Zero-Shot Voice Conversion via Mean Flows
☆298Jan 8, 2026Updated 6 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
hhguo / SoCodec
View on GitHub
Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications
☆92Dec 20, 2024Updated last year
QwenLM / Qwen3-ASR
View on GitHub
Qwen3-ASR is an open-source series of ASR models developed by the Qwen team at Alibaba Cloud, supporting stable multilingual speech/music…
☆3,239Jun 26, 2026Updated last month
disco-speech / DisCo-Speech
View on GitHub
☆90Dec 31, 2025Updated 6 months ago
AFun9 / Omnivoice-onnx
View on GitHub
☆18May 13, 2026Updated 2 months ago
OpenMOSS / MOSS-TTSD
View on GitHub
MOSS-TTSD is a spoken dialogue generation model designed for expressive multi-speaker synthesis. It features long-context modeling, flex…
☆1,363Updated this week
the-bird-F / Expressive-Vectors
View on GitHub
[ICASSP 2026] Task Vector in TTS: Toward Emotionally Expressive Dialectal Speech Synthesis
☆40Dec 24, 2025Updated 7 months ago
boson-ai / EmergentTTS-Eval-public
View on GitHub
[NeurIPS' 25] Benchmark for evaluating TTS models on complex prosodic, expressiveness, and linguistic challenges.
☆226Dec 9, 2025Updated 7 months ago
JarodMica / index-tts
View on GitHub
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
☆146Nov 15, 2025Updated 8 months ago
FireRedTeam / FireRedASR2S
View on GitHub
A SOTA Industrial-Grade All-in-One ASR system with ASR, VAD, LID, and Punc modules. FireRedASR2 supports Chinese (Mandarin, 20+ dialects/…
☆619Jun 2, 2026Updated last month
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
sunnyxrxrx / X-Voice
View on GitHub
X-Voice
☆177Jun 5, 2026Updated last month
thu-spmi / CTC-TTS
View on GitHub
Code for CTC-TTS: LLM-based dual-streaming text-to-speech with CTC alignment, Interspeech 2026.
☆20Jun 9, 2026Updated last month
Ereboas / TacoLM
View on GitHub
☆19May 2, 2024Updated 2 years ago
ZMXJJ / Voca
View on GitHub
Voca - Your local voice cloning assistant. Powered by VoxCPM
☆44Jul 2, 2026Updated 3 weeks ago
liutaocode / TTS-arxiv-daily
View on GitHub
Automatically Update Text-to-speech (TTS) Papers Daily using Github Actions (Update Every 12th hours)
☆663Updated this week
xinshengwang / robpitch
View on GitHub
A pitch detection model trained to be robust against noise and reverberation environments.
☆27Jan 21, 2025Updated last year
QwenAudio / Fun-ASR
View on GitHub
Open-source LLM-based ASR model family for Chinese, dialect, accent, and multilingual speech, with FunASR, vLLM, streaming, and llama.cpp…
☆1,438Updated this week