k2-fsa/OmniVoice

High-Quality Voice Cloning TTS for 600+ Languages

☆8,611

Alternatives and similar repositories for OmniVoice

Users who are interested in OmniVoice are comparing it to the libraries listed below.

OpenBMB / VoxCPM
VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning
☆34,496 · Jul 8, 2026 · Updated 3 weeks ago
debpalash / OmniVoice-Studio
Local voice clone, video dubbing, dictation and audiobook maker. The open-source ElevenLabs alternative.
☆9,144 · Updated this week
meituan-longcat / LongCat-AudioDiT
☆555 · Apr 3, 2026 · Updated 3 months ago
k2-fsa / ZipVoice
Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching
☆1,023 · Dec 2, 2025 · Updated 7 months ago
QwenLM / Qwen3-TTS
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streamin…
☆12,672 · Mar 17, 2026 · Updated 4 months ago
microsoft / VibeVoice
Open-Source Frontier Voice AI
☆51,268 · Updated this week
OpenMOSS / MOSS-TTS
MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high‑fi…
☆3,923 · Updated this week
Saganaki22 / ComfyUI-OmniVoice-TTS
OmniVoice TTS nodes for ComfyUI - Zero-shot multilingual text-to-speech with voice cloning, voice design, and multi-speaker dialogue
☆521 · Jun 11, 2026 · Updated last month
supertone-inc / supertonic
Lightning-Fast, On-Device, Multilingual TTS — running natively via ONNX.
☆13,547 · Updated this week
fishaudio / fish-speech
SOTA Open Source TTS
☆31,559 · Updated this week
studio-dots-ai / dots.tts
☆997 · Jul 10, 2026 · Updated 2 weeks ago
jamiepine / voicebox
The open-source AI voice studio. Clone, dictate, create.
☆47,343 · Updated this week
OpenMOSS / MOSS-TTS-Nano
MOSS-TTS-Nano is an open-source multilingual tiny speech generation model from MOSI.AI and the OpenMOSS team. With only 0.1B parameters, …
☆4,025 · Updated this week
resemble-ai / chatterbox
SoTA open-source TTS
☆25,753 · Jul 21, 2026 · Updated last week
pnnbao97 / VieNeu-TTS
Vietnamese TTS with instant voice cloning • On-device • Real-time CPU inference • 24kHz audio quality • Chuyển văn bản thành giọng nói ti…
☆2,258 · Jul 17, 2026 · Updated last week
inclusionAI / Ming-omni-tts
Ming-omni-tts: Simple and Efficient Unified Generation of Speech, Music, and Sound with Precise Control
☆264 · Feb 26, 2026 · Updated 5 months ago
HumeAI / tada
Open Source Speech Language Model
☆1,006 · May 11, 2026 · Updated 2 months ago
kyutai-labs / pocket-tts
A TTS that fits in your CPU (and pocket)
☆7,929 · Jul 16, 2026 · Updated last week
neuphonic / neutts
On-device TTS model by Neuphonic
☆6,207 · Updated this week
QwenAudio / CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
☆22,464 · May 25, 2026 · Updated 2 months ago
index-tts / index-tts
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
☆22,251 · Jul 14, 2026 · Updated 2 weeks ago
abus-aikorea / voice-pro
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with…
☆11,250 · Jul 13, 2026 · Updated 2 weeks ago
ace-step / ACE-Step-1.5
The most powerful local music generation model that outperforms almost all commercial alternatives, supporting Mac, AMD, Intel, and CUDA …
☆11,890 · Updated this week
ysharma3501 / LuxTTS
A high-quality rapid TTS voice cloning model that reaches speeds of 150x realtime.
☆4,869 · Jun 5, 2026 · Updated last month
QwenLM / Qwen3-ASR
Qwen3-ASR is an open-source series of ASR models developed by the Qwen team at Alibaba Cloud, supporting stable multilingual speech/music…
☆3,246 · Jun 26, 2026 · Updated last month
resemble-ai / DramaBox
super expressive prompting model based on ltx2.3
☆470 · May 23, 2026 · Updated 2 months ago
zai-org / GLM-TTS
GLM-TTS: Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning
☆1,046 · Apr 10, 2026 · Updated 3 months ago
andimarafioti / faster-qwen3-tts
Real-time text-to-speech with Qwen3-TTS
☆1,261 · Jul 17, 2026 · Updated last week
Zyphra / ZONOS2
Zonos2 is a leading open-weight text-to-speech MoE.
☆290 · Jul 6, 2026 · Updated 3 weeks ago
heygen-com / hyperframes
Write HTML. Render video. Built for agents.
☆38,571 · Updated this week
SWivid / F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
☆15,042 · Updated this week
k2-fsa / Flow2GAN
Hybrid Flow Matching and GAN with Multi-Resolution Network for Few-Step High-Fidelity Audio Generation
☆146 · Mar 8, 2026 · Updated 4 months ago
KittenML / KittenTTS
State-of-the-art TTS model under 25MB 😻
☆15,248 · Jun 11, 2026 · Updated last month
XiaomiMiMo / MiMo-Audio
MiMo-Audio: Audio Language Models are Few-Shot Learners
☆1,070 · Jun 17, 2026 · Updated last month
hexgrad / kokoro
https://hf.co/hexgrad/Kokoro-82M
☆8,168 · Aug 6, 2025 · Updated 11 months ago
k2-fsa / sherpa-onnx
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime…
☆13,851 · Updated this week
maemreyo / omnivoice-server
OpenAI-compatible HTTP server for OmniVoice text-to-speech
☆76 · Jun 26, 2026 · Updated last month
NVIDIA / personaplex
PersonaPlex code.
☆10,273 · Mar 2, 2026 · Updated 4 months ago
boson-ai / higgs-audio
Text-audio foundation model from Boson AI
☆8,304 · Jun 5, 2026 · Updated last month