OpenMOSS/MOSS-TTS

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/OpenMOSS/MOSS-TTS)

OpenMOSS / MOSS-TTS

MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high‑fidelity, high‑expressiveness, and complex real‑world scenarios, covering stable long‑form speech, multi‑speaker dialogue, voice/character design, environmental sound effects, and real‑time streaming TTS.

☆3,923

Alternatives and similar repositories for MOSS-TTS

Users that are interested in MOSS-TTS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

OpenMOSS / MOSS-Audio-Tokenizer
View on GitHub
MOSS-Audio-Tokenizer is a Causal Transformer-based audio tokenizer built on the CAT architecture. Trained on 3M hours of diverse audio, i…
☆248Jun 16, 2026Updated last month
OpenMOSS / MOSS-TTS-Nano
View on GitHub
MOSS-TTS-Nano is an open-source multilingual tiny speech generation model from MOSI.AI and the OpenMOSS team. With only 0.1B parameters, …
☆4,025Updated this week
OpenBMB / VoxCPM
View on GitHub
VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning
☆34,496Jul 8, 2026Updated 3 weeks ago
OpenMOSS / MOSS-TTSD
View on GitHub
MOSS-TTSD is a spoken dialogue generation model designed for expressive multi-speaker synthesis. It features long-context modeling, flex…
☆1,370Updated this week
k2-fsa / OmniVoice
View on GitHub
High-Quality Voice Cloning TTS for 600+ Languages
☆8,611Updated this week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
studio-dots-ai / dots.tts
View on GitHub
☆997Jul 10, 2026Updated 2 weeks ago
OpenMOSS / MOSS-Audio
View on GitHub
MOSS-Audio is an open-source foundation model for unified audio understanding, enabling speech, sound, music, captioning, QA, and reasoni…
☆620Jun 2, 2026Updated last month
OpenMOSS / MOSS-VL
View on GitHub
MOSS-VL is the core multimodal model series within the OpenMOSS ecosystem, dedicated to visual understanding.
☆404Jul 23, 2026Updated last week
QwenLM / Qwen3-TTS
View on GitHub
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streamin…
☆12,672Mar 17, 2026Updated 4 months ago
supertone-inc / supertonic
View on GitHub
Lightning-Fast, On-Device, Multilingual TTS — running natively via ONNX.
☆13,547Updated this week
OpenMOSS / MOVA
View on GitHub
MOVA: Towards Scalable and Synchronized Video–Audio Generation
☆1,087Jun 18, 2026Updated last month
fishaudio / fish-speech
View on GitHub
SOTA Open Source TTS
☆31,559Updated this week
inclusionAI / Ming-omni-tts
View on GitHub
Ming-omni-tts: Simple and Efficient Unified Generation of Speech, Music, and Sound with Precise Control
☆264Feb 26, 2026Updated 5 months ago
microsoft / VibeVoice
View on GitHub
Open-Source Frontier Voice AI
☆51,268Updated this week
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Soul-AILab / SoulX-Singer
View on GitHub
Official inference code for SoulX-Singer: Towards High-Quality Zero-Shot Singing Voice Synthesis
☆854May 29, 2026Updated 2 months ago
meituan-longcat / LongCat-AudioDiT
View on GitHub
☆555Apr 3, 2026Updated 3 months ago
zai-org / GLM-TTS
View on GitHub
GLM-TTS: Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning
☆1,046Apr 10, 2026Updated 3 months ago
kyutai-labs / pocket-tts
View on GitHub
A TTS that fits in your CPU (and pocket)
☆7,929Jul 16, 2026Updated 2 weeks ago
jamiepine / voicebox
View on GitHub
The open-source AI voice studio. Clone, dictate, create.
☆47,343Updated this week
QwenAudio / CosyVoice
View on GitHub
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
☆22,464May 25, 2026Updated 2 months ago
AMAPVOICE / PilotTTS
View on GitHub
☆212Jun 2, 2026Updated last month
k2-fsa / ZipVoice
View on GitHub
Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching
☆1,023Dec 2, 2025Updated 7 months ago
HKUDS / ViMax
View on GitHub
"ViMax: Agentic Video Generation (Director, Screenwriter, Producer, and Video Generator All-in-One)"
☆11,385Jul 20, 2026Updated last week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
cwx-worst-one / WavTTS
View on GitHub
WavTTS: Towards High-Quality Zero-Shot TTS via Direct Raw Waveform Modeling
☆210Jun 6, 2026Updated last month
OpenMOSS / MOSS-Speech
View on GitHub
MOSS-Speech is a true speech-to-speech large language model without text guidance.
☆139Feb 13, 2026Updated 5 months ago
resemble-ai / chatterbox
View on GitHub
SoTA open-source TTS
☆25,753Jul 21, 2026Updated last week
stepfun-ai / Step-Audio-EditX
View on GitHub
A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing emotion, speaking style, and paralinguistics…
☆956Apr 9, 2026Updated 3 months ago
index-tts / index-tts
View on GitHub
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
☆22,251Jul 14, 2026Updated 2 weeks ago
OpenMOSS / MOSS-Transcribe-Diarize
View on GitHub
MOSS-Transcribe-Diarize 0.9B is an open-source SOTA end-to-end audio understanding model for long-form multi-speaker transcription, diari…
☆1,329Updated this week
Soul-AILab / SAC
View on GitHub
[ACL 2026 Main] Training, inference, and testing of the SAC speech codec model.
☆108Nov 1, 2025Updated 8 months ago
resemble-ai / DramaBox
View on GitHub
super expressive prompting model based on ltx2.3
☆470May 23, 2026Updated 2 months ago
neuphonic / neutts
View on GitHub
On-device TTS model by Neuphonic
☆6,207Updated this week
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
inclusionAI / Ming-UniAudio
View on GitHub
Ming-UniAudio: Speech LLM for Joint Understanding, Generation and Editing with Unified Representation
☆451Nov 27, 2025Updated 8 months ago
Zyphra / ZONOS2
View on GitHub
Zonos2 is a leading open-weight text-to-speech MoE.
☆290Jul 6, 2026Updated 3 weeks ago
dograh-hq / dograh
View on GitHub
Open source voice AI platform. Self-hosted alternative to Vapi and Retell. On Prem, BYOK across Speech to Speech or LLM/STT/TTS, with a …
☆5,071Updated this week
XiaomiMiMo / MiMo-Audio
View on GitHub
MiMo-Audio: Audio Language Models are Few-Shot Learners
☆1,070Jun 17, 2026Updated last month
Lightricks / LTX-2
View on GitHub
Official Python inference and LoRA trainer package for the LTX-2 audio–video generative model.
☆8,450Jul 8, 2026Updated 3 weeks ago
ysharma3501 / LavaSR
View on GitHub
🌋LavaSR: Fast Speech restoration and enhancement
☆566Jun 19, 2026Updated last month
justdubit / just-dub-it
View on GitHub
Code for 'JUST-DUB-IT: Video Dubbing via Joint Audio-Visual Diffusion'
☆268May 11, 2026Updated 2 months ago