nazdridoy/kokoro-tts

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/nazdridoy/kokoro-tts)

nazdridoy / kokoro-tts

A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats including EPUB books and PDF documents.

☆1,723

Alternatives and similar repositories for kokoro-tts

Users that are interested in kokoro-tts are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

hexgrad / kokoro
View on GitHub
https://hf.co/hexgrad/Kokoro-82M
☆8,122Aug 6, 2025Updated 11 months ago
remsky / Kokoro-FastAPI
View on GitHub
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/multiplatform CPU, AMD, NVIDIA GPU PyTorch support, handling, and auto-s…
☆5,255Updated this week
thewh1teagle / kokoro-onnx
View on GitHub
TTS with kokoro and onnx runtime
☆2,642Jul 5, 2026Updated 3 weeks ago
PierrunoYT / Kokoro-TTS-Local
View on GitHub
A local implementation of the Kokoro Text-to-Speech model, featuring dynamic module loading, automatic dependency management, and a web i…
☆320Jun 14, 2026Updated last month
resemble-ai / chatterbox
View on GitHub
SoTA open-source TTS
☆25,697Updated this week
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
lucasjinreal / Kokoros
View on GitHub
🔥🔥 Kokoro in Rust. https://huggingface.co/hexgrad/Kokoro-82M Insanely fast, realtime TTS with high quality you ever have.
☆801Jun 19, 2026Updated last month
OHF-Voice / piper1-gpl
View on GitHub
Fast and local neural text-to-speech engine
☆4,897Updated this week
canopyai / Orpheus-TTS
View on GitHub
Towards Human-Sounding Speech
☆6,260Dec 5, 2025Updated 7 months ago
KittenML / KittenTTS
View on GitHub
State-of-the-art TTS model under 25MB 😻
☆15,226Jun 11, 2026Updated last month
fishaudio / fish-speech
View on GitHub
SOTA Open Source TTS
☆31,373Updated this week
next-1688 / 1688-source-suppliers
View on GitHub
1688找供应商 —— 结合用户需求与关键字查询对应的供应商及工厂信息核心工具能力：1688供应商查询能力。用于查询1688平台上的供应商及工厂信息。触发词：找供应商、查供应商、1688供应商、供应商信息、工厂信息、产业带查询。不触发场景：找商品/选品 → 1688-…
☆551May 7, 2026Updated 2 months ago
hanlin-afk / rt-cinfer-web
View on GitHub
Real-time causal inference framework for Web Vitals optimization using streaming SCMs, public CrUX/PageSpeed field data, and auditable in…
☆400Jul 2, 2026Updated 3 weeks ago
next-1688 / 1688-88syt
View on GitHub
88生意通是1688线下B2B交易的得力帮手，一句话搞定全流程操作！无论您是卖家还是买家，只需一句指令，即可轻松完成交易单创建、签署、确认收货、退款等核心操作，全面支持账号状态查询、实名认证、绑卡及交易，让每一步交易流程更清晰、更可控。通过智能化交互，实现交易流程数字化，提…
☆575Mar 27, 2026Updated 3 months ago
Zyphra / Zonos
View on GitHub
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expres…
☆7,235Mar 5, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
hexgrad / misaki
View on GitHub
G2P
☆482Aug 11, 2025Updated 11 months ago
rhasspy / piper
View on GitHub
A fast, local neural text to speech system
☆11,260Aug 26, 2025Updated 11 months ago
kyutai-labs / pocket-tts
View on GitHub
A TTS that fits in your CPU (and pocket)
☆7,882Jul 16, 2026Updated last week
neuphonic / neutts
View on GitHub
On-device TTS model by Neuphonic
☆6,199Updated this week
next-1688 / 1688-item-select
View on GitHub
1688 商家重点品圈选 —— 基于多维度商品评分智能识别值得重点运营的商品或搜索商品。工具能力：五维度评分（销售贡献、流量效率、成长潜力、营销ROI、商品健康度），商品分层（S/A/B/C级），搜索商品。触发词：重点品查看、圈选重点品、圈选运营商品、今日运营重点、选品…
☆623May 11, 2026Updated 2 months ago
SYSTRAN / faster-whisper
View on GitHub
Faster Whisper transcription with CTranslate2
☆24,518Nov 19, 2025Updated 8 months ago
k2-fsa / ZipVoice
View on GitHub
Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching
☆1,019Dec 2, 2025Updated 7 months ago
QwenLM / Qwen3-TTS
View on GitHub
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streamin…
☆12,595Mar 17, 2026Updated 4 months ago
eduardolat / kokoro-web
View on GitHub
🔊 Kokoro Web: Free AI text-to-speech, online or self-hosted, OpenAI compatible!
☆672Mar 16, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
mateogon / pdf-narrator
View on GitHub
Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and…
☆198Feb 26, 2026Updated 5 months ago
yl4579 / StyleTTS2
View on GitHub
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
☆6,318Aug 10, 2024Updated last year
k2-fsa / sherpa-onnx
View on GitHub
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime…
☆13,783Updated this week
jieyefriic / nbcraft
View on GitHub
Multi-backend image and video generation CLI — Gemini, DashScope (wan), Volcengine Ark (Seedream/Seedance), OpenAI gpt-image-2.
☆155Apr 30, 2026Updated 2 months ago
SWivid / F5-TTS
View on GitHub
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
☆15,004Updated this week
coqui-ai / TTS
View on GitHub
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
☆45,809Aug 16, 2024Updated last year
supertone-inc / supertonic
View on GitHub
Lightning-Fast, On-Device, Multilingual TTS — running natively via ONNX.
☆13,512Updated this week
k2-fsa / OmniVoice
View on GitHub
High-Quality Voice Cloning TTS for 600+ Languages
☆8,466Updated this week
m-bain / whisperX
View on GitHub
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
☆23,252Jul 13, 2026Updated last week
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
QwenAudio / CosyVoice
View on GitHub
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
☆22,400May 25, 2026Updated 2 months ago
huggingface / parler-tts
View on GitHub
Inference and training library for high-quality TTS models.
☆5,581Dec 10, 2024Updated last year
kyutai-labs / delayed-streams-modeling
View on GitHub
Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.
☆2,984Jan 26, 2026Updated 6 months ago
nari-labs / dia
View on GitHub
A TTS model capable of generating ultra-realistic dialogue in one pass.
☆19,358Nov 19, 2025Updated 8 months ago
microsoft / VibeVoice
View on GitHub
Open-Source Frontier Voice AI
☆50,493Updated this week
vibevoice-community / VibeVoice
View on GitHub
VibeVoice: Expressive, longform conversational speech synthesis. (Community fork)
☆1,146Jun 12, 2026Updated last month
Blaizzy / mlx-audio
View on GitHub
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speec…
☆7,618Updated this week