soldier444xd/KittenTTS

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/soldier444xd/KittenTTS)

soldier444xd / KittenTTS

KittenTTS is an ultra-lightweight, CPU-friendly text-to-speech model with 15M params for real-time, high-quality voices. Open source, fast start. 😺

☆46

Alternatives and similar repositories for KittenTTS

Users that are interested in KittenTTS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

igorshmukler / kokoro-ruslan
View on GitHub
Kokoro Language Model Training Script for Russian (Ruslan Corpus)
☆52May 10, 2026Updated last week
p0p4k / vits3_pytorch
View on GitHub
☆28Nov 15, 2023Updated 2 years ago
jdevera / pylabeador
View on GitHub
A Python library and CLI tool to do automatic syllabification of Spanish words
☆15Sep 12, 2025Updated 8 months ago
alphacep / openfst
View on GitHub
Openfst mirror with some fixes
☆15Aug 23, 2024Updated last year
cpii-cai / PunCantonese
View on GitHub
A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts
☆16Dec 3, 2024Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
mush42 / istft-onnx
View on GitHub
Export an ONNX graph that performs ISTFT. Designed for TTS models.
☆28Apr 23, 2024Updated 2 years ago
iamanigeeit / present
View on GitHub
☆14Aug 19, 2024Updated last year
lars76 / fastspeech2-clean
View on GitHub
Clean and modernized implementation of FastSpeech2/LightSpeech using IPA
☆18Aug 16, 2024Updated last year
zhaohb / MeloTTS-OV
View on GitHub
Using OpenVINO to speed up MeloTTS inference
☆15Nov 1, 2024Updated last year
AkshathRaghav / tinyspeech
View on GitHub
Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"
☆23Jun 7, 2025Updated 11 months ago
reppy4620 / convnext_tts
View on GitHub
Unofficial implementation of ConvNeXt-TTS powered by lightning
☆18Oct 20, 2024Updated last year
ArenAcikgoz / Whisper-Alignment
View on GitHub
Forced alignment decoder for Whisper.
☆16Mar 13, 2024Updated 2 years ago
5Hyeons / StyleTTS2-Vocos
View on GitHub
StyleTTS2 + Vocos as a Decoder
☆13Mar 24, 2025Updated last year
fladdict / llmermaid
View on GitHub
☆76Dec 12, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
DDATT / Vits2-onnx-cpp
View on GitHub
Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++
☆18Apr 17, 2024Updated 2 years ago
ZachB100 / Piper-Training-Guide-with-Screen-Reader
View on GitHub
A guide to help newcomers to the Piper TTS system create voices for NVDA and other screen readers down the line.
☆26Dec 5, 2023Updated 2 years ago
chomeyama / wavehax
View on GitHub
Official repository of Wavehax vocoder
☆72Dec 20, 2025Updated 5 months ago
bhatiaabhinav / RL-v2
View on GitHub
Version 3.0.0 Pytorch implementations of DQN, DDQN, DDPG, SAC, Discrete SAC. With more features :)
☆12Feb 16, 2023Updated 3 years ago
ex3ndr / supervoice-hybrid
View on GitHub
My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one
☆26Aug 5, 2024Updated last year
p1an-lin-jung / wv_tts
View on GitHub
☆19Mar 22, 2024Updated 2 years ago
Chengyuann / AutoStyle-TTS
View on GitHub
Official PyTorch implementation of (ICME2025 oral) "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-…
☆26Feb 1, 2026Updated 3 months ago
ictnlp / DST
View on GitHub
DST is a Decoder-only simultaneous machine translation model, which can conduct policy decision and translation concurrently
☆11Jun 6, 2024Updated last year
jonnor / brewing-audio-event-detection
View on GitHub
Tracking beer/wine using Audio Event Detection with Machine Learning
☆15Jun 16, 2024Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
NTIA / alignnet
View on GitHub
Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.
☆18Aug 1, 2025Updated 9 months ago
yamato0811 / streamlit-langgraph-HITL-copy-generator
View on GitHub
StreamlitとLangGraphで実装したHuman-in-the-loop広告コピー文生成アプリケーション
☆11Feb 15, 2025Updated last year
liuhuang31 / Megatts2_HierSpeechpp
View on GitHub
Megatts2 use HierSpeechpp's vocoder
☆18Dec 2, 2024Updated last year
facebookresearch / llama-hd-dataset
View on GitHub
This is a balanced dataset for English homograph disambiguation (HD), generated with Meta's Llama 2-Chat 70B model.
☆22Jan 22, 2024Updated 2 years ago
CyberAgentAILab / webcolor
View on GitHub
Official implementation of Generative Colorization of Structured Mobile Web Pages, WACV 2023.
☆22Dec 7, 2023Updated 2 years ago
wangzhaode / mnn-tts
View on GitHub
mnn tts demo.
☆19May 7, 2025Updated last year
showlab / DemoVLP
View on GitHub
[Arxiv2022] Revitalize Region Feature for Democratizing Video-Language Pre-training
☆22Mar 19, 2022Updated 4 years ago
Picovoice / text-to-speech-benchmark
View on GitHub
Text-to-Speech Benchmark
☆26Apr 2, 2026Updated last month
wangzhaode / mnn-asr
View on GitHub
mnn asr demo.
☆27Mar 24, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
fakerybakery / OpenF5-TTS
View on GitHub
(WIP) A retrain of F5-TTS on permissively-licensed data
☆14Apr 6, 2025Updated last year
JosefAlbers / e2tts-mlx
View on GitHub
Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX
☆29Oct 15, 2024Updated last year
pku-lemonade / TokenSim
View on GitHub
TokenSim is a tool for simulating the behavior of large language models (LLMs) in a distributed environment.
☆22Sep 20, 2025Updated 8 months ago
jisang93 / VISinger
View on GitHub
Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC…
☆20May 12, 2023Updated 3 years ago
TatsuyaKatayama / OpenFOAMandBaysianOpt_Notebooks
View on GitHub
☆12Dec 12, 2019Updated 6 years ago
onolab-tmu / asp-tutorial-2022
View on GitHub
Ono laboratory audio signal processing exercise for beginners.
☆19May 10, 2023Updated 3 years ago
Mobile-Artificial-Intelligence / babylon
View on GitHub
Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…
☆35Apr 14, 2026Updated last month