KittenTTS is an ultra-lightweight, CPU-friendly text-to-speech model with 15M params for real-time, high-quality voices. Open source, fast start. πΊ
β24Mar 16, 2026Updated this week
Alternatives and similar repositories for KittenTTS
Users that are interested in KittenTTS are comparing it to the libraries listed below
Sorting:
- KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using aβ¦β12Mar 24, 2023Updated 2 years ago
- Kokoro Language Model Training Script for Russian (Ruslan Corpus)β39Updated this week
- β11Apr 1, 2025Updated 11 months ago
- GUI Library for AutoIt, based on windows API.β12Jun 16, 2023Updated 2 years ago
- β28Nov 15, 2023Updated 2 years ago
- Toy example on how to build a unit selection TTS in Spanishβ11May 10, 2019Updated 6 years ago
- Z-Image, Flux2 Klein, & SeedVR2 with a Gradio UI. Uses a built-in ComfyUI backend for speed and efficiency! [8GB+VRAM, 16GB+ RAM]β41Mar 7, 2026Updated last week
- A Python library and CLI tool to do automatic syllabification of Spanish wordsβ15Sep 12, 2025Updated 6 months ago
- β15Sep 4, 2024Updated last year
- Bagel but with Gradio Interfaceβ20May 21, 2025Updated 9 months ago
- An MCP server providing intelligent transcript processing capabilities, featuring natural formatting, contextual repair, and smart summarβ¦β19Mar 14, 2025Updated last year
- All in one Gradio interface for chatterboxβ19May 31, 2025Updated 9 months ago
- β14Apr 8, 2025Updated 11 months ago
- β13Nov 24, 2025Updated 3 months ago
- Simple GUI for Amphion Vevoβ14May 4, 2025Updated 10 months ago
- Openfst mirror with some fixesβ14Aug 23, 2024Updated last year
- π UI/UX context detection engineβ12Jan 3, 2021Updated 5 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcriptsβ16Dec 3, 2024Updated last year
- A sub project of AIStoryBuilders.comβ14Jan 28, 2024Updated 2 years ago
- Write your next novel faster and easierβ15Dec 7, 2025Updated 3 months ago
- Export an ONNX graph that performs ISTFT. Designed for TTS models.β28Apr 23, 2024Updated last year
- Official PyTorch implementation of (ICME2025 oral) "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-β¦β16Feb 1, 2026Updated last month
- Code that accompanies the PyData New York (2022) talk: Addressing the sensitivity of Large language modelsβ13Nov 7, 2022Updated 3 years ago
- A windows pinokio script for roop-unleashed Unsure if it works on other OSβ10Jan 16, 2026Updated 2 months ago
- β14Aug 19, 2024Updated last year
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPAβ18Aug 16, 2024Updated last year
- β12Apr 16, 2024Updated last year
- Using OpenVINO to speed up MeloTTS inferenceβ15Nov 1, 2024Updated last year
- Enable AI agents to create and execute their own tools in Rust autonomously. Available as web API and MCP servicesβ81Aug 28, 2025Updated 6 months ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"β21Jun 7, 2025Updated 9 months ago
- Unofficial implementation of ConvNeXt-TTS powered by lightningβ18Oct 20, 2024Updated last year
- Forced alignment decoder for Whisper.β14Mar 13, 2024Updated 2 years ago
- StyleTTS2 + Vocos as a Decoderβ13Mar 24, 2025Updated 11 months ago
- β17Nov 25, 2025Updated 3 months ago
- β18Dec 4, 2025Updated 3 months ago
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++β18Apr 17, 2024Updated last year
- β21Apr 15, 2025Updated 11 months ago
- β11May 5, 2022Updated 3 years ago
- (NVIDIA) FramePack is a next-frame (next-frame-section) prediction neural network structure that generates videos progressively.β21Dec 18, 2025Updated 3 months ago