KittenTTS is an ultra-lightweight, CPU-friendly text-to-speech model with 15M params for real-time, high-quality voices. Open source, fast start. πΊ
β45Apr 26, 2026Updated this week
Alternatives and similar repositories for KittenTTS
Users that are interested in KittenTTS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using aβ¦β12Mar 24, 2023Updated 3 years ago
- Kokoro Language Model Training Script for Russian (Ruslan Corpus)β49Apr 23, 2026Updated last week
- β28Nov 15, 2023Updated 2 years ago
- A Python library and CLI tool to do automatic syllabification of Spanish wordsβ15Sep 12, 2025Updated 7 months ago
- Bagel but with Gradio Interfaceβ20May 21, 2025Updated 11 months ago
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Openfst mirror with some fixesβ15Aug 23, 2024Updated last year
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcriptsβ16Dec 3, 2024Updated last year
- Export an ONNX graph that performs ISTFT. Designed for TTS models.β28Apr 23, 2024Updated 2 years ago
- β14Aug 19, 2024Updated last year
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPAβ18Aug 16, 2024Updated last year
- Using OpenVINO to speed up MeloTTS inferenceβ15Nov 1, 2024Updated last year
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"β22Jun 7, 2025Updated 10 months ago
- Unofficial implementation of ConvNeXt-TTS powered by lightningβ18Oct 20, 2024Updated last year
- Forced alignment decoder for Whisper.β15Mar 13, 2024Updated 2 years ago
- End-to-end encrypted cloud storage - Proton Drive β’ AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- StyleTTS2 + Vocos as a Decoderβ13Mar 24, 2025Updated last year
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++β18Apr 17, 2024Updated 2 years ago
- A windows pinokio script for roop-unleashed Unsure if it works on other OSβ15Jan 16, 2026Updated 3 months ago
- β18Dec 2, 2025Updated 4 months ago
- Demo of fine-tuning QA models for answering FAQ of cloud providers documentationβ11Mar 7, 2023Updated 3 years ago
- Official repository of Wavehax vocoderβ68Dec 20, 2025Updated 4 months ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into oneβ26Aug 5, 2024Updated last year
- β19Mar 22, 2024Updated 2 years ago
- Official PyTorch implementation of (ICME2025 oral) "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-β¦β25Feb 1, 2026Updated 3 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- DST is a Decoder-only simultaneous machine translation model, which can conduct policy decision and translation concurrentlyβ11Jun 6, 2024Updated last year
- Tracking beer/wine using Audio Event Detection with Machine Learningβ15Jun 16, 2024Updated last year
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.β18Aug 1, 2025Updated 9 months ago
- Megatts2 use HierSpeechpp's vocoderβ18Dec 2, 2024Updated last year
- β12Feb 16, 2026Updated 2 months ago
- Collection of experimental tools, utilities & prototypes from the DJZ/ORAGEN development labβ18Aug 25, 2024Updated last year
- NVDA advanced OCRβ19Dec 22, 2025Updated 4 months ago
- Demo of knowledge graph creation and Graph RAG with Dspy and Kuzuβ22Jun 30, 2025Updated 10 months ago
- This is a balanced dataset for English homograph disambiguation (HD), generated with Meta's Llama 2-Chat 70B model.β22Jan 22, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- mnn tts demo.β19May 7, 2025Updated 11 months ago
- Text-to-Speech Benchmarkβ24Apr 2, 2026Updated 3 weeks ago
- CUDA-accelerated video tool β split, shuffle & rejoin video segments with precise length controlβ21Dec 30, 2024Updated last year
- mnn asr demo.β26Mar 24, 2025Updated last year
- (WIP) A retrain of F5-TTS on permissively-licensed dataβ14Apr 6, 2025Updated last year
- Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLXβ29Oct 15, 2024Updated last year
- Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (ICβ¦β20May 12, 2023Updated 2 years ago