kyutai-labs / pocket-tts
A TTS that fits in your CPU (and pocket)
☆3,134 · Feb 10, 2026 · Updated last week
Alternatives and similar repositories for pocket-tts
Users who are interested in pocket-tts are comparing it to the libraries listed below.
- A lightning fast audio upsampler. ☆710 · Feb 2, 2026 · Updated 2 weeks ago
- A high-quality rapid TTS voice cloning model that reaches speeds of 150x realtime. ☆766 · Updated this week
- Towards Human-Sounding Speech ☆5,944 · Dec 5, 2025 · Updated 2 months ago
- Soprano: Instant, Ultra-Realistic Text-to-Speech ☆1,177 · Jan 15, 2026 · Updated last month
- Lightning-Fast, On-Device, Multilingual TTS — running natively via ONNX. ☆2,617 · Jan 22, 2026 · Updated 3 weeks ago
- A TTS model capable of generating ultra-realistic dialogue in one pass. ☆19,109 · Nov 19, 2025 · Updated 2 months ago
- SoTA open-source TTS ☆22,571 · Feb 3, 2026 · Updated 2 weeks ago
- Human-taught Computer-use Agent Designed for Real Windows and MacOS Desktops. ☆179 · Jan 20, 2026 · Updated 3 weeks ago
- Variable Bitrate Residual Vector Quantization for Audio Coding ☆51 · May 1, 2025 · Updated 9 months ago
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models ☆6,164 · Aug 10, 2024 · Updated last year
- Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework. ☆2,842 · Jan 26, 2026 · Updated 3 weeks ago
- On-device TTS model by Neuphonic ☆4,794 · Updated this week
- Open-Source Frontier Voice AI ☆23,186 · Feb 7, 2026 · Updated last week
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆14,079 · Updated this week
- A Gradio-based web UI for voice cloning and voice design, powered by Qwen3-TTS & VibeVoice. Can use Whisper or VibeVoice-ASR for automat… ☆302 · Updated this week
- https://hf.co/hexgrad/Kokoro-82M ☆5,625 · Aug 6, 2025 · Updated 6 months ago
- Interface for OuteTTS models. ☆1,424 · Jun 21, 2025 · Updated 7 months ago
- A lightweight text-to-speech model with zero-shot voice cloning ☆788 · Feb 6, 2026 · Updated last week
- Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching ☆4,403 · Jan 4, 2026 · Updated last month
- A Conversational Speech Generation Model ☆14,491 · May 27, 2025 · Updated 8 months ago
- FlowMirror-HydraVox — A natively accelerated multi-head autoregressive TTS system derived from CosyVoice 3.0. It predicts multiple tokens… ☆38 · Feb 10, 2026 · Updated last week
- [INTERSPEECH 2025 Oral] Official code for "Accelerating Diffusion-based Text-to-Speech Model Training with Dual Modality Alignment" ☆64 · Jun 16, 2025 · Updated 8 months ago
- State-of-the-art TTS model under 25MB 😻 ☆9,610 · Feb 2, 2026 · Updated 2 weeks ago
- MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high‑fi… ☆172 · Updated this week
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open … ☆22 · Feb 7, 2026 · Updated last week