kyutai-labs/pocket-tts

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/kyutai-labs/pocket-tts)

kyutai-labs / pocket-tts

A TTS that fits in your CPU (and pocket)

☆7,882

Alternatives and similar repositories for pocket-tts

Users that are interested in pocket-tts are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

supertone-inc / supertonic
View on GitHub
Lightning-Fast, On-Device, Multilingual TTS — running natively via ONNX.
☆13,512Updated this week
resemble-ai / chatterbox
View on GitHub
SoTA open-source TTS
☆25,697Updated this week
KittenML / KittenTTS
View on GitHub
State-of-the-art TTS model under 25MB 😻
☆15,226Jun 11, 2026Updated last month
neuphonic / neutts
View on GitHub
On-device TTS model by Neuphonic
☆6,199Updated this week
microsoft / VibeVoice
View on GitHub
Open-Source Frontier Voice AI
☆50,493Updated this week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
jamiepine / voicebox
View on GitHub
The open-source AI voice studio. Clone, dictate, create.
☆46,736Updated this week
kyutai-labs / delayed-streams-modeling
View on GitHub
Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.
☆2,984Jan 26, 2026Updated 6 months ago
ysharma3501 / LuxTTS
View on GitHub
A high-quality rapid TTS voice cloning model that reaches speeds of 150x realtime.
☆4,852Jun 5, 2026Updated last month
Zackriya-Solutions / meetily
View on GitHub
Privacy first, AI meeting assistant with 4x faster Parakeet/Whisper live transcription, speaker diarization, and Ollama summarization bui…
☆26,641Jun 5, 2026Updated last month
moonshine-ai / moonshine
View on GitHub
Very low latency speech to text, intent recognition, and text to speech, for building voice agents and interfaces
☆10,435Updated this week
QwenLM / Qwen3-TTS
View on GitHub
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streamin…
☆12,595Mar 17, 2026Updated 4 months ago
kyutai-labs / unmute
View on GitHub
Make text LLMs listen and speak
☆1,374Jul 16, 2026Updated last week
NVIDIA / personaplex
View on GitHub
PersonaPlex code.
☆10,262Mar 2, 2026Updated 4 months ago
hexgrad / kokoro
View on GitHub
https://hf.co/hexgrad/Kokoro-82M
☆8,122Aug 6, 2025Updated 11 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
k2-fsa / OmniVoice
View on GitHub
High-Quality Voice Cloning TTS for 600+ Languages
☆8,466Updated this week
RyanCodrai / turbovec
View on GitHub
A vector index built on TurboQuant, written in Rust with Python bindings
☆14,284Updated this week
alibaba / zvec
View on GitHub
A lightweight, lightning-fast, in-process vector database
☆15,266Updated this week
fishaudio / fish-speech
View on GitHub
SOTA Open Source TTS
☆31,373Updated this week
OpenBMB / VoxCPM
View on GitHub
VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning
☆34,190Jul 8, 2026Updated 2 weeks ago
OpenMOSS / MOSS-TTS
View on GitHub
MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high‑fi…
☆3,901Jun 22, 2026Updated last month
ysharma3501 / NovaSR
View on GitHub
A lightning fast audio upsampler.
☆775Feb 26, 2026Updated 4 months ago
ekwek1 / soprano
View on GitHub
Soprano: Instant, Ultra-Realistic Text-to-Speech
☆1,374Jan 15, 2026Updated 6 months ago
aaif-goose / goose
View on GitHub
an open source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with any LLM
☆51,671Updated this week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
cjpais / Handy
View on GitHub
A free, open source, and extensible speech-to-text application that works completely offline.
☆27,466Updated this week
unslothai / unsloth
View on GitHub
Unsloth is a local UI for training and running Gemma 4, Qwen3.6, DeepSeek, Kimi, GLM and other models.
☆68,878Updated this week
TencentCloud / CubeSandbox
View on GitHub
Instant, Concurrent, Secure & Lightweight Sandbox for AI Agents.
☆10,683Updated this week
pipecat-ai / pipecat
View on GitHub
Open Source framework for voice and multimodal conversational AI
☆13,702Updated this week
StarTrail-org / LEANN
View on GitHub
[MLsys2026]: RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on …
☆12,728Updated this week
lfnovo / open-notebook
View on GitHub
An Open Source implementation of Notebook LM with more flexibility and features
☆35,993Updated this week
AlexsJones / llmfit
View on GitHub
Hundreds of models & providers. One command to find what runs on your hardware.
☆30,618Updated this week
kyutai-labs / hibiki-zero
View on GitHub
A real-time and multilingual speech translation model
☆264Feb 13, 2026Updated 5 months ago
alibaba / page-agent
View on GitHub
JavaScript in-page GUI agent. Control web interfaces with natural language.
☆27,822Updated this week
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
run-llama / liteparse
View on GitHub
A fast, helpful, and open-source document parser
☆11,775Updated this week
lightpanda-io / browser
View on GitHub
Lightpanda: the headless browser designed for AI and automation
☆32,149Updated this week
samuel-vitorino / sopro
View on GitHub
A lightweight text-to-speech model with zero-shot voice cloning
☆877Feb 6, 2026Updated 5 months ago
ekwek1 / soprano-factory
View on GitHub
Soprano-Factory: Train your own 2000x realtime text-to-speech model
☆252Jan 13, 2026Updated 6 months ago
topoteretes / cognee
View on GitHub
Cognee is the open-source AI memory platform for agents. Give your AI agents persistent long-term memory across sessions with a self-host…
☆29,305Updated this week
D4Vinci / Scrapling
View on GitHub
🕷️ An adaptive Web Scraping framework that handles everything from a single request to a full-scale crawl!
☆71,215Updated this week
iOfficeAI / OfficeCLI
View on GitHub
OfficeCLI is the first and best Office suite purpose-built for AI agents to read, edit, and automate Word, Excel, and PowerPoint files. …
☆22,141Updated this week