A fast, local neural text to speech system
β11,140Aug 26, 2025Updated 10 months ago
Alternatives and similar repositories for piper
Users that are interested in piper are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and productionβ45,642Aug 16, 2024Updated last year
- A multi-voice TTS system trained with an emphasis on qualityβ14,865Nov 19, 2024Updated last year
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Modelsβ6,295Aug 10, 2024Updated last year
- Port of OpenAI's Whisper model in C/C++β51,030Jun 23, 2026Updated last week
- Faster Whisper transcription with CTranslate2β23,840Nov 19, 2025Updated 7 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Fast and local neural text-to-speech engineβ4,586Jun 22, 2026Updated last week
- π Text-Prompted Generative Audio Modelβ39,172Aug 19, 2024Updated last year
- An Open Source text-to-speech system built by inverting Whisper.β4,620Dec 14, 2025Updated 6 months ago
- eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.β6,605Jun 22, 2026Updated last week
- Inference and training library for high-quality TTS models.β5,583Dec 10, 2024Updated last year
- Instant voice cloning by MIT and MyShell. Audio foundation model.β36,789Apr 19, 2025Updated last year
- LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.β47,070Jun 23, 2026Updated last week
- C++ library for converting text to phonemes for Piperβ142Jul 10, 2025Updated 11 months ago
- Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntimeβ¦β13,210Updated this week
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Converts text to speech in realtimeβ3,971May 31, 2026Updated last month
- Robust Speech Recognition via Large-Scale Weak Supervisionβ103,646Apr 15, 2026Updated 2 months ago
- Foundational model for human-like, expressive TTSβ4,202Jul 30, 2024Updated last year
- LLM inference in C/C++β118,422Updated this week
- High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.β7,518Dec 24, 2024Updated last year
- Zero-Shot Speech Editing and Text-to-Speech in the Wildβ8,501May 30, 2026Updated last month
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)β22,716Updated this week
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"β14,817May 18, 2026Updated last month
- SOTA Open Source TTSβ30,996Jun 9, 2026Updated 3 weeks ago
- Bare Metal GPUs on DigitalOcean Gradient AI β’ AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- https://hf.co/hexgrad/Kokoro-82Mβ7,683Aug 6, 2025Updated 10 months ago
- AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of advβ¦β2,394Jan 9, 2026Updated 5 months ago
- Distribute and run LLMs with a single file.β25,105Updated this week
- Open-source desktop app for local LLMs. Text, vision, tool-calling, OpenAI/Anthropic-compatible API. 100% private.β47,360Jun 2, 2026Updated 3 weeks ago
- Get up and running with Kimi-K2.6, GLM-5.1, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.β174,889Updated this week
- Silero VAD: pre-trained enterprise-grade Voice Activity Detectorβ9,425Jun 19, 2026Updated last week
- Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Nodeβ14,876Jun 4, 2026Updated 3 weeks ago
- Towards Human-Sounding Speechβ6,206Dec 5, 2025Updated 6 months ago
- Suno AI's Bark model in C/C++ for fast text-to-speech generationβ865Nov 16, 2024Updated last year
- Managed Database hosting by DigitalOcean β’ AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A TTS model capable of generating ultra-realistic dialogue in one pass.β19,323Nov 19, 2025Updated 7 months ago
- A fast local neural text to speech engine for Mycroftβ1,264Mar 25, 2025Updated last year
- Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/multiplatform CPU, AMD, NVIDIA GPU PyTorch support, handling, and auto-sβ¦β5,068Jun 18, 2026Updated last week
- Local voice recording for creating Piper datasetsβ217Feb 20, 2026Updated 4 months ago
- An open source voice assistant toolkit for many human languagesβ382Dec 26, 2023Updated 2 years ago
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...)β143,369Updated this week
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audiβ¦β10,475May 16, 2026Updated last month