rmcpantoja / piperLinks
A fast, local neural text to speech system
☆16Updated 8 months ago
Alternatives and similar repositories for piper
Users that are interested in piper are comparing it to the libraries listed below
Sorting:
- Public voice datasets used for our Text-to-Speech voices.☆45Updated 5 months ago
- Minimalist stable-diffusion desktop application with only one executable file writen with golang ( No python ).☆18Updated 7 months ago
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆46Updated 6 months ago
- GradioUI for TortoiseTTS voice generation☆34Updated 2 years ago
- Turns KoboldAI into a crowdsourced distributed cluster☆31Updated 2 years ago
- Audio Splitter provides a user-friendly solution for splitting audio files based on silence detection.☆18Updated 2 years ago
- Dockerized Voicecraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild☆17Updated last year
- C++ library for converting text to phonemes for Piper☆134Updated 4 months ago
- A fast MP3 decoder for python, using minimp3☆29Updated 3 years ago
- an improved version of Real-time-voice-cloning☆52Updated last year
- Run Stable diffusion 3 on low VRAM systems☆28Updated last year
- Gradio Client in Rust.☆28Updated last year
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆160Updated last year
- Coqui AI TTS plugin☆87Updated 4 months ago
- A collection of handy helpers for AI art generation, AI writing and other experimental tools☆52Updated last year
- A Gradio setup for Tortoise TTS.☆45Updated 2 years ago
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)☆71Updated 2 years ago
- Quantized text-audio foundation model from Boson AI☆41Updated 3 months ago
- ☆17Updated 8 months ago
- Stable Diffusion in pure C/C++☆15Updated 3 weeks ago
- ☆18Updated 3 years ago
- Retrieval-based Voice Conversion (RVC) implemented with Hugging Face Transformers.☆68Updated 5 months ago
- SoTA open-source TTS☆110Updated 5 months ago
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Updated 2 months ago
- ☆40Updated last year
- On-device streaming text-to-speech engine powered by deep learning☆122Updated last week
- A random walk voice style cloning application for Kokoro text to speech☆169Updated 5 months ago
- Image synthesis using machine learning☆22Updated 6 months ago
- ☆99Updated last year
- ☆24Updated last year