rmcpantoja / piperLinks
A fast, local neural text to speech system
☆17Updated 11 months ago
Alternatives and similar repositories for piper
Users that are interested in piper are comparing it to the libraries listed below
Sorting:
- Public voice datasets used for our Text-to-Speech voices.☆46Updated 7 months ago
- A fast MP3 decoder for python, using minimp3☆30Updated 3 years ago
- C++ library for converting text to phonemes for Piper☆137Updated 6 months ago
- GradioUI for TortoiseTTS voice generation☆33Updated 2 years ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆161Updated last year
- an improved version of Real-time-voice-cloning☆52Updated last year
- Echo-TTS inference codebase☆84Updated last month
- ☆40Updated last year
- An experiment training a diffusion model on 32x32 pixel art characters☆38Updated 2 years ago
- Image synthesis using machine learning☆22Updated 8 months ago
- Soprano-Factory: Train your own 2000x realtime text-to-speech model☆142Updated 2 weeks ago
- Dockerized Voicecraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild☆17Updated last year
- ☆17Updated 10 months ago
- Convert phoneme codes and lexicon formats for English speech synths☆48Updated last month
- Faster Tortoise inference then Tortoise Fast Fork☆127Updated last year
- Dungeon procedural generator similar to whatabou's "One Page Dungeon"☆46Updated 3 weeks ago
- Minimalist stable-diffusion desktop application with only one executable file writen with golang ( No python ).☆18Updated 9 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆69Updated 3 months ago
- ☆100Updated last year
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Updated 4 months ago
- zero-shot realtime TTS system, fully offline, free and open source☆50Updated 9 months ago
- ☆18Updated 3 years ago
- Turn any common eBook file into an HQ Audiobook with F5-TTS (Easy Install)☆33Updated 3 weeks ago
- Run Stable diffusion 3 on low VRAM systems☆29Updated last year
- Retrieval-based Voice Conversion (RVC) implemented with Hugging Face Transformers.☆72Updated 7 months ago
- SoTA open-source TTS☆133Updated 7 months ago
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.☆257Updated last year
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆219Updated 9 months ago
- ☆20Updated 2 years ago
- Voxtral: Convert Mistral into a end2end SpeechLM. No information bottleneck, preserves prosody, learns interruptions from data. Unlike GP…☆40Updated 10 months ago