rmcpantoja / piper
A fast, local neural text to speech system
☆13Updated 2 months ago
Alternatives and similar repositories for piper:
Users that are interested in piper are comparing it to the libraries listed below
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Updated 6 months ago
- GradioUI for TortoiseTTS voice generation☆34Updated last year
- ☆13Updated last month
- A fast MP3 decoder for python, using minimp3☆28Updated 2 years ago
- The source code of the game I made for the HuggingFace game jam☆14Updated last year
- Streamlit app to visualize and edit TTS datasets☆14Updated 3 years ago
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆15Updated this week
- ☆39Updated last year
- An open source real-time AI inference engine for seamless scaling☆18Updated 3 weeks ago
- Turns KoboldAI into a crowdsourced distributed cluster☆33Updated last year
- Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification.☆17Updated last month
- Turn any common eBook file into an HQ Audiobook with F5-TTS (Easy Install)☆23Updated last month
- Tools to isolate speaker and transcribe unstructured audio clips☆11Updated 2 years ago
- Retrieval-based Voice Conversion (RVC) implemented with Hugging Face Transformers.☆68Updated last year
- Misc. tools/scripts that I made to use for tortoise☆21Updated 8 months ago
- A guide to help newcomers to the Piper TTS system create voices for NVDA and other screen readers down the line.☆25Updated last year
- High quality text-to-speech based on StyleTTS 2.☆39Updated this week
- ☆36Updated last year
- ☆16Updated last month
- Faster Tortoise inference then Tortoise Fast Fork☆128Updated last year
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆52Updated this week
- Open Source Text-to-Speech GUI Tool running on TalkNet☆11Updated 2 years ago
- A simple extension that uses Bark Text-to-Speech for audio output☆35Updated last year
- Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion☆13Updated 2 years ago
- Finally, some decent sample sentences☆22Updated last year
- Testbed for the fastest SD pipelines☆35Updated last year
- ☆21Updated 2 years ago
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆15Updated last year
- Run Stable diffusion 3 on low VRAM systems☆28Updated 10 months ago
- Make-A-Video Latent Diffusion Model☆18Updated last year