rmcpantoja / piperLinks
A fast, local neural text to speech system
☆17Updated 8 months ago
Alternatives and similar repositories for piper
Users that are interested in piper are comparing it to the libraries listed below
Sorting:
- Public voice datasets used for our Text-to-Speech voices.☆43Updated 4 months ago
- Minimalist stable-diffusion desktop application with only one executable file writen with golang ( No python ).☆18Updated 6 months ago
- Run Stable diffusion 3 on low VRAM systems☆28Updated last year
- Dockerized Voicecraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild☆17Updated last year
- GradioUI for TortoiseTTS voice generation☆34Updated 2 years ago
- Image synthesis using machine learning☆22Updated 5 months ago
- Audio Splitter provides a user-friendly solution for splitting audio files based on silence detection.☆18Updated 2 years ago
- ☆40Updated last year
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆45Updated 5 months ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆160Updated last year
- Faster Tortoise inference then Tortoise Fast Fork☆127Updated last year
- A fast MP3 decoder for python, using minimp3☆28Updated 3 years ago
- VTuber application which only requires your voice and microphone, no need for a webcam or other tracking nonsense.☆16Updated 4 months ago
- C++ library for converting text to phonemes for Piper☆134Updated 3 months ago
- A guide to help newcomers to the Piper TTS system create voices for NVDA and other screen readers down the line.☆25Updated last year
- A quick test using a Stable Diffusion server and Godot 4☆11Updated 2 years ago
- Tools to isolate speaker and transcribe unstructured audio clips☆11Updated 2 years ago
- Quantized text-audio foundation model from Boson AI☆38Updated 2 months ago
- ☆18Updated 3 years ago
- A random walk voice style cloning application for Kokoro text to speech☆158Updated 4 months ago
- A UI for the Piper TTS☆103Updated last year
- Open Source Text-to-Speech GUI Tool running on TalkNet☆11Updated 2 years ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆15Updated last year
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Updated last month
- ☆17Updated last year
- ☆14Updated 3 years ago
- A simple framework for using a local Koboldcpp LLM to help with story-writing☆24Updated last year
- UI app for training TTS/VC machine learning models for xVASynth, with several audio pre-processing tools, and dataset creation/management…☆100Updated last year
- ☆16Updated 7 months ago
- Retrieval-based Voice Conversion (RVC) implemented with Hugging Face Transformers.☆68Updated 5 months ago