rmcpantoja / piper
A fast, local neural text to speech system
☆15Updated 4 months ago
Alternatives and similar repositories for piper
Users who are interested in piper are comparing it to the repositories listed below
- Public voice datasets used for our Text-to-Speech voices.☆34Updated 2 weeks ago
- A fast MP3 decoder for python, using minimp3☆29Updated 2 years ago
- GradioUI for TortoiseTTS voice generation☆34Updated last year
- Zero-shot real-time TTS system, fully offline, free and open source☆41Updated 2 months ago
- ☆36Updated last year
- Misc. tools/scripts that I made to use for tortoise☆21Updated 10 months ago
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆27Updated last month
- Official Repo for Chat AI☆32Updated 2 years ago
- ☆143Updated last year
- Minimalist Stable Diffusion desktop application shipped as a single executable file, written in Go (no Python).☆18Updated 2 months ago
- Run SD1.x/2.x/3.x, SDXL, and FLUX.1 on your phone device☆24Updated last week
- Audio Splitter provides a user-friendly solution for splitting audio files based on silence detection.☆16Updated 2 years ago
- ☆39Updated last year
- Fully Portable RVC: Voice Cloning Software☆22Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆62Updated last month
- C++ library for converting text to phonemes for Piper☆123Updated last year
- A collection of handy helpers for AI art generation, AI writing and other experimental tools☆52Updated 8 months ago
- Dockerized Voicecraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild☆16Updated last year
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆15Updated last year
- Neural Text to speech model that is a perfect voice for a home assistant, audiobooks or for screen readers on Linux, Mac and Windows. A f…☆36Updated last year
- ☆22Updated this week
- (WIP) A retrain of F5-TTS on permissively-licensed data☆11Updated 2 months ago
- Retrieval-based Voice Conversion (RVC) implemented with Hugging Face Transformers.☆68Updated last month
- Easily create a dataset from a list of YouTube links☆20Updated 2 years ago
- A comfyui typescript client for the bun runtime☆15Updated last month
- Tools to isolate speaker and transcribe unstructured audio clips☆11Updated 2 years ago
- On-device noise suppression powered by deep learning☆73Updated this week
- An open source real-time AI inference engine for seamless scaling☆19Updated 3 weeks ago
- Image synthesis using machine learning☆21Updated last month
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆23Updated 11 months ago