rmcpantoja / piperLinks
A fast, local neural text to speech system
β16Updated 10 months ago
Alternatives and similar repositories for piper
Users that are interested in piper are comparing it to the libraries listed below
Sorting:
- π π€ Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloningβ160Updated last year
- BeltOut: An open source pitch-perfect voice-to-voice timbre transfer model based on ChatterboxVCβ78Updated 5 months ago
- Public voice datasets used for our Text-to-Speech voices.β46Updated 6 months ago
- Faster Tortoise inference then Tortoise Fast Forkβ128Updated last year
- A highly compressive and high-quality neural audio codec for speech models.β176Updated this week
- ποΈ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets β¨β131Updated 4 months ago
- SoTA open-source TTSβ124Updated 7 months ago
- C++ library for converting text to phonemes for Piperβ137Updated 5 months ago
- Real-time end-to-end singing voice convertionβ23Updated last year
- Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification.β25Updated 9 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.β127Updated 5 months ago
- π Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. π§π₯π Advanced audio processing.β257Updated last year
- create dataset from list of youtube links easilyβ21Updated 2 years ago
- A highly optimized engine for neutts-air model to generate minutes of audio in seconds. Over 200x realtime on modern hardware!β92Updated last month
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.β46Updated 3 months ago
- A fast MP3 decoder for python, using minimp3β29Updated 3 years ago
- Open Source Text-to-Speech GUI Tool running on TalkNetβ11Updated 3 years ago
- β40Updated last year
- zero-shot realtime TTS system, fully offline, free and open sourceβ50Updated 8 months ago
- Retrieval-based Voice Conversion (RVC) implemented with Hugging Face Transformers.β71Updated 7 months ago
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedbackβ¦β10Updated 3 months ago
- Dockerized Voicecraft: Zero-Shot Speech Editing and Text-to-Speech in the Wildβ17Updated last year
- OminiControl for the GPU Poorβ39Updated 11 months ago
- A WebUI to create speech to speech with any RVC v2 trained AI voiceβ21Updated last year
- Turn any common eBook file into an HQ Audiobook with F5-TTS (Easy Install)β30Updated last week
- A lightweight, efficient variation of the StyleTTSβ―2 textβtoβspeech model.β52Updated 7 months ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.ioβ16Updated last year
- β18Updated 3 years ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.β69Updated 2 months ago
- Streaming and Fine-tuning for Chatterbox TTSβ253Updated 6 months ago