gooofy / zerovoxLinks
zero-shot realtime TTS system, fully offline, free and open source
☆39Updated last month
Alternatives and similar repositories for zerovox
Users that are interested in zerovox are comparing it to the libraries listed below
Sorting:
- StyleTTS 2 Optimized Training Fork☆29Updated 3 months ago
- High quality text-to-speech based on StyleTTS 2.☆47Updated this week
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆18Updated last week
- (WIP) A retrain of F5-TTS on permissively-licensed data☆11Updated last month
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆98Updated 7 months ago
- StyleTTS2 + Vocos as a Decoder☆12Updated 2 months ago
- ☆20Updated last week
- Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model☆24Updated last month
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆62Updated last week
- Create an LJSpeech structured voice dataset on wave input☆30Updated 8 months ago
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.☆36Updated last week
- An unofficial PyTorch implementation of VALL-E☆87Updated last week
- The EveryVoice TTS Toolkit - Text To Speech for your language☆33Updated this week
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆21Updated 4 months ago
- VoiceBox neural network implementation☆108Updated 9 months ago
- Export an ONNX graph that performs ISTFT. Designed for TTS models.☆24Updated last year
- create dataset from list of youtube links easily☆18Updated 2 years ago
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆34Updated 11 months ago
- Lyra V2 (SoundStream) running in the browser☆18Updated last year
- A fast MP3 decoder for python, using minimp3☆28Updated 2 years ago
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,…☆72Updated 7 months ago
- ☆29Updated last year
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated 2 years ago
- Misc. tools/scripts that I made to use for tortoise☆21Updated 9 months ago
- ☆40Updated 3 months ago
- Unofficial implementation of wavenext vocoder☆46Updated 9 months ago
- SSR-Speech: Towards Stable, Safe and Robust Zero-shot Speech Editing and Synthesis☆135Updated 4 months ago
- ☆35Updated last year
- ☆50Updated 2 months ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆37Updated last week