gooofy / zerovox
zero-shot realtime TTS system, fully offline, free and open source
☆34Updated 3 weeks ago
Alternatives and similar repositories for zerovox:
Users that are interested in zerovox are comparing it to the libraries listed below
- High quality text-to-speech based on StyleTTS 2.☆37Updated this week
- StyleTTS 2 Optimized Training Fork☆28Updated 3 months ago
- Open TTS models, built for streaming on the edge☆41Updated last month
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆14Updated last week
- ☆40Updated 3 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆95Updated 7 months ago
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆21Updated 3 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆62Updated last month
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated last year
- Simple PyTorch Denoisers for Waveform Audio☆35Updated 2 weeks ago
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very …☆37Updated 5 months ago
- Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model☆23Updated last week
- Real-time end-to-end singing voice convertion☆21Updated 6 months ago
- Unofficial implementation of wavenext vocoder☆44Updated 8 months ago
- An unofficial PyTorch implementation of VALL-E☆87Updated this week
- AudioSR-Upsampling (any -> 48kHz)☆40Updated last year
- ☆29Updated last year
- Trying to build an all in one speech-text language model - a bit like GPT-4o☆22Updated 11 months ago
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆34Updated 11 months ago
- ☆26Updated 6 months ago
- (WIP) A retrain of F5-TTS on permissively-licensed data☆11Updated last month
- An unofficial pytorch implementation of "STREAMVC: REAL-TIME LOW-LATENCY VOICE CONVERSION".☆67Updated 3 weeks ago
- Supervoice diffusion enhance☆26Updated 9 months ago
- Hanasu is a human-like TTS model based on the multilingual Himitsu V1 transformer-based encoder and VITS architecture☆26Updated 3 weeks ago
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++☆16Updated last year
- Zero-Shot Emotion Style Transfer☆45Updated 2 weeks ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆23Updated 9 months ago
- Export an ONNX graph that performs ISTFT. Designed for TTS models.☆24Updated last year
- The EveryVoice TTS Toolkit - Text To Speech for your language☆30Updated this week
- Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification.☆17Updated last month