gooofy / zerovox
zero-shot realtime TTS system, fully offline, free and open source
☆23 Updated this week
Alternatives and similar repositories for zerovox:
Users who are interested in zerovox are comparing it to the libraries listed below
- ☆23 Updated 2 months ago
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion ☆34 Updated 7 months ago
- ☆12 Updated 4 months ago
- Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale ☆27 Updated last year
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one ☆27 Updated 5 months ago
- Speech enhancement in noisy and reverberant environments using deep neural networks ☆19 Updated 3 months ago
- ☆28 Updated last year
- Export an ONNX graph that performs ISTFT. Designed for TTS models. ☆21 Updated 8 months ago
- [ACMMM'2024] Generative Expressive Conversational Speech Synthesis ☆31 Updated 2 months ago
- Simple inference for Vits2 TTS using ONNXRUNTIME and espeak-ng in C++ ☆14 Updated 9 months ago
- Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation ☆11 Updated 5 months ago
- StyleTTS 2 Optimized Training Fork ☆15 Updated this week
- ☆35 Updated 3 months ago
- Just another FastSpeech 2 but cleaner code :) ☆25 Updated 6 months ago
- A robust pitch tracker using synchro-squeezed FFT and frequency domain autocorrelation ☆34 Updated last year
- An open-source Kazakh Emotional Text-to-Speech Dataset ☆27 Updated 9 months ago
- Unofficial implementation of ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech ☆15 Updated last year
- Sequence alignment methods with helpers for PyTorch. ☆24 Updated 2 years ago
- ☆24 Updated last year
- [ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion ☆36 Updated this week
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP] ☆24 Updated 2 years ago
- Unofficial implementation of WaveNeXt vocoder ☆40 Updated 4 months ago
- AudioSR-Upsampling (any -> 48kHz) ☆38 Updated 11 months ago
- ☆10 Updated 2 months ago
- Viterbi decoding in PyTorch ☆27 Updated 3 months ago
- ☆33 Updated last year
- Aligner for text-to-speech ☆15 Updated 5 months ago
- Streaming Vocos ☆19 Updated last week
- Production-ready vocoder using BigVSAN ☆11 Updated 11 months ago