gooofy / zerovoxLinks
zero-shot realtime TTS system, fully offline, free and open source
☆41Updated 4 months ago
Alternatives and similar repositories for zerovox
Users that are interested in zerovox are comparing it to the libraries listed below
Sorting:
- High quality text-to-speech based on StyleTTS 2.☆60Updated this week
- StyleTTS 2 Optimized Training Fork☆33Updated 6 months ago
- Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model☆30Updated 3 months ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆100Updated 2 weeks ago
- (WIP) A retrain of F5-TTS on permissively-licensed data☆12Updated 4 months ago
- StyleTTS2 + Vocos as a Decoder☆13Updated 5 months ago
- An unofficial PyTorch implementation of VALL-E☆88Updated 3 weeks ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆102Updated 10 months ago
- Export an ONNX graph that performs ISTFT. Designed for TTS models.☆25Updated last year
- VoiceBox neural network implementation☆109Updated last year
- a Frontier Japanese Speech Generation net☆51Updated 3 months ago
- ☆42Updated last month
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++☆18Updated last year
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆34Updated last year
- An unofficial PyTorch implementation of the StreamVC(Real-Time Low-Latency Voice Conversion)☆122Updated last year
- ☆14Updated last year
- This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.☆84Updated 9 months ago
- Official code for "F5R-TTS: Improving Flow-Matching based Text-to-Speech with Group Relative Policy Optimization"☆114Updated 2 months ago
- ☆29Updated last year
- An unofficial pytorch implementation of "STREAMVC: REAL-TIME LOW-LATENCY VOICE CONVERSION".☆72Updated 4 months ago
- VALL-E 2 reproduction☆129Updated last year
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆15Updated last year
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.☆42Updated 3 weeks ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆69Updated last year
- Implementation of Emo-StarGAN☆45Updated last year
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.☆20Updated 3 weeks ago
- Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers☆57Updated 3 months ago
- Zero-Shot Emotion Style Transfer☆49Updated 4 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆118Updated last month
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆10Updated 10 months ago