resemble-ai / phonemizerLinks
Simple text to phonemes converter for multiple languages
β20Updated 2 years ago
Alternatives and similar repositories for phonemizer
Users that are interested in phonemizer are comparing it to the libraries listed below
Sorting:
- πΈTTS recipes for different datasetsβ86Updated 2 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.β102Updated 2 years ago
- β76Updated 3 years ago
- A simple voice conversion toolβ17Updated 3 years ago
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with theβ¦β47Updated 2 years ago
- Code for AccentDB.β22Updated 4 years ago
- Interface for Controllable Expressive Talking Machineβ38Updated last year
- Simple PyTorch Denoisers for Waveform Audioβ35Updated 2 months ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Modelβ107Updated 3 years ago
- Feature extractor for DL speech processing.β66Updated 3 years ago
- Emotive Speech generation based on DAVID: An open-source platform for real-time emotional speech transformation using pysoxβ13Updated 7 years ago
- Tools to create your own voice dataset for TTS trainingβ67Updated 4 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transferβ37Updated 3 years ago
- Implementation of MelNet in PyTorch to generate high-fidelity audio samplesβ24Updated 4 years ago
- Sound examples for the Neural Parametric Singing Synthesizer (NPSS)β22Updated 3 years ago
- Finally, some decent sample sentencesβ23Updated last year
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation suppβ¦β48Updated last year
- Prosodic Speech Segmentation with Transformersβ25Updated last year
- follow NVIDIA, simplify it and support data parallel.β13Updated 5 years ago
- β32Updated 3 years ago
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.β31Updated 2 years ago
- Luigi pipeline to download VoxCeleb(2) audio from YouTube and extract speaker segmentsβ43Updated 4 years ago
- Forced Alignments for Common Voiceβ31Updated 4 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.β54Updated 2 years ago
- Real-time Speech Separation, Noise Suppression & Speaker Recognitionβ18Updated 6 years ago
- PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesisβ69Updated 3 years ago
- Emotion detection in audio utilising self-supervised representations trained with Contrastive Predictive Coding (CPC).β43Updated 3 years ago
- NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsamplingβ37Updated 4 years ago
- asr2kβ51Updated last year
- Official PyTorch implementation of TTS Style Transferβ24Updated 3 years ago