resemble-ai / phonemizerLinks
Simple text to phonemes converter for multiple languages
β20Updated 2 years ago
Alternatives and similar repositories for phonemizer
Users that are interested in phonemizer are comparing it to the libraries listed below
Sorting:
- πΈTTS recipes for different datasetsβ86Updated 3 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.β107Updated 2 years ago
- β76Updated 3 years ago
- Tools to create your own voice dataset for TTS trainingβ68Updated 4 years ago
- Feature extractor for DL speech processing.β66Updated 3 years ago
- β32Updated 3 years ago
- Interface for Controllable Expressive Talking Machineβ38Updated 2 weeks ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Modelβ107Updated 4 years ago
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with theβ¦β47Updated 2 years ago
- Forced Alignments for Common Voiceβ31Updated 4 years ago
- Manage audio and video datasetsβ31Updated 2 months ago
- Collect Voice Conversion researchesβ94Updated last week
- Python library for audio augmentationβ84Updated 2 years ago
- Toolbox for easy and qualitative one-shot voice conversionβ46Updated 3 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation suppβ¦β48Updated 2 years ago
- NPTEL2020: Speech2Text dataset for Indian-English Accentβ77Updated 3 years ago
- Code for AccentDB.β23Updated 4 years ago
- asr2kβ52Updated last year
- An unofficial implementation of the paper "AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss".β34Updated 4 years ago
- Upsampling Artifacts in Neural Audio Synthesis β https://arxiv.org/abs/2010.14356β81Updated 4 years ago
- Rescoring methods for end-to-end Automatic Speech Recognitionβ27Updated 5 years ago
- lyrics-to-audio-alignement system. Initially done using HTK for rapid prototypingβ14Updated 7 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transferβ37Updated 3 years ago
- β42Updated 3 years ago
- NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsamplingβ37Updated 4 years ago
- Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)β103Updated 2 years ago
- Implementation of MelNet in PyTorch to generate high-fidelity audio samplesβ24Updated 5 years ago
- Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261]β27Updated 4 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.β82Updated 2 years ago
- LogMMSE speech enhancement/noise reductionβ30Updated 5 years ago