resemble-ai / phonemizerLinks
Simple text to phonemes converter for multiple languages
β20Updated 2 years ago
Alternatives and similar repositories for phonemizer
Users that are interested in phonemizer are comparing it to the libraries listed below
Sorting:
- πΈTTS recipes for different datasetsβ86Updated 3 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.β107Updated 2 years ago
- β76Updated 4 years ago
- Feature extractor for DL speech processing.β66Updated 3 years ago
- Forced Alignments for Common Voiceβ31Updated 5 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Modelβ107Updated 4 years ago
- automatically align transcribed audio and generate a wav2letter training corpusβ36Updated 2 years ago
- β32Updated 3 years ago
- Tools to create your own voice dataset for TTS trainingβ68Updated 5 years ago
- Luigi pipeline to download VoxCeleb(2) audio from YouTube and extract speaker segmentsβ43Updated 4 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.β81Updated 2 years ago
- Python library for handling audio datasets.β138Updated 2 years ago
- Interface for Controllable Expressive Talking Machineβ38Updated last month
- A curated list of awesome Speech Enhancement papers, libraries, datasets, and other resources.β67Updated 6 years ago
- Toolbox for easy and qualitative one-shot voice conversionβ46Updated 3 years ago
- asr2kβ52Updated last year
- One-shot TTS with Improved Unseen Speaker and Style Transferβ37Updated 3 years ago
- Python library for audio augmentationβ84Updated 2 years ago
- Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)β103Updated 2 years ago
- Collect Voice Conversion researchesβ94Updated last week
- python wrapper for rnnoise libraryβ48Updated 2 years ago
- Labeled data for homograph disambiguationβ60Updated 2 years ago
- Code for AccentDB.β23Updated 4 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.β27Updated last year
- β56Updated 2 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation suppβ¦β48Updated 2 years ago
- β37Updated this week
- Speaker change detection using SincNet and an LSTM/Transformerβ55Updated 5 months ago
- β37Updated 4 years ago
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with theβ¦β47Updated 2 years ago