resemble-ai / phonemizerLinks
Simple text to phonemes converter for multiple languages
β20Updated 3 years ago
Alternatives and similar repositories for phonemizer
Users that are interested in phonemizer are comparing it to the libraries listed below
Sorting:
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.β106Updated 2 years ago
- πΈTTS recipes for different datasetsβ86Updated 3 years ago
- β76Updated 4 years ago
- Feature extractor for DL speech processing.β66Updated 3 years ago
- automatically align transcribed audio and generate a wav2letter training corpusβ36Updated 2 years ago
- Tools to create your own voice dataset for TTS trainingβ70Updated 5 years ago
- Forced Alignments for Common Voiceβ32Updated 5 years ago
- A simple voice conversion toolβ19Updated 3 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Modelβ107Updated 4 years ago
- [deprecated] Pretrained models for pyannote-audio 1.xβ71Updated 3 years ago
- Python library for audio augmentationβ85Updated 2 years ago
- asr2kβ52Updated last year
- Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261]β26Updated 4 years ago
- An unofficial implementation of the paper "AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss".β34Updated 4 years ago
- β44Updated last year
- Luigi pipeline to download VoxCeleb(2) audio from YouTube and extract speaker segmentsβ43Updated 4 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networksβ65Updated 5 years ago
- Rescoring methods for end-to-end Automatic Speech Recognitionβ26Updated 5 years ago
- π₯ π€ The largest clinical study in the world to collect voice data labeled with health information (N>6,000 participants, 48 utterancesβ¦β32Updated 9 months ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.β27Updated last year
- Toolbox for easy and qualitative one-shot voice conversionβ46Updated 4 years ago
- Utility tools for the "Divide and Remaster" dataset, introduced as part of the Cocktail Fork problem paper: https://arxiv.org/abs/2110.09β¦β73Updated 2 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation suppβ¦β48Updated 2 years ago
- LogMMSE speech enhancement/noise reductionβ30Updated 5 years ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wiβ¦β92Updated 4 years ago
- A module for normalising text.β10Updated 6 years ago
- Python library for handling audio datasets.β138Updated 2 years ago
- Interface for Controllable Expressive Talking Machineβ40Updated 4 months ago
- π Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).β31Updated last year
- Code for AccentDB.β23Updated 4 years ago