resemble-ai / phonemizerLinks
Simple text to phonemes converter for multiple languages
β20Updated 2 years ago
Alternatives and similar repositories for phonemizer
Users that are interested in phonemizer are comparing it to the libraries listed below
Sorting:
- πΈTTS recipes for different datasetsβ86Updated 3 years ago
- β76Updated 3 years ago
- Tools to create your own voice dataset for TTS trainingβ68Updated 4 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.β101Updated 2 years ago
- Feature extractor for DL speech processing.β66Updated 3 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Modelβ107Updated 3 years ago
- Python library for audio augmentationβ84Updated 2 years ago
- Forced Alignments for Common Voiceβ31Updated 4 years ago
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with theβ¦β47Updated 2 years ago
- Code for AccentDB.β22Updated 4 years ago
- Implementation of MelNet in PyTorch to generate high-fidelity audio samplesβ24Updated 4 years ago
- A simple voice conversion toolβ18Updated 3 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation suppβ¦β48Updated 2 years ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systemsβ19Updated 4 years ago
- Collect Voice Conversion researchesβ93Updated this week
- Interface for Controllable Expressive Talking Machineβ38Updated last year
- Phoneme prediction from speech mel-spectrograms using RNN.β15Updated 6 years ago
- Neural Voice Cloning with a few voice samples, using the speaker adaptation method. Speaker adaptation is based on fine-tuning a multi-spβ¦β57Updated 6 years ago
- automatically align transcribed audio and generate a wav2letter training corpusβ36Updated 2 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transferβ37Updated 3 years ago
- A PyTorch implementation of the paper: "LaSAFT: Latent Source Attentive Frequency Transformation for Conditioned Source Separation" (ICASβ¦β86Updated 2 years ago
- Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)β103Updated 2 years ago
- β32Updated 3 years ago
- Emotion detection in audio utilising self-supervised representations trained with Contrastive Predictive Coding (CPC).β43Updated 3 years ago
- asr2kβ52Updated last year
- REPeating Pattern Extraction Technique (REPET) in Python for audio source separation: original REPET, REPET extended, adaptive REPET, REPβ¦β33Updated last year
- Luigi pipeline to download VoxCeleb(2) audio from YouTube and extract speaker segmentsβ43Updated 4 years ago
- NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsamplingβ37Updated 4 years ago
- follow NVIDIA, simplify it and support data parallel.β13Updated 5 years ago
- Official PyTorch implementation of TTS Style Transferβ24Updated 3 years ago