resemble-ai / phonemizer
Simple text to phonemes converter for multiple languages
★20 · Updated 3 years ago
Alternatives and similar repositories for phonemizer
Users that are interested in phonemizer are comparing it to the libraries listed below
- 🐸TTS recipes for different datasets ★86 · Updated 3 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing. ★107 · Updated 2 years ago
- ★76 · Updated 4 years ago
- Interface for Controllable Expressive Talking Machine ★39 · Updated 3 months ago
- Feature extractor for DL speech processing. ★66 · Updated 3 years ago
- Luigi pipeline to download VoxCeleb(2) audio from YouTube and extract speaker segments ★43 · Updated 4 years ago
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. A web API built with the… ★47 · Updated 3 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model ★107 · Updated 4 years ago
- A simple voice conversion tool ★19 · Updated 3 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer ★37 · Updated 3 years ago
- Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes). ★31 · Updated last year
- Neural Voice Cloning with a few voice samples, using the speaker adaptation method. Speaker adaptation is based on fine-tuning a multi-sp… ★57 · Updated 6 years ago
- Tools to create your own voice dataset for TTS training ★70 · Updated 5 years ago
- A collection of voice conversion research ★96 · Updated this week
- LogMMSE speech enhancement/noise reduction ★30 · Updated 5 years ago
- Forced Alignments for Common Voice ★32 · Updated 5 years ago
- ★32 · Updated 4 years ago
- Toolbox for easy and qualitative one-shot voice conversion ★46 · Updated 4 years ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems ★19 · Updated 4 years ago
- Finally, some decent sample sentences ★23 · Updated 2 years ago
- Singing voice detection ★15 · Updated 7 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing ★71 · Updated 3 years ago
- PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis ★69 · Updated 4 years ago
- Python library for audio augmentation ★85 · Updated 2 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp… ★48 · Updated 2 years ago
- Emotive Speech generation based on DAVID: An open-source platform for real-time emotional speech transformation using pysox ★13 · Updated 7 years ago
- A PyTorch implementation of the paper: "LaSAFT: Latent Source Attentive Frequency Transformation for Conditioned Source Separation" (ICAS… ★86 · Updated 3 years ago
- Emotion detection in audio utilising self-supervised representations trained with Contrastive Predictive Coding (CPC). ★43 · Updated 3 years ago
- Upsampling Artifacts in Neural Audio Synthesis: https://arxiv.org/abs/2010.14356 ★81 · Updated 4 years ago
- ★42 · Updated 3 years ago