resemble-ai / phonemizerLinks
Simple text to phonemes converter for multiple languages
☆20Updated 2 years ago
Alternatives and similar repositories for phonemizer
Users that are interested in phonemizer are comparing it to the libraries listed below
Sorting:
- ☆76Updated 4 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆107Updated 2 years ago
- 🐸TTS recipes for different datasets☆86Updated 3 years ago
- Interface for Controllable Expressive Talking Machine☆38Updated last month
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with the…☆47Updated 2 years ago
- Feature extractor for DL speech processing.☆66Updated 3 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆107Updated 4 years ago
- A simple voice conversion tool☆19Updated 3 years ago
- Tools to create your own voice dataset for TTS training☆67Updated 5 years ago
- automatically align transcribed audio and generate a wav2letter training corpus☆36Updated 2 years ago
- Toolbox for easy and qualitative one-shot voice conversion☆46Updated 3 years ago
- ☆43Updated last year
- Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)☆103Updated 2 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆48Updated 2 years ago
- Code for AccentDB.☆23Updated 4 years ago
- Implementation of MelNet in PyTorch to generate high-fidelity audio samples☆24Updated 5 years ago
- asr2k☆52Updated last year
- Luigi pipeline to download VoxCeleb(2) audio from YouTube and extract speaker segments☆43Updated 4 years ago
- Forced Alignments for Common Voice☆31Updated 4 years ago
- Non-Parallel Voice Conversion with Cyclic Variational Autoencoder☆52Updated 5 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆53Updated 2 years ago
- ☆32Updated 3 years ago
- The official implementation of the Interspeech 2021 paper WSRGlow: A Glow-based Waveform Generative Model for Audio Super-Resolution.☆126Updated 4 years ago
- Python library for audio augmentation☆84Updated 2 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆70Updated 2 years ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆23Updated last year
- One-shot TTS with Improved Unseen Speaker and Style Transfer☆37Updated 3 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆82Updated 2 years ago
- Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".☆29Updated 4 years ago
- Labeled data for homograph disambiguation☆60Updated 2 years ago