resemble-ai / phonemizer
Simple text to phonemes converter for multiple languages
☆21Updated last year
Related projects:
- A simple voice conversion tool☆15Updated 2 years ago
- ☆56Updated this week
- Simple PyTorch Denoisers for Waveform Audio☆31Updated 4 months ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆45Updated last year
- Emotive Speech generation based on DAVID: An open-source platform for real-time emotional speech transformation using pysox☆13Updated 6 years ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆21Updated last month
- ☆31Updated 2 years ago
- Finally, some decent sample sentences☆21Updated 9 months ago
- Speaker change detection using SincNet and an LSTM/Transformer☆39Updated 2 months ago
- Lyra V2 (SoundStream) running in the browser☆17Updated last year
- Demo for 2022 ICASSP☆64Updated 2 years ago
- Interface for Controllable Expressive Talking Machine☆37Updated 8 months ago
- 🐸TTS recipes for different datasets☆84Updated 2 years ago
- ☆35Updated this week
- PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis☆66Updated 3 years ago
- Official PyTorch implementation of TTS Style Transfer☆24Updated 2 years ago
- SylNet: An Adaptable End-to-End Syllable Count Estimator for Speech☆22Updated last year
- Code for AccentDB.☆20Updated 3 years ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆27Updated 4 months ago
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. A web API built with the…☆46Updated last year
- PyTorch implementation of the paper MultiSpeech: Multi-Speaker Text to Speech with Transformer☆17Updated 2 years ago
- ☆75Updated 3 months ago
- Final training script from the Hugging Face Whisper fine-tuning event, for getting the best results from a fine-tuned model.☆12Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆74Updated 2 months ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer☆36Updated 2 years ago
- UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation☆69Updated 3 years ago
- NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling☆37Updated 3 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆67Updated last year
- A collection of voice conversion research☆90Updated this week
- Implementation of MelNet in PyTorch to generate high-fidelity audio samples☆23Updated 4 years ago