☆42Mar 25, 2022Updated 3 years ago
Alternatives and similar repositories for Unsupervised-TTS
Users that are interested in Unsupervised-TTS are comparing it to the libraries listed below
Sorting:
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆25Jul 5, 2022Updated 3 years ago
- Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.☆89Mar 5, 2022Updated 3 years ago
- ☆64May 23, 2022Updated 3 years ago
- A system works on singing voice synthesis☆79Jan 11, 2023Updated 3 years ago
- ESPnet extensions for semi-supervised end-to-end speech recognition. See also https://github.com/ShigekiKarita/espnet-semi-supervised/tre…☆38Feb 13, 2020Updated 6 years ago
- A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project g…☆146Jun 6, 2022Updated 3 years ago
- Streamlit app to visualize and edit TTS datasets☆15Dec 15, 2021Updated 4 years ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆60Oct 19, 2022Updated 3 years ago
- Any-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention☆203Nov 30, 2020Updated 5 years ago
- Official implementation of BVAE-TTS☆173Sep 26, 2022Updated 3 years ago
- Simple tool for speech dataset augmentation for modeling various prosodies.☆14Jan 14, 2021Updated 5 years ago
- visual-text to speech☆14Apr 3, 2022Updated 3 years ago
- An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-S…☆415Aug 29, 2023Updated 2 years ago
- Network specification and demo☆35Jun 5, 2017Updated 8 years ago
- ☆37May 8, 2021Updated 4 years ago
- 22人で童謡を5曲ずつ歌ってつくった歌唱データベースです。☆14Aug 7, 2022Updated 3 years ago
- Speech waveform synthesis filters☆13Jul 21, 2017Updated 8 years ago
- ☆16Apr 4, 2022Updated 3 years ago
- ☆88Nov 1, 2022Updated 3 years ago
- ☆26Apr 21, 2021Updated 4 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)☆83Nov 13, 2021Updated 4 years ago
- Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, …☆291Apr 6, 2023Updated 2 years ago
- Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion☆143Sep 1, 2020Updated 5 years ago
- FluentTTS: Text-dependent Fine-grained Style Control for Multi-style TTS☆20Nov 15, 2022Updated 3 years ago
- ☆20Jun 5, 2022Updated 3 years ago
- [SpeechCom Journal] Learning and controlling the source-filter representation of speech with a variational autoencoder☆45Apr 18, 2023Updated 2 years ago
- A PyTorch implementation of "Robust Universal Neural Vocoding"☆238Nov 14, 2020Updated 5 years ago
- This is the implementation of the Speaker Odyssey 2020 paper " Transforming spectrum and prosody for emotional voice conversion with non-…☆125Dec 14, 2020Updated 5 years ago
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration☆34Sep 24, 2021Updated 4 years ago
- Transformer-based visually grounded speech models☆19Sep 22, 2022Updated 3 years ago
- Mel-Generalized Cepstrum analysis☆20Jul 21, 2017Updated 8 years ago
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- Unofficial Pytorch Implementation of WaveGrad2☆112Aug 18, 2021Updated 4 years ago
- streaming attention networks for end-to-end automatic speech recognition☆55May 6, 2020Updated 5 years ago
- ☆17Aug 27, 2025Updated 6 months ago
- The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.☆144Jul 8, 2021Updated 4 years ago
- PyTorch Implementation of ProDiff (ACM-MM'22) with a Extremely-Fast diffusion speech synthesis pipeline☆432Apr 19, 2023Updated 2 years ago
- VCTK multi-speaker tacotron for ICASSP 2020☆266Mar 29, 2022Updated 3 years ago
- BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis☆230Jul 13, 2022Updated 3 years ago