meelement / noise_adversarial_tacotronView external linksLinks
Reproduction of paper: Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorization
☆17Aug 15, 2019Updated 6 years ago
Alternatives and similar repositories for noise_adversarial_tacotron
Users that are interested in noise_adversarial_tacotron are comparing it to the libraries listed below
Sorting:
- ☆15May 8, 2021Updated 4 years ago
- Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/☆34Mar 17, 2023Updated 2 years ago
- List of papers about TTS / Список статей о TTS☆10Dec 16, 2017Updated 8 years ago
- The code for aishell-3 baseline acoustic model☆69Nov 30, 2020Updated 5 years ago
- unsupervised ASR (mainly phone classifier) using EODM and GAN☆12Oct 22, 2020Updated 5 years ago
- Implementation of Global Style Token Tacotron in TensorFlow2☆26Sep 28, 2020Updated 5 years ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms☆18Oct 8, 2023Updated 2 years ago
- ☆10Apr 22, 2019Updated 6 years ago
- Tensorflow Implementation of "Theory and Experiments on Vector Quantized Autoencoders"☆15Feb 27, 2019Updated 6 years ago
- Materials accompanying the paper "Phonological features for 0-shot multilingual speech synthesis"☆34Aug 11, 2020Updated 5 years ago
- Implementation code of non-parallel sequence-to-sequence VC☆248Mar 24, 2023Updated 2 years ago
- This repository contains laughter-related synthesis systems.☆13Nov 7, 2020Updated 5 years ago
- MFA acoustic model training based on Opencpop☆15Sep 23, 2022Updated 3 years ago
- Simulation of parallel synthesis with LPCNet vocoder☆14May 5, 2020Updated 5 years ago
- ☆69Mar 31, 2021Updated 4 years ago
- A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis☆114Dec 2, 2020Updated 5 years ago
- the Tensorflow version of multi-speaker TTS training with feedback constraint☆40Oct 12, 2020Updated 5 years ago
- Predict prosody labels for Chinese sentences.☆41Jul 7, 2022Updated 3 years ago
- A Pytorch implementation of StarGAN-VC2☆17Jul 28, 2020Updated 5 years ago
- GAN series for voice conversion on VCC2018 dataset☆17Aug 27, 2020Updated 5 years ago
- ICASSP 2021 accepted papers in term of voice conversion (VC)☆18Apr 11, 2021Updated 4 years ago
- A PyTorch implementation of "Robust Universal Neural Vocoding"☆238Nov 14, 2020Updated 5 years ago
- HMM, CTC, RNN-Transducer, forward-backward algorithm☆20Sep 5, 2023Updated 2 years ago
- Objective metrics used in several text-to-speech (TTS) papers.☆52Jun 17, 2025Updated 7 months ago
- VAE Tacotron 2, an alternative of GST Tacotron☆90Jul 6, 2023Updated 2 years ago
- A Survey on Neural Speech Synthesis https://arxiv.org/pdf/2106.15561.pdf☆371Nov 5, 2021Updated 4 years ago
- ☆55Jan 13, 2023Updated 3 years ago
- Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.☆87Jul 25, 2022Updated 3 years ago
- 未来杯语音赛道说话人识别的baseline☆49Apr 9, 2019Updated 6 years ago
- Geometry features for block window cover song identification (a continuation of my ISMIR 2015 paper)☆24Jul 6, 2023Updated 2 years ago
- ☆24Jul 22, 2019Updated 6 years ago
- Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.☆157Jul 2, 2021Updated 4 years ago
- Official code for Cotatron @ INTERSPEECH 2020☆214Jul 25, 2024Updated last year
- 一个开源的中文歌声合成数据集。An open-source Chinese singing synthesizing dataset.☆24Jul 13, 2019Updated 6 years ago
- voice morphing☆24May 2, 2018Updated 7 years ago
- ☆54Jul 21, 2019Updated 6 years ago
- Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllabl…☆160Jun 5, 2025Updated 8 months ago
- Talking head animation☆28Dec 8, 2023Updated 2 years ago
- Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"☆169Jul 6, 2023Updated 2 years ago