rgzn-aiyun / tacotron2-melgan
Mel spectrum based on tacotron2 for melgan speech synthesis
☆15Mar 24, 2023Updated 2 years ago
Alternatives and similar repositories for tacotron2-melgan
Users that are interested in tacotron2-melgan are comparing it to the libraries listed below
- Real-time MelGAN on CPU☆13Dec 3, 2019Updated 6 years ago
- A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)☆14May 19, 2021Updated 4 years ago
- ☆15May 8, 2021Updated 4 years ago
- ICASSP 2021 accepted papers in terms of voice conversion (VC)☆18Apr 11, 2021Updated 4 years ago
- PyTorch implementation of A Neural Algorithm of Artistic Style☆10Dec 20, 2019Updated 6 years ago
- CCMusic, an open Chinese music database, integrates diverse datasets. It ensures data consistency via cleaning, label refinement and stru…☆26Oct 31, 2025Updated 3 months ago
- An adaptation of NVIDIA's PyTorch Tacotron2 with unsupervised Global Style Tokens. The model has been trained with the English read-speech LJ…☆10Sep 4, 2023Updated 2 years ago
- Lightweight speaker anonymization [IEEE SLT2021]☆27Jun 6, 2022Updated 3 years ago
- Using VAEs to do clustering for classification☆11Nov 5, 2017Updated 8 years ago
- ☆31Nov 7, 2018Updated 7 years ago
- Source code and demo for INTERSPEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆37Dec 5, 2023Updated 2 years ago
- Implementation of the Rhythm Formant Analysis methodology for identifying speech rhythms and rhythm variation in the low frequency spectr…☆17Apr 27, 2023Updated 2 years ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms☆18Oct 8, 2023Updated 2 years ago
- Follows NVIDIA's implementation, simplifies it, and supports data parallelism.☆13Sep 26, 2019Updated 6 years ago
- Tensorflow Implementation of WaveGlow☆37May 4, 2020Updated 5 years ago
- code for "BEAT-ALIGNED SPECTROGRAM-TO-SEQUENCE GENERATION OF RHYTHM-GAME CHARTS" (ISMIR 2023 LBD)☆18Jan 29, 2024Updated 2 years ago
- The source code for the paper CrossSinger (asru2023)☆18Oct 12, 2023Updated 2 years ago
- ERISHA is a multilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆43Dec 17, 2020Updated 5 years ago
- ☆37May 8, 2021Updated 4 years ago
- Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/☆34Mar 17, 2023Updated 2 years ago
- Converts Mandarin Chinese pinyin notation to IPA (international phonetic alphabet) notation☆18Nov 28, 2023Updated 2 years ago
- EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System☆15Mar 31, 2019Updated 6 years ago
- Dual-Adversarial Domain Adaptation for replay spoofing detection in automatic speaker verification.☆19Oct 28, 2025Updated 3 months ago
- Bachelor's thesis carried out at Universitat Politecnica de Catalunya in partial fulfilment of the requirements for the degree in Telecommun…☆16Jul 25, 2024Updated last year
- Tacotron2 with Global Style Tokens☆65Apr 19, 2019Updated 6 years ago
- (R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.☆48Sep 4, 2023Updated 2 years ago
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion☆40Oct 22, 2022Updated 3 years ago
- ☆19Feb 2, 2023Updated 3 years ago
- Framework for one-shot multispeaker system based on Deep Learning☆19May 30, 2021Updated 4 years ago
- WaveNet PyTorch implementation for text-to-speech☆18Jul 19, 2023Updated 2 years ago
- ☆17Aug 27, 2025Updated 5 months ago
- This paper has been accepted in ACM ICMR 2021.☆20Nov 17, 2025Updated 3 months ago
- [ICASSP 2026] Task Vector in TTS: Toward Emotionally Expressive Dialectal Speech Synthesis☆36Dec 24, 2025Updated last month
- Compressed version of Tacotron 2 using Tensor Train + Waveglow.☆22Dec 26, 2019Updated 6 years ago
- **ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degrada…☆24Sep 27, 2022Updated 3 years ago
- VAE Tacotron 2, an alternative to GST Tacotron☆90Jul 6, 2023Updated 2 years ago
- ☆21Sep 24, 2018Updated 7 years ago
- Joint Dictionary Learning-based Non-Negative Matrix Factorization for Voice Conversion (TBME 2016)☆22Oct 14, 2017Updated 8 years ago
- Generating drum loops using the Wave-U-Net conditioned on intuitive parameters.☆24Nov 19, 2020Updated 5 years ago