This is an unofficial implementation of universal melgan according to https://arxiv.org/abs/2011.09631
☆23Aug 15, 2022Updated 3 years ago
Alternatives and similar repositories for universalmelgan
Users that are interested in universalmelgan are comparing it to the libraries listed below
Sorting:
- ☆23Sep 14, 2021Updated 4 years ago
- Voice conversion model for real-time speech synthesis using PPG (Phonetic PosteriorGram) as an intermediate feature, written in Pytorch.☆29Mar 3, 2022Updated 4 years ago
- PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.☆73Aug 3, 2021Updated 4 years ago
- Simulation of parallel synthesis with LPCNet vocoder☆14May 5, 2020Updated 5 years ago
- ☆19Jun 28, 2022Updated 3 years ago
- PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis☆69Aug 3, 2021Updated 4 years ago
- Dissimilarity Matrix and Sounds from Timbre Space Representation of a Subtractive Synthesizer (Timbre, 2020)☆12Dec 17, 2021Updated 4 years ago
- ☆16Apr 4, 2022Updated 3 years ago
- MelGAN-VC: Voice Conversion and Audio Style Transfer on arbitrarily long samples using Spectrograms☆11Nov 25, 2021Updated 4 years ago
- WaveNet auto-ancoders for ZeroSpeech challenge 2020☆37Apr 7, 2022Updated 3 years ago
- ☆39Apr 15, 2024Updated last year
- torch version of LPCNet☆22Jul 8, 2020Updated 5 years ago
- End-to-end Text-to-Speech with Generative Adversarial Networks☆20Feb 6, 2021Updated 5 years ago
- ☆55Jan 13, 2023Updated 3 years ago
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration☆34Sep 24, 2021Updated 4 years ago
- ☆25Mar 12, 2022Updated 4 years ago
- Official PyTorch implementation of TTS Style Transfer☆25Jun 22, 2022Updated 3 years ago
- UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation☆76Aug 30, 2021Updated 4 years ago
- Simplified recipes for preparing commonly used speech datasets, and a PyTorch-compatible Python data loader that can perform standard fea…☆15Jun 12, 2023Updated 2 years ago
- ☆24Jul 22, 2019Updated 6 years ago
- ☆18Feb 9, 2020Updated 6 years ago
- NASH 2021 project... this may or may not end up working 🤷♂️☆12Dec 19, 2021Updated 4 years ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms☆18Oct 8, 2023Updated 2 years ago
- AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data☆70Aug 31, 2021Updated 4 years ago
- LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation☆80Feb 24, 2021Updated 5 years ago
- ☆19Feb 2, 2023Updated 3 years ago
- TTS Text Analyzer☆31Jul 20, 2023Updated 2 years ago
- ☆55Aug 11, 2022Updated 3 years ago
- Official Code for Assem-VC @ICASSP2022☆269May 16, 2022Updated 3 years ago
- This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder.☆21Jul 21, 2021Updated 4 years ago
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion☆111Apr 1, 2024Updated last year
- A PyTorch implementation of the universal neural vocoder☆67Nov 6, 2020Updated 5 years ago
- Demo for 2022 ICASSP☆64Jun 14, 2022Updated 3 years ago
- ☆80Aug 8, 2025Updated 7 months ago
- Implementation for paper "Disentangled Speech Representation Learning for One-Shot Cross-Lingual Voice Conversion Using ß-VAE"☆44Apr 10, 2023Updated 2 years ago
- ☆23Dec 10, 2024Updated last year
- ☆13Nov 16, 2020Updated 5 years ago
- Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.☆44Nov 2, 2022Updated 3 years ago
- A PyTorch implementation of "Robust Universal Neural Vocoding"☆238Nov 14, 2020Updated 5 years ago