This is an unofficial implementation of universal melgan according to https://arxiv.org/abs/2011.09631
☆23Aug 15, 2022Updated 3 years ago
Alternatives and similar repositories for universalmelgan
Users that are interested in universalmelgan are comparing it to the libraries listed below
Sorting:
- ☆23Sep 14, 2021Updated 4 years ago
- Voice conversion model for real-time speech synthesis using PPG (Phonetic PosteriorGram) as an intermediate feature, written in Pytorch.☆29Mar 3, 2022Updated 3 years ago
- PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.☆73Aug 3, 2021Updated 4 years ago
- End-to-end Text-to-Speech with Generative Adversarial Networks☆20Feb 6, 2021Updated 5 years ago
- PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis☆69Aug 3, 2021Updated 4 years ago
- ☆38Apr 15, 2024Updated last year
- ☆25Mar 12, 2022Updated 3 years ago
- WaveNet auto-ancoders for ZeroSpeech challenge 2020☆37Apr 7, 2022Updated 3 years ago
- ☆16Apr 4, 2022Updated 3 years ago
- Simplified recipes for preparing commonly used speech datasets, and a PyTorch-compatible Python data loader that can perform standard fea…☆15Jun 12, 2023Updated 2 years ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms☆18Oct 8, 2023Updated 2 years ago
- TTS Text Analyzer☆32Jul 20, 2023Updated 2 years ago
- Simulation of parallel synthesis with LPCNet vocoder☆14May 5, 2020Updated 5 years ago
- A PyTorch implementation of the universal neural vocoder☆67Nov 6, 2020Updated 5 years ago
- UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation☆76Aug 30, 2021Updated 4 years ago
- ☆80Aug 8, 2025Updated 6 months ago
- ☆19Feb 2, 2023Updated 3 years ago
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion☆111Apr 1, 2024Updated last year
- LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation☆80Feb 24, 2021Updated 5 years ago
- ☆55Jan 13, 2023Updated 3 years ago
- ☆18Feb 9, 2020Updated 6 years ago
- Official PyTorch implementation of TTS Style Transfer☆25Jun 22, 2022Updated 3 years ago
- This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder.☆21Jul 21, 2021Updated 4 years ago
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration☆34Sep 24, 2021Updated 4 years ago
- ☆55Aug 11, 2022Updated 3 years ago
- Demo for 2022 ICASSP☆64Jun 14, 2022Updated 3 years ago
- torch version of LPCNet☆22Jul 8, 2020Updated 5 years ago
- A tensorflow implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis☆20Oct 23, 2019Updated 6 years ago
- PyTorch implementation for Deep Griffin-Lim Iteration paper(https://arxiv.org/abs/1903.03971)☆39Oct 12, 2019Updated 6 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆33Oct 23, 2025Updated 4 months ago
- ☆24Sep 27, 2022Updated 3 years ago
- FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)☆26Feb 22, 2024Updated 2 years ago
- Implementation for paper "Disentangled Speech Representation Learning for One-Shot Cross-Lingual Voice Conversion Using ß-VAE"☆44Apr 10, 2023Updated 2 years ago
- Based on https://github.com/fatchord/WaveRNN☆24May 3, 2020Updated 5 years ago
- Official Code for Assem-VC @ICASSP2022☆269May 16, 2022Updated 3 years ago
- ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations☆184Mar 6, 2024Updated last year
- AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data☆70Aug 31, 2021Updated 4 years ago
- Phonemes and durations labeling based on whisper small☆11Jul 7, 2024Updated last year
- Implementation for NATv2.☆23Feb 20, 2021Updated 5 years ago