rosinality / melgan-pytorch
MelGAN and Tacotron 2 in PyTorch
☆11 · Oct 22, 2019 · Updated 6 years ago
Alternatives and similar repositories for melgan-pytorch
Users that are interested in melgan-pytorch are comparing it to the libraries listed below
- text to speech ☆10 · Mar 19, 2024 · Updated last year
- Real-time melgan based on cpu !!! ☆13 · Dec 3, 2019 · Updated 6 years ago
- unofficial pytorch implementation of HiFi-GAN with fast MISR. ☆15 · Mar 21, 2023 · Updated 2 years ago
- Implementation of the paper: StyleBERT: Text-Audio Sentiment Analysis with Bi-directional Style Enhancement ☆14 · Apr 10, 2023 · Updated 2 years ago
- ParallelWaveGAN adaptation for Mozilla TTS ☆15 · May 23, 2020 · Updated 5 years ago
- PyTorch-based implementations of short-time Fourier transform ☆15 · Jul 21, 2025 · Updated 6 months ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks ☆17 · Aug 18, 2023 · Updated 2 years ago
- This is an implementation of "Generative adversarial network-based postfilter for statistical parametric speech synthesis" ☆16 · Jun 27, 2018 · Updated 7 years ago
- WIP Tensorflow implementation of https://github.com/mozilla/TTS ☆15 · Apr 11, 2020 · Updated 5 years ago
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation ☆39 · Jul 16, 2020 · Updated 5 years ago
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion ☆40 · Oct 22, 2022 · Updated 3 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using… ☆30 · May 27, 2023 · Updated 2 years ago
- Prosody Transfer Tacotron ☆19 · May 22, 2018 · Updated 7 years ago
- End-to-end Text-to-Speech with Generative Adversarial Networks ☆20 · Feb 6, 2021 · Updated 5 years ago
- A tensorflow implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis ☆20 · Oct 23, 2019 · Updated 6 years ago
- Compressed version of Tacotron 2 using Tensor Train + Waveglow. ☆22 · Dec 26, 2019 · Updated 6 years ago
- Text to Speech Synthesis based on controllable latent representation ☆14 · Aug 30, 2019 · Updated 6 years ago
- ☆22 · Apr 4, 2023 · Updated 2 years ago
- An unofficial implementation of https://arxiv.org/abs/2005.05106 ☆49 · Mar 10, 2021 · Updated 4 years ago
- ☆45 · Dec 16, 2019 · Updated 6 years ago
- Quasi-Periodic Parallel WaveGAN Pytorch implementation ☆46 · Oct 29, 2022 · Updated 3 years ago
- Based on https://github.com/fatchord/WaveRNN ☆24 · May 3, 2020 · Updated 5 years ago
- Implementation of Multi speaker TTS ☆51 · Jan 2, 2021 · Updated 5 years ago
- This is an unofficial implementation of universal melgan according to https://arxiv.org/abs/2011.09631 ☆23 · Aug 15, 2022 · Updated 3 years ago
- Prosodic Speech Segmentation with Transformers ☆26 · Feb 25, 2024 · Updated last year
- ☆25 · Mar 12, 2022 · Updated 3 years ago
- Decoders from Kaldi using OpenFst ☆34 · Jan 29, 2026 · Updated 2 weeks ago
- ☆26 · Apr 21, 2021 · Updated 4 years ago
- TTS Text Analyzer ☆32 · Jul 20, 2023 · Updated 2 years ago
- Official implementation of "Unsupervised Pre-training for Data-Efficient Text-to-Speech on Low Resource Languages", ICASSP 2023 ☆27 · Apr 27, 2023 · Updated 2 years ago
- ncnn HiFi-GAN ☆29 · Sep 29, 2024 · Updated last year
- A PyTorch implementation of "WaveFlow: A Compact Flow-based Model for Raw Audio" (ICML 2020) ☆128 · Jul 25, 2024 · Updated last year
- A Pytorch Implementation of MelGAN ☆66 · Oct 22, 2019 · Updated 6 years ago
- ☆26 · Sep 22, 2022 · Updated 3 years ago
- ☆11 · Dec 28, 2025 · Updated last month
- Real-Time High-Fidelity Speech Synthesis without GPU ☆73 · Jul 29, 2024 · Updated last year
- A PyTorch implementation of the universal neural vocoder ☆67 · Nov 6, 2020 · Updated 5 years ago
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation ☆36 · Jan 17, 2024 · Updated 2 years ago
- ☆31 · Nov 7, 2018 · Updated 7 years ago