jgarciapueyo / MelNet-SpeechGenerationView external linksLinks
Implementation of MelNet in PyTorch to generate high-fidelity audio samples
☆24Sep 16, 2020Updated 5 years ago
Alternatives and similar repositories for MelNet-SpeechGeneration
Users that are interested in MelNet-SpeechGeneration are comparing it to the libraries listed below
Sorting:
- PPSpeech: Phrase based Parallel End-to-End TTS System☆35Aug 31, 2020Updated 5 years ago
- ☆26Apr 21, 2021Updated 4 years ago
- Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet☆62Jun 8, 2021Updated 4 years ago
- An unofficial implementation of https://arxiv.org/abs/2005.05106☆49Mar 10, 2021Updated 4 years ago
- ☆45Dec 16, 2019Updated 6 years ago
- Real-Time High-Fidelity Speech Synthesis without GPU☆73Jul 29, 2024Updated last year
- a pytorch implementation of Google GEDLoss☆32Dec 9, 2020Updated 5 years ago
- ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆43Dec 17, 2020Updated 5 years ago
- PyTorch implementation for Deep Griffin-Lim Iteration paper(https://arxiv.org/abs/1903.03971)☆39Oct 12, 2019Updated 6 years ago
- Official PyTorch implementation of Speaker Conditional WaveRNN☆110Jun 22, 2022Updated 3 years ago
- TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis☆88Feb 23, 2021Updated 4 years ago
- A pytroch implementation of the FB-MelGAN☆90May 26, 2020Updated 5 years ago
- ☆64Aug 14, 2023Updated 2 years ago
- Implementation for NATv2.☆23Feb 20, 2021Updated 4 years ago
- ☆10Sep 17, 2021Updated 4 years ago
- Labels for kiritan_singing data with extra resources for DNN-based singing voice synthesis (SVS) systems.☆29Dec 31, 2023Updated 2 years ago
- 一个开源的中文歌声合成数据集。An open-source Chinese singing synthesizing dataset.☆24Jul 13, 2019Updated 6 years ago
- This repository provides UNOFFICIAL Bunched LPCNet implementations with Pytorch.☆14Jun 17, 2021Updated 4 years ago
- ☆49May 3, 2020Updated 5 years ago
- Deep Multi-Speech model☆11Jul 25, 2018Updated 7 years ago
- ☆11May 15, 2025Updated 9 months ago
- ☆69Mar 31, 2021Updated 4 years ago
- A Pytorch Implementation of MelNet☆26Apr 13, 2020Updated 5 years ago
- This is a template for the Non-autoregressive Deep Learning-Based TTS model (in PyTorch).☆14Jun 15, 2021Updated 4 years ago
- GSoC'16 RedHen Labs☆11Aug 22, 2016Updated 9 years ago
- bumble bee transformer☆14Apr 19, 2021Updated 4 years ago
- A repository comprising of code for generation of noisy speech data from clean data using deep learning methods☆16Jul 12, 2021Updated 4 years ago
- Implementation of Learning Bandwidth Expansion Using Perceptually-Motivated Loss (ICASSP 2019)☆11May 18, 2022Updated 3 years ago
- Built text generation models using LSTM & GPT2☆16Jul 10, 2020Updated 5 years ago
- An unofficial implement of autoregressive vocoder Multiband-WaveRNN. Audio samples in https://rongjiehuang.github.io/Multiband-WaveRNN/☆28Feb 12, 2021Updated 5 years ago
- TTS for pitch-accented language. Korean dialect DB.☆157May 12, 2023Updated 2 years ago
- A punctuation transcription model to automatically add punctuation marks in an unpunctuated sentence or sentences.☆15Aug 6, 2020Updated 5 years ago
- Accompanying code for our paper "Optimizing Short-Time Fourier Transform Parameters via Gradient Descent"☆34Oct 30, 2020Updated 5 years ago
- A Tensorflow Implementation of the FastSpeech 2: Fast and High-Quality End-to-End Text to Speech☆11Aug 12, 2020Updated 5 years ago
- Unsupervised Speech Decomposition via Triple Information Bottleneck☆14Apr 29, 2020Updated 5 years ago
- ☆13Aug 11, 2018Updated 7 years ago
- Pytorch Implementation of WaveNODE☆64Sep 4, 2020Updated 5 years ago
- This is an implementation of "Generative adversarial network-based postfilter for statistical parametric speech synthesis"☆16Jun 27, 2018Updated 7 years ago
- Speech enhancement using mimic loss☆16Oct 25, 2019Updated 6 years ago