jgarciapueyo/MelNet-SpeechGeneration

Implementation of MelNet in PyTorch to generate high-fidelity audio samples

☆25

Alternatives and similar repositories for MelNet-SpeechGeneration

Users who are interested in MelNet-SpeechGeneration are comparing it to the repositories listed below.

rishikksh20 / PPSpeech
PPSpeech: Phrase based Parallel End-to-End TTS System
☆35 · Aug 31, 2020 · Updated 5 years ago
ljuvela / GELP
☆27 · Apr 21, 2021 · Updated 5 years ago
yanggeng1995 / Multi-band-WaveRNN
☆45 · Dec 16, 2019 · Updated 6 years ago
r9y9 / kiritan_singing
Labels for kiritan_singing data with extra resources for DNN-based singing voice synthesis (SVS) systems.
☆28 · Dec 31, 2023 · Updated 2 years ago
Yablon / auorange
Audio LPC (linear prediction code) using mel spectrogram, compatible with LPCNet
☆62 · Jun 8, 2021 · Updated 5 years ago
BogiHsu / WG-WaveNet
Real-Time High-Fidelity Speech Synthesis without GPU
☆73 · Jul 29, 2024 · Updated 2 years ago
AppleHolic / multiband_melgan
An unofficial implementation of https://arxiv.org/abs/2005.05106
☆50 · Mar 10, 2021 · Updated 5 years ago
dipjyoti92 / SC-WaveRNN
Official PyTorch implementation of Speaker Conditional WaveRNN
☆110 · Jun 22, 2022 · Updated 4 years ago
rishikksh20 / TFGAN
TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis
☆88 · Feb 23, 2021 · Updated 5 years ago
ViEm-ccy / GEDLoss_pytorch
A PyTorch implementation of Google's GEDLoss
☆32 · Dec 9, 2020 · Updated 5 years ago
WelkinYang / EMPHASIS-pytorch
EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System
☆15 · Mar 31, 2019 · Updated 7 years ago
nii-yamagishilab / Extended_VQVAE
☆64 · Aug 14, 2023 · Updated 2 years ago
ajinkyakulkarni14 / ERISHA
ERISHA is a multilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…
☆44 · Dec 17, 2020 · Updated 5 years ago
ttslr / python-MCD
☆49 · May 3, 2020 · Updated 6 years ago
yanggeng1995 / FB-MelGAN
A PyTorch implementation of the FB-MelGAN
☆90 · May 26, 2020 · Updated 6 years ago
RayeRen / RayeRen
☆11 · Apr 7, 2026 · Updated 3 months ago
hash2430 / pitchtron
TTS for pitch-accented language. Korean dialect DB.
☆155 · May 12, 2023 · Updated 3 years ago
genea-workshop / Speech_driven_gesture_generation_with_autoencoder
This is the official implementation for IVA '19 paper "Analyzing Input and Output Representations for Speech-Driven Gesture Generation".
☆10 · Jul 12, 2022 · Updated 4 years ago
AswinKumar1 / Forced-Alignment
GSoC'16 RedHen Labs
☆11 · Aug 22, 2016 · Updated 9 years ago
berthyf96 / bwe_fftnet
Implementation of Learning Bandwidth Expansion Using Perceptually-Motivated Loss (ICASSP 2019)
☆11 · May 18, 2022 · Updated 4 years ago
SubramaniKrishna / STFTgrad
Accompanying code for our paper "Optimizing Short-Time Fourier Transform Parameters via Gradient Descent"
☆33 · Oct 30, 2020 · Updated 5 years ago
Sytronik / deep-griffinlim-iteration
PyTorch implementation for Deep Griffin-Lim Iteration paper (https://arxiv.org/abs/1903.03971)
☆39 · Oct 12, 2019 · Updated 6 years ago
xushengyuan / VocalnetOpenDataset
An open-source Chinese singing voice synthesis dataset.
☆24 · Jul 13, 2019 · Updated 7 years ago
quinte22 / bumblebee
bumble bee transformer
☆14 · Apr 19, 2021 · Updated 5 years ago
candlewill / RawNet
RawNet: Fast End-to-End Neural Vocoder
☆43 · May 29, 2019 · Updated 7 years ago
shashankshirol / GeneratingNoisySpeechData
A repository comprising code for generating noisy speech data from clean data using deep learning methods
☆16 · Jul 12, 2021 · Updated 5 years ago
chenjiaxiang / Chinese-dataset-for-speaker-identification
☆10 · Sep 17, 2021 · Updated 4 years ago
OSU-slatelab / mimic-enhance
Speech enhancement using mimic loss
☆16 · Oct 25, 2019 · Updated 6 years ago
bajibabu / postfilt_gan
This is an implementation of "Generative adversarial network-based postfilter for statistical parametric speech synthesis"
☆16 · Jun 27, 2018 · Updated 8 years ago
auspicious3000 / SpeechSplit-Demo
Unsupervised Speech Decomposition via Triple Information Bottleneck
☆14 · Apr 29, 2020 · Updated 6 years ago
liusongxiang / efficient_tts
PyTorch implementation of "EfficientTTS: An Efficient and High-Quality Text-to-Speech Architecture"
☆116 · Dec 22, 2021 · Updated 4 years ago
deepakbaby / isegan
Improved Speech Enhancement GANs
☆13 · Jun 24, 2020 · Updated 6 years ago
cjerry1243 / Tacotron2-SpeechGesture
This is the official repository for our publication "The IVI Lab entry to the GENEA Challenge 2022 – A Tacotron2 Based Method for Co-Spee…
☆13 · May 2, 2023 · Updated 3 years ago
ricardokleinklein / deepMultiSpeech
Deep Multi-Speech model
☆11 · Jul 25, 2018 · Updated 8 years ago
BridgetteSong / BunchedLPCnet
This repository provides UNOFFICIAL Bunched LPCNet implementations in PyTorch.
☆14 · Jun 17, 2021 · Updated 5 years ago
ssarfjoo / improvedsegan
This repository is an extension of the GAN-based speech enhancement model SEGAN, and we present two modifications to make model training mor…
☆38 · Mar 24, 2023 · Updated 3 years ago
LeoniusChen / Attentions-in-Tacotron
☆69 · Mar 31, 2021 · Updated 5 years ago
Deepest-Project / MelNet
Implementation of "MelNet: A Generative Model for Audio in the Frequency Domain"
☆209 · Jul 25, 2024 · Updated 2 years ago
jaeyeun97 / MelNet
A PyTorch implementation of MelNet
☆26 · Apr 13, 2020 · Updated 6 years ago