rosinality/melgan-pytorch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/rosinality/melgan-pytorch)

rosinality / melgan-pytorch

MelGAN and Tacotron 2 in PyTorch

☆11

Alternatives and similar repositories for melgan-pytorch

Users that are interested in melgan-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

shengcanxu / canoSpeech
View on GitHub
text to speech
☆10Mar 19, 2024Updated 2 years ago
y-chan / hifi-gan-misrnet
View on GitHub
unofficial pytorch implementation of HiFi-GAN with fast MISR.
☆15Mar 21, 2023Updated 3 years ago
sushant-t / tts-trainer
View on GitHub
Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…
☆30May 27, 2023Updated 3 years ago
erogol / ParallelWaveGAN
View on GitHub
ParallelWaveGAN adaptation for Mozilla TTS
☆15May 23, 2020Updated 6 years ago
audiodemo / voice-conversion
View on GitHub
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
☆17Aug 18, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ttaoREtw / semi-tts
View on GitHub
Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation
☆39Jul 16, 2020Updated 6 years ago
Emrys365 / torch_stft
View on GitHub
PyTorch-based implementations of short-time Fourier transform
☆14Jul 21, 2025Updated last year
yanggeng1995 / Multi-band-WaveRNN
View on GitHub
☆45Dec 16, 2019Updated 6 years ago
bajibabu / postfilt_gan
View on GitHub
This is an implementation of "Generative adversarial network-based postfilter for statistical parametric speech synthesis"
☆16Jun 27, 2018Updated 8 years ago
erogol / TTS_tf
View on GitHub
WIP Tensorflow implementation of https://github.com/mozilla/TTS
☆15Apr 11, 2020Updated 6 years ago
rgzn-aiyun / melgan-cpu
View on GitHub
Real-time melgan based on cpu ！！！
☆13Dec 3, 2019Updated 6 years ago
fss1t / CausalStarGANv2-VC
View on GitHub
☆22Apr 4, 2023Updated 3 years ago
lsq960124 / StyleBERT
View on GitHub
Implementation of the paper: StyleBERT: Text-Audio Sentiment Analysis with Bi-directional Style Enhancement
☆14Apr 10, 2023Updated 3 years ago
vinay-lanka / Pitch-Shift-Algorithm
View on GitHub
Phase Vocoder and Wavelet Transform Implementation for Pitch Shifting a sound signal
☆11Jul 27, 2020Updated 5 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
hhguo / WaveRNN
View on GitHub
Based on https://github.com/fatchord/WaveRNN
☆24May 3, 2020Updated 6 years ago
zBaitu / rsfmt
View on GitHub
☆11Oct 20, 2024Updated last year
mutiann / neural-lexicon-reader
View on GitHub
Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge
☆21Jul 25, 2022Updated 4 years ago
CODEJIN / multi_speaker_tts
View on GitHub
Implementation of Multi speaker TTS
☆50Jan 2, 2021Updated 5 years ago
k2-fsa / kaldi-decoder
View on GitHub
Decoders from Kaldi using OpenFst
☆35Apr 10, 2026Updated 3 months ago
sarulab-speech / multi-speaker-dgp
View on GitHub
Official implementation of DGP-based multi-speaker speech synthesis with PyTorch
☆24Mar 23, 2021Updated 5 years ago
vliu15 / adversarial-tts
View on GitHub
End-to-end Text-to-Speech with Generative Adversarial Networks
☆20Feb 6, 2021Updated 5 years ago
cjerry1243 / TransferLearning-CLVC
View on GitHub
Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion
☆40Oct 22, 2022Updated 3 years ago
yanggeng1995 / WaveGlow
View on GitHub
A tensorflow implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis
☆20Oct 23, 2019Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
NeuroWave-ai / CUCVAE-TTS
View on GitHub
☆25Mar 12, 2022Updated 4 years ago
danilogr / gstreamwebcam
View on GitHub
Handy live video streamer using Gstreamer! (RTP on UDP + SDP file)
☆14Apr 13, 2026Updated 3 months ago
alesgenova / colormap
View on GitHub
A flexible library to map numerical values to colors
☆14Jan 6, 2023Updated 3 years ago
0xTiger / vortex
View on GitHub
A cellular automaton wasm example
☆12Feb 1, 2025Updated last year
rarefin / TTS_VAE
View on GitHub
Text to Speech Synthesis based on controllable latent representation
☆14Aug 30, 2019Updated 6 years ago
ljuvela / GELP
View on GitHub
☆27Apr 21, 2021Updated 5 years ago
fnichol / limitation
View on GitHub
Rate limiting using a fixed window counter for arbitrary keys, backed by Redis.
☆17Jun 14, 2023Updated 3 years ago
yistLin / universal-vocoder
View on GitHub
A PyTorch implementation of the universal neural vocoder
☆68Nov 6, 2020Updated 5 years ago
demerzel1 / Segmentation-generated-and-MRF
View on GitHub
This project is based on Opencv， and achieves the part of the generation of segmentation (using depth map) and image denoising using Mark…
☆11Oct 29, 2018Updated 7 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
BishopFox / burpcage
View on GitHub
☆10May 25, 2023Updated 3 years ago
avi33 / universalmelgan
View on GitHub
This is an unofficial implementation of universal melgan according to https://arxiv.org/abs/2011.09631
☆23Aug 15, 2022Updated 3 years ago
ivanvovk / compressed-tacotron2-pytorch
View on GitHub
Compressed version of Tacotron 2 using Tensor Train + Waveglow.
☆22Dec 26, 2019Updated 6 years ago
chrismcguire / gobberish
View on GitHub
Generates random utf-8 strings for fuzz t�sting character encoding probl�ms
☆11Aug 21, 2015Updated 10 years ago
touchardv / rzwaveway
View on GitHub
A Ruby library for communicating with the ZWave protocol stack from ZWay, running on the Raspberry Pi "razberry" add-on card (see http://…
☆14Oct 12, 2018Updated 7 years ago
ddimaria / rust-actix-starter
View on GitHub
A production-quality starter app using Actix 1.x
☆15Jun 14, 2023Updated 3 years ago
magicse / ncnn-hifi-GAN
View on GitHub
ncnn HiFi-GAN
☆30Sep 29, 2024Updated last year