rgzn-aiyun/tacotron2-melgan

rgzn-aiyun / tacotron2-melgan

Mel spectrograms from Tacotron2 for MelGAN speech synthesis

☆15

Alternatives and similar repositories for tacotron2-melgan

Users interested in tacotron2-melgan are comparing it to the repositories listed below.

boltomli / tacotron
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
☆14 · May 19, 2021 · Updated 5 years ago
rgzn-aiyun / melgan-cpu
Real-time MelGAN on CPU
☆13 · Dec 3, 2019 · Updated 6 years ago
rhoposit / icassp2021
☆15 · May 8, 2021 · Updated 5 years ago
xinshengwang / ICASSP2021_paper_list-VC
ICASSP 2021 accepted papers on voice conversion (VC)
☆18 · Apr 11, 2021 · Updated 5 years ago
dhgrs / pytorch-UniWaveNet
☆31 · Nov 7, 2018 · Updated 7 years ago
achintyagopal / VAE-Clustering
Using VAEs to do clustering for classification
☆11 · Nov 5, 2017 · Updated 8 years ago
CODEJIN / XiaoiceSing2
☆19 · Feb 2, 2023 · Updated 3 years ago
sarulab-speech / lightweight_spkr_anon
Lightweight speaker anonymization [IEEE SLT2021]
☆27 · Jun 6, 2022 · Updated 4 years ago
xcmyz / Tacotron2-Pytorch
Follows NVIDIA's implementation, simplifies it, and supports data parallelism.
☆13 · Sep 26, 2019 · Updated 6 years ago
Connum / npm-pinyin2ipa
Converts Mandarin Chinese pinyin notation to IPA (International Phonetic Alphabet) notation
☆19 · Nov 28, 2023 · Updated 2 years ago
AlexK-PL / GST_Tacotron2
An adaptation of NVIDIA's PyTorch Tacotron2 with unsupervised Global Style Tokens. The model has been trained with the English read-speech LJ…
☆10 · Sep 4, 2023 · Updated 2 years ago
cjerry1243 / TransferLearning-CLVC
Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion
☆40 · Oct 22, 2022 · Updated 3 years ago
WelkinYang / EMPHASIS-pytorch
EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System
☆15 · Mar 31, 2019 · Updated 7 years ago
jinhan / tacotron2-gst
Tacotron2 with Global Style Tokens
☆64 · Apr 19, 2019 · Updated 7 years ago
karchkha / MelSpec_GPT_VQVAE
Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms
☆18 · Oct 8, 2023 · Updated 2 years ago
adasegroup / OSM-one-shot-multispeaker
Framework for one-shot multispeaker system based on Deep Learning
☆19 · May 30, 2021 · Updated 5 years ago
rishikksh20 / vae_tacotron2
VAE Tacotron 2, an alternative to GST Tacotron
☆91 · Jul 6, 2023 · Updated 3 years ago
zengchang233 / CrossSinger
The source code for the paper CrossSinger (ASRU 2023)
☆18 · Oct 12, 2023 · Updated 2 years ago
rhoposit / multilingual_VQVAE
☆37 · May 8, 2021 · Updated 5 years ago
ajinkyakulkarni14 / ERISHA
ERISHA is a multilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…
☆44 · Dec 17, 2020 · Updated 5 years ago
MiscellaneousStuff / PhoneLM
(R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.
☆48 · Sep 4, 2023 · Updated 2 years ago
atomicoo / tacotron2-mandarin
TensorFlow implementation of Chinese/Mandarin TTS (Text-to-Speech) based on the Tacotron-2 model.
☆132 · Jul 6, 2023 · Updated 3 years ago
weixsong / WaveGlow
TensorFlow implementation of WaveGlow
☆37 · May 4, 2020 · Updated 6 years ago
dafyddg / RFA
Implementation of the Rhythm Formant Analysis methodology for identifying speech rhythms and rhythm variation in the low frequency spectr…
☆17 · Apr 27, 2023 · Updated 3 years ago
PeiChunChang / MS-SincResNet
This paper has been accepted at ACM ICMR 2021.
☆20 · Nov 17, 2025 · Updated 8 months ago
JiJiJiang / ASV-Anti-Spoofing-DADA
Dual-Adversarial Domain Adaptation for replay spoofing detection in automatic speaker verification.
☆19 · Jul 17, 2026 · Updated last week
WangHelin1997 / DuTa-VC
Source code and demo for INTERSPEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…
☆38 · Dec 5, 2023 · Updated 2 years ago
candlewill / CNTN
ChiNese Text Normalization (CNTN) tool for text-to-speech systems
☆37 · Apr 12, 2018 · Updated 8 years ago
thuhcsi / icassp2021-emotion-tts
Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/
☆34 · Mar 17, 2023 · Updated 3 years ago
evinpinar / wavenet_pytorch
WaveNet PyTorch implementation for text-to-speech
☆19 · Jul 19, 2023 · Updated 3 years ago
safwankdb / Neural-Style-Transfer
PyTorch implementation of A Neural Algorithm of Artistic Style
☆10 · Dec 20, 2019 · Updated 6 years ago
LAION-AI / scaled-echo-tts
Scaled diffusion transformer for text-to-speech synthesis (DiT + T5Gemma2 conditioning, TorchTitan & Megatron backends, tested up to 1024…
☆24 · Mar 29, 2026 · Updated 4 months ago
m-toman / tacorn
TTS framework integrating state-of-the-art open source methods (2018/2019)
☆48 · Jun 9, 2026 · Updated last month
EMRAI / emrai-synthetic-diarization-corpus
☆22 · Sep 24, 2018 · Updated 7 years ago
HuangCongQing / AlgorithmsAndDataStructure
Java algorithms and data structures: code exercises and practice
☆14 · Jan 5, 2023 · Updated 3 years ago
nc-ai / speech
☆17 · Aug 27, 2025 · Updated 11 months ago
yanq / gspider
A Groovy spider.
☆12 · Jul 15, 2017 · Updated 9 years ago
wxyBUPT / sxs_spider
Scrapy-based crawler for audio websites
☆12 · Nov 11, 2016 · Updated 9 years ago
ScazLab / ros_speech2text
A ROS package that uses Google Cloud Speech to provide a speech-to-text service
☆11 · Oct 27, 2019 · Updated 6 years ago