nii-yamagishilab/self-attention-tacotron

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/nii-yamagishilab/self-attention-tacotron)

nii-yamagishilab / self-attention-tacotron

An implementation of "Investigation of enhanced Tacotron text-to-speech synthesis systems with self-attention for pitch accent language" https://arxiv.org/abs/1810.11960

☆114

Alternatives and similar repositories for self-attention-tacotron

Users that are interested in self-attention-tacotron are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

nii-yamagishilab / tacotron2
View on GitHub
An implementation of Tacotron and Tacotron2
☆80Aug 4, 2021Updated 4 years ago
yanggeng1995 / vae_tacotron
View on GitHub
☆51Feb 15, 2019Updated 7 years ago
BridgetteSong / ExpressiveTacotron
View on GitHub
This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN…
☆74Sep 21, 2022Updated 3 years ago
ZackHodari / average_prosody
View on GitHub
Code for paper titled "Using generative modelling to produce varied intonation for speech synthesis" submitted to the Speech Synthesis Wo…
☆24Dec 8, 2019Updated 6 years ago
ajinkyakulkarni14 / ERISHA
View on GitHub
ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…
☆44Dec 17, 2020Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
rishikksh20 / vae_tacotron2
View on GitHub
VAE Tacotron 2, an alternative of GST Tacotron
☆91Jul 6, 2023Updated 3 years ago
dipjyoti92 / SC-WaveRNN
View on GitHub
Official PyTorch implementation of Speaker Conditional WaveRNN
☆110Jun 22, 2022Updated 4 years ago
nii-yamagishilab / multi-speaker-tacotron
View on GitHub
VCTK multi-speaker tacotron for ICASSP 2020
☆266Mar 29, 2022Updated 4 years ago
LEEYOONHYUNG / BVAE-TTS
View on GitHub
Official implementation of BVAE-TTS
☆173Sep 26, 2022Updated 3 years ago
bshall / Tacotron
View on GitHub
A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis
☆115Dec 2, 2020Updated 5 years ago
cnlinxi / tpse_tacotron2
View on GitHub
TPSE-GST Tacotron2
☆14May 1, 2019Updated 7 years ago
ivanvovk / compressed-tacotron2-pytorch
View on GitHub
Compressed version of Tacotron 2 using Tensor Train + Waveglow.
☆22Dec 26, 2019Updated 6 years ago
HappyBall / tacotron
View on GitHub
tacotron for research on Chinese speech synthesis and Taiwanese speech synthesis from Chinese input text sequence with different granular…
☆25Aug 2, 2018Updated 8 years ago
AzamRabiee / Emotional-TTS
View on GitHub
Korean Emotional End-to-End Neural Speech synthesizer, ML4audio, NIPS2017
☆72Aug 22, 2019Updated 6 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
keonlee9420 / Robust_Fine_Grained_Prosody_Control
View on GitHub
PyTorch Implementation of Robust and fine-grained prosody control of end-to-end speech synthesis
☆41Feb 20, 2022Updated 4 years ago
ksw0306 / ClariNet
View on GitHub
A Pytorch Implementation of ClariNet
☆293Aug 5, 2019Updated 6 years ago
Labmem-Zhouyx / CDFSE_FastSpeech2
View on GitHub
The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…
☆86Dec 20, 2022Updated 3 years ago
syang1993 / FFTNet
View on GitHub
A PyTorch implementation of the FFTNet: a Real-Time Speaker-Dependent Neural Vocoder
☆94Jul 17, 2018Updated 8 years ago
CODEJIN / XiaoiceSing2
View on GitHub
☆19Feb 2, 2023Updated 3 years ago
ljuvela / ResGAN
View on GitHub
Core code for my ICASSP 2018 paper
☆53Jul 27, 2018Updated 8 years ago
Daisyqk / Automatic-Prosody-Annotation
View on GitHub
☆112Mar 9, 2026Updated 4 months ago
bshall / UniversalVocoding
View on GitHub
A PyTorch implementation of "Robust Universal Neural Vocoding"
☆238Nov 14, 2020Updated 5 years ago
thuhcsi / VAENAR-TTS
View on GitHub
The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.
☆144Jul 8, 2021Updated 5 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
Wendison / FCL-taco2
View on GitHub
Official implementation of FCL-taco2: Fast, Controllable and Lightweight version of Tacotron2 @ ICASSP 2021
☆41Jul 17, 2021Updated 5 years ago
jefflai108 / pytorch-kaldi-neural-speaker-embeddings
View on GitHub
A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.
☆136Jan 27, 2020Updated 6 years ago
ndkgit339 / spe-dss
View on GitHub
Speech Parameter Estimation Using Differentiable Speech Synthesizer
☆43May 9, 2023Updated 3 years ago
hhguo / MSMC-TTS
View on GitHub
Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS
☆168Apr 10, 2024Updated 2 years ago
NeuroWave-ai / CUCVAE-TTS
View on GitHub
☆25Mar 12, 2022Updated 4 years ago
andabi / parallel-wavenet-vocoder
View on GitHub
A WaveNet-based vocoder for fast inference
☆163Jun 10, 2018Updated 8 years ago
rishikksh20 / Avocodo-pytorch
View on GitHub
Avocodo: Generative Adversarial Network for Artifact-free Vocoder
☆122Jul 14, 2022Updated 4 years ago
LEEYOONHYUNG / GraphTTS
View on GitHub
☆12Jul 6, 2023Updated 3 years ago
ZackHodari / discrete_intonation
View on GitHub
Code for paper titled "Perception of prosodic variation for speech synthesis using an unsupervised discrete representation of F0" submitt…
☆17May 24, 2020Updated 6 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
NVIDIA / mellotron
View on GitHub
Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing t…
☆869Jul 22, 2023Updated 3 years ago
kan-bayashi / Taco2withBERT
View on GitHub
Tacotron2 with BERT examples
☆10Jul 8, 2019Updated 7 years ago
Deepest-Project / AlignTTS
View on GitHub
Implementation of the AlignTTS
☆77Jul 6, 2023Updated 3 years ago
janvainer / speedyspeech
View on GitHub
☆262Dec 8, 2022Updated 3 years ago
NVIDIA / radtts
View on GitHub
Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, …
☆291Apr 6, 2023Updated 3 years ago
CODEJIN / VITS_Diffusion
View on GitHub
☆26Sep 22, 2022Updated 3 years ago
sos1sos2Sixteen / aishell-3-baseline-fc
View on GitHub
The code for aishell-3 baseline acoustic model
☆70Nov 30, 2020Updated 5 years ago