CookiePPP / pag-tacotron2

[NOT-in-Progress] PyTorch implementation of "Pre-Alignment Guided Attention for Improving Training Efficiency and Model Stability in End-to-End Speech Synthesis"

☆9

Alternatives and similar repositories for pag-tacotron2

Users who are interested in pag-tacotron2 are comparing it to the repositories listed below.

iisys-hof / HUI-Audio-Corpus-German
This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…
☆31 Updated 2 years ago
prml-lab-speech-team / demo
☆25 Updated 10 months ago
CSTR-Edinburgh / ophelia
Sequence-to-sequence TTS based on Kyubyong's dc_tts
☆60 Updated 2 years ago
keonlee9420 / Daft-Exprt
PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis
☆57 Updated 3 years ago
BridgetteSong / ExpressiveTacotron
This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN…
☆74 Updated 2 years ago
keonlee9420 / Robust_Fine_Grained_Prosody_Control
PyTorch Implementation of Robust and fine-grained prosody control of end-to-end speech synthesis
☆41 Updated 3 years ago
xinjli / alqalign
multilingual speech aligner
☆74 Updated last year
ZackHodari / discrete_intonation
Code for paper titled "Perception of prosodic variation for speech synthesis using an unsupervised discrete representation of F0" submitt…
☆17 Updated 5 years ago
Tomiinek / Blizzard2013_Segmentation
Transcripts and segmentation for the Blizzard 2013 audiobooks also known as the Lessac or Blizzard 2013 dataset.
☆44 Updated 5 years ago
guanlongzhao / ppg-gmm
Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"
☆36 Updated 5 years ago
choiHkk / CVAEJETS
Conditional Variational Auto-Encoder with Jointly Training FastSpeech2(+Conformer) and HiFi-GAN for End to End Text to Speech
☆46 Updated 2 years ago
CODEJIN / Glow_TTS
An implementation of the GlowTTS model. Several modes are added: speaker embedding, prosody encoder (GST), and gradient reversal.
☆53 Updated 2 years ago
Dapwner / CVAE-Tacotron
☆23 Updated last year
Takaaki-Saeki / zm-text-tts
[IJCAI'23] Learning to Speak from Text for Low-Resource TTS
☆63 Updated 2 years ago
lifeiteng / NaturalSpeech2
☆33 Updated 2 years ago
hcy71o / AutoVocoder
Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing
☆70 Updated 2 years ago
choiHkk / pitch-control-vits
☆30 Updated 2 years ago
yanggeng1995 / vae_tacotron
☆51 Updated 6 years ago
jefflai108 / Unsupervised-TTS
☆42 Updated 3 years ago
b04901014 / UUVC
Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Unit…
☆80 Updated 2 years ago
Deepest-Project / AlignTTS
Implementation of AlignTTS
☆76 Updated last year
seahore / PPG-GradVC
A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis
☆44 Updated last year
lifeiteng / TTS-TextAnalyzer
TTS Text Analyzer
☆32 Updated last year
prosodylab / prosobeast-annotation-tool
☆40 Updated 3 years ago
rishikksh20 / multiband-hifigan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
☆43 Updated 4 years ago
cnlinxi / tpse_tacotron2
TPSE-GST Tacotron2
☆15 Updated 6 years ago
insunhwang89 / StyleVC
☆31 Updated 2 years ago
aixplain / tts-qa
☆63 Updated last year
CODEJIN / VITS_Diffusion
☆26 Updated 2 years ago
CSTR-Edinburgh / qualtreats
Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.
☆36 Updated last year