Tomiinek / Blizzard2013_Segmentation
Transcripts and segmentation for the Blizzard 2013 audiobooks, also known as the Lessac or Blizzard 2013 dataset.
☆44Updated 4 years ago
Related projects
Alternatives and complementary repositories for Blizzard2013_Segmentation
- Alignment files of LibriTTS.☆59Updated 4 years ago
- Implementation of AlignTTS☆76Updated last year
- ☆25Updated 3 months ago
- Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/☆34Updated last year
- Tacotron2 with Global Style Tokens☆63Updated 5 years ago
- Objective metrics used in several text-to-speech (TTS) papers.☆46Updated 2 years ago
- Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"☆188Updated last year
- A PyTorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK☆61Updated 3 years ago
- This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN…☆74Updated 2 years ago
- ☆46Updated 4 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆60Updated last year
- multilingual speech aligner☆71Updated 11 months ago
- ☆67Updated 3 years ago
- Speech (audio) subjective evaluation system☆37Updated 4 years ago
- ☆63Updated last year
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆115Updated 2 years ago
- WaveNet autoencoders for ZeroSpeech challenge 2020☆36Updated 2 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆68Updated last year
- PyTorch Implementation of Robust and fine-grained prosody control of end-to-end speech synthesis☆41Updated 2 years ago
- Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Unit…☆73Updated last year
- Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.☆87Updated 2 years ago
- A PyTorch implementation of the FB-MelGAN☆86Updated 4 years ago
- Yin pitch estimator in PyTorch☆115Updated 2 years ago
- Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"☆35Updated 4 years ago
- ☆51Updated 5 years ago
- ☆19Updated 5 months ago
- Text to Speech Synthesis based on controllable latent representation☆14Updated 5 years ago
- A system for singing voice synthesis☆79Updated last year
- Gaussian Mixture VAE Tacotron☆53Updated last year