yistLin / universal-vocoder
A PyTorch implementation of the universal neural vocoder
☆66Updated 3 years ago
Related projects: ⓘ
- ☆96Updated 3 years ago
- Alignment files of LibriTTS.☆57Updated 4 years ago
- Tacotron2 with Global Style Tokens☆61Updated 5 years ago
- An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Norm…☆114Updated 3 years ago
- A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK☆60Updated 2 years ago
- Implementation of the AlignTTS☆76Updated last year
- multilingual speech aligner☆70Updated 10 months ago
- An implementation of SkipVQVC with various settings.☆75Updated 4 years ago
- A pytroch implementation of the FB-MelGAN☆84Updated 4 years ago
- ☆72Updated last year
- Fre-GAN: Adversarial Frequency-consistent Audio Synthesis☆101Updated 3 years ago
- Speech (audio) subjective evaluation system☆37Updated 4 years ago
- Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.☆86Updated 2 years ago
- ☆45Updated 4 years ago
- ☆67Updated 3 years ago
- Official PyTorch implementation of Speaker Conditional WaveRNN☆109Updated 2 years ago
- ☆69Updated this week
- Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/☆32Updated last year
- ☆29Updated 2 years ago
- A sequence-to-sequence voice conversion toolkit.☆84Updated 2 months ago
- Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech☆92Updated last year
- This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN…☆74Updated last year
- Transcripts and segmentation for the Blizzard 2013 audiobooks also known as the Lessac or Blizzard 2013 dataset.☆43Updated 4 years ago
- VAE Tacotron 2, an alternative of GST Tacotron☆85Updated last year
- ☆52Updated 3 years ago
- Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Unit…☆73Updated last year
- An implement of GlowTTS model. Several modes are added: speaker embedding, prosody encoder(GST), and gradient reversal.☆52Updated 2 years ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆115Updated 2 years ago
- Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"☆35Updated 4 years ago
- Computes the Mel-Cepstral Distance of two WAV files based on the paper "Mel-Cepstral Distance Measure for Objective Speech Quality Assess…☆46Updated 7 months ago