prosodylab / prosobeast-annotation-tool
☆40Updated 2 years ago
Related projects: ⓘ
- multilingual speech aligner☆70Updated 10 months ago
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.☆30Updated 2 months ago
- Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"☆35Updated 4 years ago
- A list of papers for child ASR☆24Updated 5 months ago
- Code for paper titled "Perception of prosodic variation for speech synthesis using an unsupervised discrete representation of F0" submitt…☆16Updated 4 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆60Updated last year
- A system works on singing voice synthesis☆78Updated last year
- Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/☆32Updated last year
- Phoneme segmentation using pre-trained speech models☆49Updated last year
- Materials accompanying the paper "Phonological features for 0-shot multilingual speech synthesis"☆31Updated 4 years ago
- Official implementation of FCL-taco2: Fast, Controllable and Lightweight version of Tacotron2 @ ICASSP 2021☆39Updated 3 years ago
- Speech Parameter Estimation Using Differentiable Speech Synthesizer☆44Updated last year
- Implementation of Global Style Token Tacotron in TensorFlow2☆25Updated 3 years ago
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by …☆39Updated last year
- Speech (audio) subjective evaluation system☆37Updated 4 years ago
- Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet☆61Updated 3 years ago
- A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK☆60Updated 2 years ago
- Pytorch implementation of "f0-consistent many-to-many non-parallel voice conversion via conditional autoencoder"☆28Updated 3 years ago
- Confidence interval computation for evaluation in machine learning using the bootstrapping approach☆65Updated 5 months ago
- Yin pitch estimator in PyTorch☆113Updated last year
- Implementation of the AlignTTS☆76Updated last year
- Objective metrics used in several text-to-speech (TTS) papers.☆46Updated 2 years ago
- ☆45Updated 4 years ago
- Transcripts and segmentation for the Blizzard 2013 audiobooks also known as the Lessac or Blizzard 2013 dataset.☆43Updated 4 years ago
- ☆54Updated 3 years ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆57Updated last year
- A Python-based modular toolbox for building Deep Neural Network models (using PyTorch) for statistical parametric speech synthesis☆23Updated 2 years ago
- VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.☆45Updated 4 months ago
- ☆52Updated 3 years ago
- Reproduction of paper: Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorizatio…☆17Updated 5 years ago