Bartelds / neural-acoustic-distance
Code associated with the paper: Neural Representations for Modeling Variation in Speech.
☆18Updated 3 years ago
Alternatives and similar repositories for neural-acoustic-distance
Users who are interested in neural-acoustic-distance are comparing it to the libraries listed below.
- ☆40Updated 3 years ago
- ☆33Updated 3 months ago
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.☆35Updated 10 months ago
- Phoneme segmentation using pre-trained speech models☆55Updated 2 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)☆81Updated 3 years ago
- Dr.VOT is a software package for automatic measurement of voice onset time (VOT).☆27Updated last year
- Layer-wise analysis of self-supervised pre-trained speech representations☆103Updated 7 months ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆60Updated 2 years ago
- multilingual speech aligner☆74Updated last year
- An evaluation toolkit for voice conversion models.☆42Updated 3 years ago
- Alignment files of LibriTTS.☆61Updated 5 years ago
- Transcripts and segmentation for the Blizzard 2013 audiobooks also known as the Lessac or Blizzard 2013 dataset.☆44Updated 5 years ago
- ☆29Updated 3 years ago
- ☆64Updated last year
- Keras-based python framework to compute phonological posterior probabilities from audio files☆43Updated 2 years ago
- A list of papers for child ASR☆40Updated 7 months ago
- Confidence interval computation for evaluation in machine learning using the bootstrapping approach☆82Updated last year
- High-Fidelity Neural Phonetic Posteriorgrams☆112Updated 2 months ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆120Updated 2 years ago
- An unofficial PyTorch implementation of Mix-Phoneme-Bert☆39Updated last year
- ☆52Updated 4 years ago
- Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/☆34Updated 2 years ago
- Deep Articulatory Synthesis and Inversion☆49Updated last year
- Sylber: Syllabic Embedding Representation of Speech from Raw Audio☆55Updated 2 months ago
- ☆25Updated 9 months ago
- Learning and controlling the source-filter representation of speech with a variational autoencoder☆45Updated 2 years ago
- Prosody Morph: A Python library for manipulating pitch and duration in an algorithmic way, for resynthesizing speech.☆85Updated last year
- Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"☆36Updated 5 years ago
- A system works on singing voice synthesis☆79Updated 2 years ago
- ☆17Updated 2 years ago