Bartelds / neural-acoustic-distance
Code associated with the paper: Neural Representations for Modeling Variation in Speech.
☆18Updated 3 years ago
Alternatives and similar repositories for neural-acoustic-distance:
Users that are interested in neural-acoustic-distance are comparing it to the libraries listed below
- ☆40Updated 3 years ago
- A list of papers for child ASR☆38Updated 5 months ago
- Phoneme segmentation using pre-trained speech models☆55Updated 2 years ago
- Prososdy Morph: A python library for manipulating pitch and duration in an algorithmic way, for resynthesizing speech.☆85Updated last year
- Alignment files of LibriTTS.☆61Updated 5 years ago
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.☆35Updated 8 months ago
- Dr.VOT is an a software package for automatic measurement of voice onset time (VOT).☆27Updated last year
- multilingual speech aligner☆72Updated last year
- Layer-wise analysis of self-supervised pre-trained speech representations☆102Updated 5 months ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)☆81Updated 3 years ago
- Confidence interval computation for evaluation in machine learning using the bootstrapping approach☆78Updated 11 months ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆60Updated 2 years ago
- Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"☆35Updated 5 years ago
- ☆64Updated last year
- Sylber: Syllabic Embedding Representation of Speech from Raw Audio☆39Updated last week
- Keras-based python framework to compute phonological posterior probabilities from audio files☆41Updated 2 years ago
- ☆25Updated last month
- Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clusterin…☆49Updated last year
- Learning and controlling the source-filter representation of speech with a variational autoencoder☆45Updated last year
- A system works on singing voice synthesis☆79Updated 2 years ago
- VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.☆49Updated 10 months ago
- Code for paper titled "Perception of prosodic variation for speech synthesis using an unsupervised discrete representation of F0" submitt…☆16Updated 4 years ago
- Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.☆88Updated 3 years ago
- Deep Articulatory Synthesis and Inversion☆47Updated last year
- ☆112Updated 2 years ago
- Materials accompanying the paper "Phonological features for 0-shot multilingual speech synthesis"☆32Updated 4 years ago
- A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK☆60Updated 3 years ago
- Implementation of the AlignTTS☆76Updated last year
- ☆47Updated 4 years ago
- ☆53Updated 4 years ago