Bartelds / neural-acoustic-distance
Code associated with the paper: Neural Representations for Modeling Variation in Speech.
☆18Updated 2 years ago
Alternatives and similar repositories for neural-acoustic-distance:
Users that are interested in neural-acoustic-distance are comparing it to the libraries listed below
- ☆40Updated 3 years ago
- Sylber: Syllabic Embedding Representation of Speech from Raw Audio☆23Updated last month
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.☆34Updated 7 months ago
- Dr.VOT is an a software package for automatic measurement of voice onset time (VOT).☆27Updated last year
- ☆23Updated last week
- Confidence interval computation for evaluation in machine learning using the bootstrapping approach☆77Updated 10 months ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆60Updated 2 years ago
- Keras-based python framework to compute phonological posterior probabilities from audio files☆38Updated 2 years ago
- Alignment files of LibriTTS.☆61Updated 4 years ago
- multilingual speech aligner☆72Updated last year
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model☆31Updated last year
- Phoneme segmentation using pre-trained speech models☆55Updated 2 years ago
- Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clusterin…☆49Updated last year
- Layer-wise analysis of self-supervised pre-trained speech representations☆100Updated 3 months ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆57Updated 2 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)☆79Updated 3 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated 2 years ago
- Trainable algorithm for automatic measurement of voice onset time☆64Updated last year
- Workflow for forced alignment between languages☆17Updated 11 months ago
- A list of papers for child ASR☆36Updated 4 months ago
- This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…☆18Updated last year
- Implementation of the DIVA model of speech acquisition and production using PyTorch☆21Updated 2 years ago
- ABX discrimination task in python☆43Updated 4 months ago
- Keyword spotting and forced alignment in any language☆50Updated 7 months ago
- ☆185Updated 9 months ago
- Speech Parameter Estimation Using Differentiable Speech Synthesizer☆44Updated last year
- A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK☆61Updated 3 years ago
- Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.☆88Updated 2 years ago
- ☆64Updated last year