Bartelds / neural-acoustic-distance
Code associated with the paper: Neural Representations for Modeling Variation in Speech.
☆17Updated 3 years ago
Alternatives and similar repositories for neural-acoustic-distance
Users who are interested in neural-acoustic-distance compare it to the libraries listed below.
- ☆54Updated 8 months ago
- ☆40Updated 3 years ago
- Prosody Morph: A Python library for manipulating pitch and duration in an algorithmic way, for resynthesizing speech.☆84Updated last week
- Phoneme segmentation using pre-trained speech models☆55Updated 3 years ago
- Predicts the level of noise and reverberation on your audiofiles☆174Updated 7 months ago
- Charsiu: A neural phonetic aligner.☆329Updated 3 years ago
- A sequence-to-sequence voice conversion toolkit.☆108Updated last year
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)☆82Updated 4 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆61Updated 2 years ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆122Updated 3 years ago
- These are Jupyter Notebooks to help guide people to learn how to use Praat-Parselmouth☆42Updated 4 years ago
- Alignment files of LibriTTS.☆67Updated 5 years ago
- Keras-based python framework to compute phonological posterior probabilities from audio files☆45Updated 3 years ago
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…☆90Updated 5 years ago
- Deep Articulatory Synthesis and Inversion☆54Updated last year
- High-Fidelity Neural Phonetic Posteriorgrams☆121Updated 11 months ago
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.☆36Updated last year
- A system for singing voice synthesis☆79Updated 3 years ago
- multilingual speech aligner☆76Updated 2 years ago
- Collection of scripts and utilities for reorganizing corpora to use with the Montreal Forced Aligner☆44Updated 4 years ago
- An evaluation toolkit for voice conversion models.☆42Updated 4 years ago
- Sylber: Syllabic Embedding Representation of Speech from Raw Audio☆71Updated 10 months ago
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆173Updated 2 years ago
- Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clusterin…☆63Updated 2 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆71Updated 3 years ago
- ☆196Updated last year
- ☆111Updated 3 years ago
- Python forced alignment☆94Updated last year
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆46Updated 2 years ago
- Keyword spotting and forced alignment in any language☆85Updated 5 months ago