Bartelds / neural-acoustic-distance
Code associated with the paper: Neural Representations for Modeling Variation in Speech.
☆18Updated 3 years ago
Alternatives and similar repositories for neural-acoustic-distance
Users who are interested in neural-acoustic-distance are comparing it to the libraries listed below.
- ☆40Updated 3 years ago
- ☆50Updated 5 months ago
- Keras-based python framework to compute phonological posterior probabilities from audio files☆43Updated 2 years ago
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.☆36Updated last year
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆61Updated 2 years ago
- Prosody Morph: A Python library for manipulating pitch and duration in an algorithmic way, for resynthesizing speech.☆84Updated 2 years ago
- multilingual speech aligner☆77Updated last year
- Sylber: Syllabic Embedding Representation of Speech from Raw Audio☆66Updated 7 months ago
- High-Fidelity Neural Phonetic Posteriorgrams☆120Updated 8 months ago
- ☆111Updated 3 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)☆82Updated 3 years ago
- Phoneme segmentation using pre-trained speech models☆55Updated 3 years ago
- A system for singing voice synthesis☆79Updated 2 years ago
- Predicts the level of noise and reverberation on your audiofiles☆167Updated 4 months ago
- ☆193Updated last year
- Collection of scripts and utilities for reorganizing corpora to use with the Montreal Forced Aligner☆44Updated 4 years ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆122Updated 3 years ago
- Alignment files of LibriTTS.☆64Updated 5 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆70Updated 2 years ago
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis☆56Updated 4 years ago
- ☆64Updated 2 years ago
- Official implementation of Multi-Stage Multi-Codebook (MSMC) TTS☆167Updated last year
- Charsiu: A neural phonetic aligner.☆318Updated 3 years ago
- A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based …☆57Updated 2 months ago
- A PyTorch implementation of the universal neural vocoder☆67Updated 5 years ago
- ☆65Updated last year
- Layer-wise analysis of self-supervised pre-trained speech representations☆118Updated last year
- This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN…☆74Updated 3 years ago
- Official implementation of SpeechSplit2☆134Updated 3 years ago
- A sequence-to-sequence voice conversion toolkit.☆104Updated last year