Bartelds / neural-acoustic-distanceLinks
Code associated with the paper: Neural Representations for Modeling Variation in Speech.
☆18Updated 3 years ago
Alternatives and similar repositories for neural-acoustic-distance
Users that are interested in neural-acoustic-distance are comparing it to the libraries listed below
Sorting:
- ☆40Updated 3 years ago
- Prososdy Morph: A python library for manipulating pitch and duration in an algorithmic way, for resynthesizing speech.☆85Updated 2 years ago
- ☆44Updated 3 months ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆61Updated 2 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)☆82Updated 3 years ago
- Phoneme segmentation using pre-trained speech models☆55Updated 2 years ago
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.☆36Updated last year
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆123Updated 3 years ago
- A system works on singing voice synthesis☆79Updated 2 years ago
- These are Jupyter Notebooks to help guide people to learn how to use Praat-Parselmouth☆41Updated 3 years ago
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis☆57Updated 3 years ago
- An evaluation toolkit for voice conversion models.☆42Updated 4 years ago
- High-Fidelity Neural Phonetic Posteriorgrams☆115Updated 6 months ago
- Alignment files of LibriTTS.☆64Updated 5 years ago
- A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based …☆55Updated last week
- simple textgrid to csv converter☆26Updated 4 years ago
- A sequence-to-sequence voice conversion toolkit.☆102Updated last year
- multilingual speech aligner☆76Updated last year
- Official implementation of BVAE-TTS☆173Updated 2 years ago
- ☆111Updated 3 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆70Updated 2 years ago
- This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"☆132Updated last year
- Sylber: Syllabic Embedding Representation of Speech from Raw Audio☆65Updated 5 months ago
- Deep Articulatory Synthesis and Inversion☆52Updated last year
- Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.☆89Updated 3 years ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆43Updated 2 years ago
- ☆193Updated last year
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆169Updated 2 years ago
- Collection of scripts and utilities for reorganizing corpora to use with the Montreal Forced Aligner☆44Updated 4 years ago
- Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clusterin…☆56Updated 2 years ago