deeplyinc / Nonverbal-Vocalization-Dataset
☆26Updated 2 years ago
Related projects: ⓘ
- Speech (audio) subjective evaluation system☆37Updated 4 years ago
- Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/☆32Updated last year
- Transcripts and segmentation for the Blizzard 2013 audiobooks also known as the Lessac or Blizzard 2013 dataset.☆43Updated 4 years ago
- A list of papers for child ASR☆24Updated 5 months ago
- Computes the Mel-Cepstral Distance of two WAV files based on the paper "Mel-Cepstral Distance Measure for Objective Speech Quality Assess…☆46Updated 7 months ago
- Tacotron2 with Global Style Tokens☆61Updated 5 years ago
- This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN…☆74Updated 2 years ago
- S3PRL-VC: A Voice Conversion Toolkit based on S3PRL☆93Updated 2 months ago
- Pytorch implementation of "f0-consistent many-to-many non-parallel voice conversion via conditional autoencoder"☆28Updated 3 years ago
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by …☆39Updated last year
- Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Unit…☆73Updated last year
- PyTorch Implementation of Robust and fine-grained prosody control of end-to-end speech synthesis☆41Updated 2 years ago
- Voice conversion (VC) investigation using three variants of VAE☆56Updated 4 years ago
- ☆45Updated 4 years ago
- ☆25Updated last month
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated last year
- Objective metrics used in several text-to-speech (TTS) papers.☆46Updated 2 years ago
- ☆27Updated last year
- Clustering-based methods for overlapping diarization☆68Updated 8 months ago
- ☆28Updated last year
- Alignment files of LibriTTS.☆57Updated 4 years ago
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis☆56Updated 2 years ago
- A Pytorch version of LPCNet, including dump weight☆30Updated 2 years ago
- VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.☆45Updated 4 months ago
- A toolkit for any-to-any encoder-decoder voice conversion systems☆80Updated last year
- multilingual speech aligner☆70Updated 10 months ago
- Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.☆86Updated 2 years ago
- Calculation of MCD (dB) between two speech waveforms☆55Updated 3 years ago
- Training code and trained checkpoints for ASGAN.☆60Updated 8 months ago
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…☆70Updated last year