deeplyinc / Nonverbal-Vocalization-DatasetLinks
☆41Updated 3 years ago
Alternatives and similar repositories for Nonverbal-Vocalization-Dataset
Users that are interested in Nonverbal-Vocalization-Dataset are comparing it to the libraries listed below
Sorting:
- Clustering-based methods for overlapping diarization☆82Updated last year
- Estimating the Age, Height, and Gender of a speaker with their speech signal.☆14Updated 3 years ago
- Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.☆89Updated 3 years ago
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆156Updated 3 years ago
- Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/☆34Updated 2 years ago
- VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.☆53Updated last year
- ☆32Updated last year
- Phoneme segmentation using pre-trained speech models☆55Updated 3 years ago
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis☆55Updated 4 years ago
- Objective metrics used in several text-to-speech (TTS) papers.☆52Updated 6 months ago
- S3PRL-VC: A Voice Conversion Toolkit based on S3PRL☆101Updated last year
- Alignment files of LibriTTS.☆65Updated 5 years ago
- Speech (audio) subjective evaluation system☆42Updated 5 years ago
- A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based …☆61Updated 4 months ago
- A speaker embedding network in Pytorch that is very quick to set up and use for whatever purposes.☆90Updated 8 months ago
- multilingual speech aligner☆77Updated 2 years ago
- Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Unit…☆83Updated 2 years ago
- ☆61Updated last year
- Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech (INTERSPEECH 2022)☆121Updated 2 years ago
- Implementation of the paper "Self-supervised Learning with Random-projection Quantizer for Speech Recognition" in Pytorch.☆87Updated 2 years ago
- ☆111Updated 3 years ago
- Official implementation of SpeechSplit2☆133Updated 3 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated 3 years ago
- A sequence-to-sequence voice conversion toolkit.☆106Updated last year
- ☆80Updated 4 months ago
- ☆66Updated last year
- Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E☆135Updated last year
- A python library for voice activity detection (VAD) for speech/non-speech segmentation.☆89Updated 3 years ago
- ☆69Updated last year
- ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS☆31Updated 2 years ago