siddiquelatif / URDU-DatasetLinks
Urdu Language Speech Emotional Corpus
☆46Updated 6 years ago
Alternatives and similar repositories for URDU-Dataset
Users that are interested in URDU-Dataset are comparing it to the libraries listed below
Sorting:
- This repository contains code to replicate results from the ICASSP 2020 paper "StarGAN for Emotional Speech Conversion: Validated by Data…☆137Updated 4 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆68Updated 4 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆76Updated 4 years ago
- ☆52Updated 4 years ago
- Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)☆148Updated 2 years ago
- The official repository for Audio ALBERT☆67Updated 3 years ago
- This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage se…☆87Updated 3 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆61Updated 2 years ago
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…☆90Updated 5 years ago
- Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"☆36Updated 5 years ago
- Implementation of Multi speaker TTS☆51Updated 4 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 4 years ago
- ☆52Updated 5 years ago
- A lightweight library to compute Diarization Error Rate (DER).☆62Updated 2 years ago
- Pytorch implementation of Generalized End-to-End Loss for speaker verification☆87Updated 6 years ago
- A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.☆136Updated 5 years ago
- Transformer-based online speech recognition system with TensorFlow 2☆26Updated 4 years ago
- PyTorch implementation of RPNSD☆60Updated last year
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)☆82Updated 4 years ago
- MSP-Podcast Challenge Baseline Code☆30Updated last year
- This is the implementation of the Speaker Odyssey 2020 paper " Transforming spectrum and prosody for emotional voice conversion with non-…☆125Updated 5 years ago
- ☆30Updated 3 years ago
- A curated list of speaker-embedding speaker-verification, speaker-identification resources.☆52Updated 4 years ago
- A speaker embedding network in Pytorch that is very quick to set up and use for whatever purposes.☆90Updated 8 months ago
- This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN…☆74Updated 3 years ago
- Tacotron2 with Global Style Tokens☆65Updated 6 years ago
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y…☆25Updated 6 years ago
- Phoneme segmentation using pre-trained speech models☆55Updated 3 years ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆60Updated 3 years ago
- A wrapper for Audeering's wav2vec-based dimensional speech emotion recognition☆18Updated 2 years ago