siddiquelatif / URDU-Dataset
Urdu Language Speech Emotional Corpus
☆45Updated 6 years ago
Alternatives and similar repositories for URDU-Dataset:
Users that are interested in URDU-Dataset are comparing it to the libraries listed below
- Implementation of the paper "Improved End-to-End Speech Emotion Recognition Using Self Attention Mechanism and Multitask Learning" From I…☆57Updated 4 years ago
- Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'☆129Updated 2 months ago
- This repository contains code to replicate results from the ICASSP 2020 paper "StarGAN for Emotional Speech Conversion: Validated by Data…☆132Updated 3 years ago
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…☆89Updated 4 years ago
- A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.☆136Updated 5 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆75Updated 3 years ago
- ☆28Updated 2 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆60Updated 2 years ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆39Updated 4 years ago
- ☆27Updated 3 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆65Updated 3 years ago
- ☆49Updated 3 years ago
- A unified dataset of multilingual emotional human utterances☆25Updated 3 years ago
- Implementation of Multi speaker TTS☆51Updated 4 years ago
- ☆106Updated 2 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)☆81Updated 3 years ago
- This is the code for controllable EVC framework for seen and unseen emotion generation.☆42Updated 3 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆59Updated 4 years ago
- Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)☆142Updated last year
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆57Updated 2 years ago
- This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage se…☆84Updated 2 years ago
- [INTERSPEECH'2022] Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning☆81Updated 2 years ago
- ☆50Updated last year
- Discriminative Neural Clustering for Speaker Diarisation☆78Updated 2 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated 2 years ago
- Emotional Speech Conversion using Style Transfer and MUNIT☆33Updated 5 years ago
- A lightweight library to compute Diarization Error Rate (DER).☆59Updated last year
- Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/☆34Updated 2 years ago
- End-to-End Mispronunciation Detection via wav2vec2.0☆43Updated 3 years ago
- Text Independent Speaker Verification Using GE2E Loss☆84Updated 6 years ago