siddiquelatif / URDU-Dataset
Urdu Language Speech Emotional Corpus
☆43Updated 5 years ago
Related projects: ⓘ
- A unified dataset of multilingual emotional human utterances☆22Updated 2 years ago
- Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'☆122Updated 2 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆71Updated 2 years ago
- Implementation of the paper "Improved End-to-End Speech Emotion Recognition Using Self Attention Mechanism and Multitask Learning" From I…☆57Updated 3 years ago
- ☆26Updated 2 years ago
- A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.☆135Updated 4 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆57Updated 3 years ago
- This repository contains code to replicate results from the ICASSP 2020 paper "StarGAN for Emotional Speech Conversion: Validated by Data…☆129Updated 2 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 2 years ago
- ☆26Updated 2 years ago
- CycleGAN-based Emotion Style Transfer as Data Augmentation for Speech Emotion Recognition☆11Updated 4 years ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆63Updated 2 years ago
- Pytorch implementation of Generalized End-to-End Loss for speaker verification☆80Updated 5 years ago
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…☆84Updated 3 years ago
- ☆45Updated 3 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆60Updated last year
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆81Updated last year
- Emotional Speech Conversion using Style Transfer and MUNIT☆33Updated 5 years ago
- The official repository for Audio ALBERT☆64Updated 2 years ago
- A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN)☆56Updated 4 years ago
- Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2020☆42Updated 4 years ago
- Feature extractor for DL speech processing.☆65Updated 2 years ago
- ☆43Updated 9 months ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 3 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x☆70Updated 2 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆59Updated 3 years ago
- E2E-SincNet: Toward fully end-to-end speech recognition☆29Updated 4 years ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆57Updated last year
- Transformer-based online speech recognition system with TensorFlow 2☆25Updated 3 years ago
- Multilingual and code-switching ASR challenges for low resource Indian languages.☆20Updated 3 years ago