aldragan0 / voice-recognition
Voice-based gender, age and language recognition.
☆40Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for voice-recognition
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆64Updated 3 years ago
- Building a Deep learning model that predicts the gender of a speaker using TensorFlow 2☆111Updated last year
- [ICASSP2021] Data preperation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech…☆55Updated 4 years ago
- Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'☆126Updated 2 years ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆64Updated 2 years ago
- An implementation of Speech Emotion Recognition, based on HuBERT model, training with PyTorch and HuggingFace framework, and fine-tuning …☆32Updated 2 years ago
- ☆46Updated 11 months ago
- an Audio-Visual Voice Activity Detection using Deep Learning☆48Updated 5 years ago
- A unified dataset of multilingual emotional human utterances☆23Updated 2 years ago
- ☆98Updated 2 years ago
- Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances☆48Updated 2 years ago
- Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised …☆128Updated 10 months ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆39Updated 2 years ago
- A wrapper for Audeering's wav2vec-based dimensional speech emotion recognition☆16Updated last year
- A random forest classifier to predict the age-group and gender of a speaker from voice measurements.☆16Updated 5 years ago
- Goodness of Pronunciation (GOP) for oral reading assessment.☆46Updated 3 years ago
- ☆43Updated last year
- ☆27Updated 2 years ago
- target speaker extraction and verification for multi-talker speech☆166Updated 3 years ago
- Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable an…☆46Updated 6 years ago
- ☆45Updated 3 years ago
- Implementation of the paper "Improved End-to-End Speech Emotion Recognition Using Self Attention Mechanism and Multitask Learning" From I…☆58Updated 3 years ago
- Experiments on speech recognition robustness to accents and dialects☆12Updated 5 years ago
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN☆89Updated 3 years ago
- ☆59Updated 4 years ago
- Research code for the paper "Fine-tuning wav2vec2 for speaker recognition" found at https://arxiv.org/abs/2109.15053☆143Updated 2 years ago
- Unofficial implementation of ECAPA-TDNN☆27Updated 3 years ago
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv…☆126Updated 3 weeks ago
- ☆59Updated 2 months ago
- Speech Emotion Recognition using transfer learning with wav2vec on IEMOCAP.☆15Updated 3 years ago