SuperKogito / SER-datasetsLinks
A collection of datasets for the purpose of emotion recognition/detection in speech.
☆399Updated last year
Alternatives and similar repositories for SER-datasets
Users that are interested in SER-datasets are comparing it to the libraries listed below
Sorting:
- Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'☆140Updated last year
- Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition☆153Updated 4 years ago
- This repository contains PyTorch implementation of 4 different models for classification of emotions of the speech.☆211Updated 3 years ago
- This is the GitHub page for publicly available emotional speech data.☆380Updated 4 years ago
- ☆112Updated 3 years ago
- [ICASSP 2023] Official Tensorflow implementation of "Temporal Modeling Matters: A Novel Temporal Emotional Modeling Approach for Speech E…☆187Updated last year
- A multimodal approach on emotion recognition using audio and text.☆188Updated 5 years ago
- The code for our INTERSPEECH 2020 paper - Jointly Fine-Tuning "BERT-like'" Self Supervised Models to Improve Multimodal Speech Emotion R…☆119Updated 4 years ago
- Multilingual datasets with raw audio for speech emotion recognition☆30Updated 4 years ago
- Python package for openSMILE☆304Updated 2 weeks ago
- TensorFlow implementation of "Multimodal Speech Emotion Recognition using Audio and Text," IEEE SLT-18☆298Updated last year
- ☆49Updated 2 years ago
- Crowd Sourced Emotional Multimodal Actors Dataset (CREMA-D)☆497Updated 10 months ago
- Multi-modal Speech Emotion Recogniton on IEMOCAP dataset☆95Updated 2 years ago
- Variational Bayes HMM over x-vectors diarization☆283Updated 2 years ago
- Speaker embedding (d-vector) trained with GE2E loss☆286Updated 2 years ago
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196☆320Updated 5 years ago
- End-to-End Neural Diarization☆421Updated 4 years ago
- Research code for the paper "Fine-tuning wav2vec2 for speaker recognition" found at https://arxiv.org/abs/2109.15053☆146Updated 3 years ago
- Deep-Learning-Based Audio-Visual Speech Enhancement and Separation☆219Updated 2 years ago
- Wav2Vec for speech recognition, classification, and audio classification☆274Updated 3 years ago
- feature extraction from speech signals☆390Updated 7 months ago
- Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)☆220Updated 2 years ago
- Implementation of the paper "Spoken Language Recognition using X-vectors" in Pytorch☆106Updated 5 years ago
- ☆42Updated 5 years ago
- Official implement of SpeechFormer written in Python (PyTorch).☆78Updated 2 years ago
- Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"☆379Updated last year
- ☆176Updated last year
- Lightweight and Interpretable ML Model for Speech Emotion Recognition and Ambiguity Resolution (trained on IEMOCAP dataset)☆442Updated 2 years ago
- An open source dataset for source separation☆473Updated 2 years ago