SuperKogito / SER-datasets
A collection of datasets for the purpose of emotion recognition/detection in speech.
☆392Updated last year
Alternatives and similar repositories for SER-datasets
Users who are interested in SER-datasets are comparing it to the libraries listed below:
- Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'☆139Updated 11 months ago
- This repository contains PyTorch implementations of 4 different models for classifying emotions in speech.☆210Updated 3 years ago
- Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition☆153Updated 4 years ago
- [ICASSP 2023] Official Tensorflow implementation of "Temporal Modeling Matters: A Novel Temporal Emotional Modeling Approach for Speech E…☆186Updated last year
- A multimodal approach on emotion recognition using audio and text.☆186Updated 5 years ago
- This is the GitHub page for publicly available emotional speech data.☆378Updated 3 years ago
- ☆111Updated 3 years ago
- The code for our INTERSPEECH 2020 paper - Jointly Fine-Tuning "BERT-like" Self Supervised Models to Improve Multimodal Speech Emotion R…☆119Updated 4 years ago
- Multilingual datasets with raw audio for speech emotion recognition☆30Updated 4 years ago
- Wav2Vec for speech recognition, classification, and audio classification☆269Updated 3 years ago
- Python package for openSMILE☆302Updated 2 months ago
- TensorFlow implementation of "Multimodal Speech Emotion Recognition using Audio and Text," IEEE SLT-18☆298Updated last year
- Speaker embedding (d-vector) trained with GE2E loss☆286Updated last year
- ☆49Updated 2 years ago
- Implementation of the paper "Spoken Language Recognition using X-vectors" in PyTorch☆105Updated 5 years ago
- Crowd Sourced Emotional Multimodal Actors Dataset (CREMA-D)☆489Updated 9 months ago
- Multi-modal Speech Emotion Recognition on IEMOCAP dataset☆93Updated 2 years ago
- Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)☆218Updated 2 years ago
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196☆320Updated 5 years ago
- End-to-End Neural Diarization☆418Updated 4 years ago
- Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"☆375Updated last year
- Light-SERNet: A lightweight fully convolutional neural network for speech emotion recognition☆80Updated 3 years ago
- Variational Bayes HMM over x-vectors diarization☆280Updated last year
- Lightweight and Interpretable ML Model for Speech Emotion Recognition and Ambiguity Resolution (trained on IEMOCAP dataset)☆436Updated 2 years ago
- ☆42Updated 5 years ago
- A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.☆238Updated last year
- Simple d-vector based Speaker Recognition (verification and identification) using PyTorch☆212Updated 5 years ago
- feature extraction from speech signals☆388Updated 6 months ago
- Transformer-based model for Speech Emotion Recognition (SER), implemented in PyTorch☆42Updated last year
- 3-D Convolutional Recurrent Neural Networks With Attention Model for Speech Emotion Recognition.☆44Updated 5 years ago