CheyneyComputerScience / CREMA-D
Crowd Sourced Emotional Multimodal Actors Dataset (CREMA-D)
☆422Updated last month
Alternatives and similar repositories for CREMA-D:
Users that are interested in CREMA-D are comparing it to the libraries listed below
- This is the GitHub page for publicly available emotional speech data.☆345Updated 3 years ago
- [ICASSP 2023] Official Tensorflow implementation of "Temporal Modeling Matters: A Novel Temporal Emotional Modeling Approach for Speech E…☆171Updated 11 months ago
- A collection of datasets for the purpose of emotion recognition/detection in speech.☆328Updated 6 months ago
- Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'☆131Updated 3 months ago
- Python package for openSMILE☆275Updated 4 months ago
- Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition☆149Updated 3 years ago
- Repository with the code of the paper: A proposal for Multimodal Emotion Recognition using auraltransformers and Action Units on RAVDESS …☆104Updated last year
- TensorFlow implementation of "Multimodal Speech Emotion Recognition using Audio and Text," IEEE SLT-18☆278Updated 10 months ago
- A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.☆227Updated last year
- The code for our INTERSPEECH 2020 paper - Jointly Fine-Tuning "BERT-like'" Self Supervised Models to Improve Multimodal Speech Emotion R…☆120Updated 4 years ago
- Audio-Visual Speech Separation with Cross-Modal Consistency☆229Updated last year
- ☆107Updated 2 years ago
- Multilingual datasets with raw audio for speech emotion recognition☆25Updated 3 years ago
- ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'☆370Updated last year
- Lightweight and Interpretable ML Model for Speech Emotion Recognition and Ambiguity Resolution (trained on IEMOCAP dataset)☆415Updated last year
- INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. …☆666Updated 4 months ago
- Wav2Vec for speech recognition, classification, and audio classification☆262Updated 3 years ago
- Multi-modal Speech Emotion Recogniton on IEMOCAP dataset☆89Updated last year
- Code for the InterSpeech 2023 paper: MMER: Multimodal Multi-task learning for Speech Emotion Recognition☆73Updated last year
- ☆161Updated 9 months ago
- A multimodal approach on emotion recognition using audio and text.☆174Updated 4 years ago
- VGGSound: A Large-scale Audio-Visual Dataset☆314Updated 3 years ago
- Deep-Learning-Based Audio-Visual Speech Enhancement and Separation☆207Updated 2 years ago
- ☆110Updated 2 years ago
- Disentangled Speech Embeddings using Cross-Modal Self-Supervision☆159Updated 5 years ago
- Code and Pretrained Models for ICLR 2023 Paper "Contrastive Audio-Visual Masked Autoencoder".☆256Updated last year
- ☆49Updated last year
- End-to-End Neural Diarization☆398Updated 3 years ago
- Code for Speech Emotion Recognition with Co-Attention based Multi-level Acoustic Information☆143Updated last year
- UniSpeech - Large Scale Self-Supervised Learning for Speech☆456Updated last year