IliaZenkov / transformer-cnn-emotion-recognition
Speech Emotion Classification with novel Parallel CNN-Transformer model built with PyTorch, plus thorough explanations of CNNs, Transformers, and everything in between
☆242Updated 4 years ago
Alternatives and similar repositories for transformer-cnn-emotion-recognition:
Users that are interested in transformer-cnn-emotion-recognition are comparing it to the libraries listed below
- Speech Emotion Recognition from raw speech signals using 1D CNN-LSTM☆104Updated 3 years ago
- A PyTorch implementation of Speech emotion recognition using deep 1D & 2D CNN LSTM networks☆25Updated last year
- This repository contains PyTorch implementations of four models for classifying emotions in speech.☆198Updated 2 years ago
- Multi-modal Speech Emotion Recognition on IEMOCAP dataset☆89Updated last year
- Detect emotion from audio signals of IEMOCAP dataset using multi-modal approach. Utilized acoustic features, mel-spectrogram and text as …☆38Updated 11 months ago
- How to detect emotions from speech using Bi-directional LSTM networks and attention mechanism in Keras.☆20Updated 11 months ago
- [ICASSP 2023] Official TensorFlow implementation of "Temporal Modeling Matters: A Novel Temporal Emotional Modeling Approach for Speech E…☆168Updated 9 months ago
- Automatic speech emotion recognition based on transfer learning from spectrograms using ResNet☆21Updated 2 years ago
- The code for our IEEE ACCESS (2020) paper Multimodal Emotion Recognition with Transformer-Based Self Supervised Feature Fusion.☆116Updated 3 years ago
- Code for Speech Emotion Recognition with Co-Attention based Multi-level Acoustic Information☆137Updated last year
- ☆104Updated 2 years ago
- This code reproduces the emotion recognition model, 2D CNN LSTM networks, which is based on <Speech emotion recognition using deep 1D & 2D CNN LSTM net…☆23Updated 4 years ago
- A multimodal approach on emotion recognition using audio and text.☆170Updated 4 years ago
- A repository for emotion recognition from speech, text and mocap data from IEMOCAP dataset☆13Updated 6 years ago
- Time series course Fall 2019 project☆54Updated 4 years ago
- Light-SERNet: A lightweight fully convolutional neural network for speech emotion recognition☆73Updated 2 years ago
- This is the official code for paper "Speech Emotion Recognition with Global-Aware Fusion on Multi-scale Feature Representation" published…☆45Updated 2 years ago
- Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition☆145Updated 3 years ago
- Reproduction of DepAudioNet by Ma et al. {DepAudioNet: An Efficient Deep Model for Audio based Depression Classification,(https://dl.acm.…☆73Updated 3 years ago
- ☆48Updated last year
- Human emotions are one of the strongest ways of communication. Even if a person doesn’t understand a language, he or she can very well u…☆24Updated 3 years ago
- Scripts used in the research described in the paper "Multimodal Emotion Recognition with High-level Speech and Text Features" accepted in…☆52Updated 3 years ago
- Repository for my paper: Deep Multilayer Perceptrons for Dimensional Speech Emotion Recognition☆11Updated last year
- This repository provides implementation for the paper "Self-attention fusion for audiovisual emotion recognition with incomplete data".☆124Updated 5 months ago
- ☆41Updated 4 years ago
- The code for our INTERSPEECH 2020 paper - Jointly Fine-Tuning "BERT-like" Self Supervised Models to Improve Multimodal Speech Emotion R…☆117Updated 4 years ago
- TensorFlow implementation of "Multimodal Speech Emotion Recognition using Audio and Text," IEEE SLT-18☆273Updated 8 months ago
- Code for EmoAudioNet, a deep neural network for speech classification (published in ICPR 2020)☆11Updated 4 years ago
- In this project, the performance of speech emotion recognition is compared between two methods (SVM vs Bi-LSTM RNN). Conventional classifi…☆26Updated last year
- Lightweight and Interpretable ML Model for Speech Emotion Recognition and Ambiguity Resolution (trained on IEMOCAP dataset)☆410Updated last year