nhattruongpham / mmser
SERVER: Multi-modal Speech Emotion Recognition using Transformer-based and Vision-based Embeddings
☆14Updated last year
Alternatives and similar repositories for mmser
Users who are interested in mmser are comparing it to the repositories listed below
- ICASSP 2023: "Recursive Joint Attention for Audio-Visual Fusion in Regression Based Emotion Recognition"☆14Updated 11 months ago
- This is the official code for paper "Speech Emotion Recognition with Global-Aware Fusion on Multi-scale Feature Representation" published…☆48Updated 3 years ago
- Official implementation of the paper "SPEAKER VGG CCT: Cross-corpus Speech Emotion Recognition with Speaker Embedding and Vision Transfor…☆22Updated 2 years ago
- [RAVDESS] Speech Emotion Recognition with Convolutional Attention based Bi-GRU. (Best test accuracy of 87%)☆31Updated 2 years ago
- Detect emotion from audio signals of IEMOCAP dataset using multi-modal approach. Utilized acoustic features, mel-spectrogram and text as …☆40Updated last year
- SpeechFormer++ in PyTorch☆49Updated 2 years ago
- Scripts used in the research described in the paper "Multimodal Emotion Recognition with High-level Speech and Text Features" accepted in…☆53Updated 4 years ago
- DWFormer: Dynamic Window Transformer for Speech Emotion Recognition (ICASSP 2023 Oral)☆65Updated last year
- A survey of deep multimodal emotion recognition.☆54Updated 3 years ago
- [ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations☆39Updated last year
- CM-BERT: Cross-Modal BERT for Text-Audio Sentiment Analysis (MM2020)☆114Updated 5 years ago
- A multimodal approach on emotion recognition using audio and text.☆183Updated 5 years ago
- Emotion Recognition ToolKit (ERTK): tools for emotion recognition. Dataset processing, feature extraction, experiments,☆55Updated 3 weeks ago
- ☆29Updated 3 years ago
- Transformer-based model for Speech Emotion Recognition (SER), implemented in PyTorch☆41Updated last year
- This repository provides the codes for MMA-DFER: multimodal (audiovisual) emotion recognition method. This is an official implementation …☆48Updated last year
- The code for our INTERSPEECH 2020 paper - Jointly Fine-Tuning "BERT-like" Self Supervised Models to Improve Multimodal Speech Emotion R…☆119Updated 4 years ago
- FRAME-LEVEL EMOTIONAL STATE ALIGNMENT METHOD FOR SPEECH EMOTION RECOGNITION☆23Updated 10 months ago
- This repository contains the code for our ICASSP paper `Speech Emotion Recognition using Semantic Information` https://arxiv.org/pdf/2103…☆24Updated 4 years ago
- The code for our IEEE ACCESS (2020) paper Multimodal Emotion Recognition with Transformer-Based Self Supervised Feature Fusion.☆122Updated 4 years ago
- [EMNLP2023] Conversation Understanding using Relational Temporal Graph Neural Networks with Auxiliary Cross-Modality Interaction☆60Updated last year
- ☆48Updated last year
- Official implementation of FOP method as described in "Fusion and Orthogonal Projection for Improved Face-Voice Association"☆19Updated last year
- AuxFormer: Robust Approach to Audiovisual Emotion Recognition☆14Updated 2 years ago
- Code for the InterSpeech 2023 paper: MMER: Multimodal Multi-task learning for Speech Emotion Recognition☆75Updated last year
- Repository for code and paper submitted for APSIPA 2019, Lanzhou, China☆21Updated last year
- Chinese BERT classification with tf2.0 and audio classification with mfcc☆13Updated 4 years ago
- The code repository for NAACL 2021 paper "Multimodal End-to-End Sparse Model for Emotion Recognition".☆106Updated 2 years ago
- This repository contains PyTorch implementation of 4 different models for classification of emotions of the speech.☆206Updated 2 years ago
- Multimodal SER Model meant to be trained on recognising emotions from speech (text + acoustic data). Fine-tuned the DeBERTaV3 model, resp…☆10Updated last year