Sunner4nwpu / TEMMA
Multi-modal fusion framework based on Transformer Encoder
☆14 Updated 4 years ago
Alternatives and similar repositories for TEMMA
Users who are interested in TEMMA are comparing it to the repositories listed below.
- ☆14 Updated 3 years ago
- Detect Depression with AI Sub-challenge (DSS) of AVEC2019 experiment version via YZK ☆13 Updated 3 years ago
- The code for our IEEE ACCESS (2020) paper Multimodal Emotion Recognition with Transformer-Based Self Supervised Feature Fusion. ☆120 Updated 3 years ago
- Baseline scripts for the Audio/Visual Emotion Challenge 2019 ☆79 Updated 3 years ago
- A survey of deep multimodal emotion recognition. ☆52 Updated 3 years ago
- FG2021: Cross Attentional AV Fusion for Dimensional Emotion Recognition ☆28 Updated 5 months ago
- IEEE T-BIOM : "Audio-Visual Fusion for Emotion Recognition in the Valence-Arousal Space Using Joint Cross-Attention" ☆38 Updated 5 months ago
- Multimodal Fusion, Multimodal Sentiment Analysis ☆23 Updated 4 years ago
- Modulated Fusion using Transformer for Linguistic-Acoustic Emotion Recognition ☆30 Updated 4 years ago
- ABAW3 (CVPRW): A Joint Cross-Attention Model for Audio-Visual Fusion in Dimensional Emotion Recognition ☆45 Updated last year
- The code repository for NAACL 2021 paper "Multimodal End-to-End Sparse Model for Emotion Recognition". ☆103 Updated 2 years ago
- AuxFormer: Robust Approach to Audiovisual Emotion Recognition ☆14 Updated 2 years ago
- "MULTIMODAL EMOTION RECOGNITION BASED ON DEEP TEMPORAL FEATURES USING CROSS-MODAL TRANSFORMER AND SELF-ATTENTION" ICASSP'23 ☆19 Updated 2 years ago
- Efficient Multimodal Transformer with Dual-Level Feature Restoration for Robust Multimodal Sentiment Analysis (TAC 2023) ☆59 Updated 7 months ago
- Reproduction of DepAudioNet by Ma et al. {DepAudioNet: An Efficient Deep Model for Audio based Depression Classification,(https://dl.acm.… ☆76 Updated 3 years ago
- PyTorch implementation for Audio-Visual Domain Adaptation Feature Fusion for Speech Emotion Recognition ☆12 Updated 3 years ago
- A Pytorch implementation of emotion recognition from videos ☆18 Updated 4 years ago
- the baseline model of CMDC corpus ☆40 Updated 2 years ago
- ☆28 Updated 3 years ago
- We achieved the 2nd and 3rd places in ABAW3 and ABAW5, respectively. ☆27 Updated last year
- ☆17 Updated 2 years ago
- Reproducing the baselines of the 2nd Multimodal Sentiment Analysis Challenge (MuSe 2021) ☆40 Updated 3 years ago
- ☆65 Updated last year
- Code for EmoAudioNet, a deep neural network for speech classification (published in ICPR 2020) ☆12 Updated 4 years ago
- Official implementation of the paper "MSAF: Multimodal Split Attention Fusion"