sunlicai / HiCMAELinks
[Information Fusion 2024] HiCMAE: Hierarchical Contrastive Masked Autoencoder for Self-Supervised Audio-Visual Emotion Recognition
☆114Updated 2 months ago
Alternatives and similar repositories for HiCMAE
Users that are interested in HiCMAE are comparing it to the libraries listed below
Sorting:
- GPT-4V with Emotion☆96Updated last year
- This repository provides the codes for MMA-DFER: multimodal (audiovisual) emotion recognition method. This is an official implementation …☆48Updated last year
- Toolkits for Multimodal Emotion Recognition☆260Updated 6 months ago
- av-SALMONN: Speech-Enhanced Audio-Visual Large Language Models☆13Updated last year
- Emotion Recognition ToolKit (ERTK): tools for emotion recognition. Dataset processing, feature extraction, experiments,☆55Updated last month
- Explainable Multimodal Emotion Reasoning (EMER), OV-MER (ICML), and AffectGPT (ICML, Oral)☆286Updated 3 months ago
- [ECCV’24] Official Implementation for CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenario…☆57Updated last year
- ☆25Updated 6 months ago
- [ACM ICMR'25]Official repository for "eMotions: A Large-Scale Dataset for Emotion Recognition in Short Videos"☆35Updated 3 months ago
- ☆22Updated last year
- This is the official implementation of 2023 ICCV paper "EmoSet: A large-scale visual emotion dataset with rich attributes".☆57Updated last year
- [BMVC'23] Prompting Visual-Language Models for Dynamic Facial Expression Recognition☆136Updated 11 months ago
- DWFormer: Dynamic Window Transformer for Speech Emotion Recognition(ICASSP 2023 Oral)☆65Updated last year
- [CVPR 2023] Code for "Learning Emotion Representations from Verbal and Nonverbal Communication"☆53Updated 9 months ago
- Official implementation of RAVEn (ICLR 2023) and BRAVEn (ICASSP 2024)☆76Updated 8 months ago
- [EMNLP2023] Conversation Understanding using Relational Temporal Graph Neural Networks with Auxiliary Cross-Modality Interaction☆60Updated last year
- [CVPR 2023] Official code repository for "How you feelin'? Learning Emotions and Mental States in Movie Scenes". https://arxiv.org/abs/23…☆58Updated last year
- The code repo for ICASSP 2023 Paper "MMCosine: Multi-Modal Cosine Loss Towards Balanced Audio-Visual Fine-Grained Learning"☆24Updated 2 years ago
- ☆19Updated last year
- MIntRec: A New Dataset for Multimodal Intent Recognition (ACM MM 2022)☆120Updated 6 months ago
- Official repository for our CVPR 2024 Workshop paper "Multi-Task Multi-Modal Self-Supervised Learning for Facial Expression Recognition".☆25Updated 10 months ago
- Code for the InterSpeech 2023 paper: MMER: Multimodal Multi-task learning for Speech Emotion Recognition☆76Updated last year
- ☆23Updated 7 months ago
- FRAME-LEVEL EMOTIONAL STATE ALIGNMENT METHOD FOR SPEECH EMOTION RECOGNITION☆23Updated 10 months ago
- NeurIPS'2023 official implementation code☆68Updated 2 years ago
- SpeechFormer++ in PyTorch☆49Updated 2 years ago
- ICASSP 2023: "Recursive Joint Attention for Audio-Visual Fusion in Regression Based Emotion Recognition"☆14Updated 11 months ago
- [CVPR 2024] EmoVIT: Revolutionizing Emotion Insights with Visual Instruction Tuning☆36Updated 7 months ago
- Official PyTorch implementation for "MMS-LLaMA: Efficient LLM-based Audio-Visual Speech Recognition with Minimal Multimodal Speech Tokens…☆38Updated 5 months ago
- FG2021: Cross Attentional AV Fusion for Dimensional Emotion Recognition☆33Updated 11 months ago