sunlicai / HiCMAE
[Information Fusion 2024] HiCMAE: Hierarchical Contrastive Masked Autoencoder for Self-Supervised Audio-Visual Emotion Recognition
☆114 · Updated 2 months ago
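For context on the terminology in the title, below is a minimal sketch of the two ingredients it names, masked autoencoding and cross-modal contrastive alignment, applied to toy audio and visual token sequences. It is a single-level illustration under assumed shapes and module choices (the `ToyAVMaskedAutoencoder` class, the 0.07 temperature, and mean-pooled clip features are all illustrative assumptions), not the hierarchical HiCMAE model from the paper.

```python
# Illustrative only: masked autoencoding + cross-modal contrastive alignment
# on toy audio/visual tokens. NOT the authors' HiCMAE architecture; all
# module choices and sizes here are simplifying assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F

class ToyAVMaskedAutoencoder(nn.Module):
    def __init__(self, dim=64, mask_ratio=0.5):
        super().__init__()
        self.mask_ratio = mask_ratio
        self.audio_enc = nn.TransformerEncoder(
            nn.TransformerEncoderLayer(dim, nhead=4, batch_first=True), num_layers=2)
        self.video_enc = nn.TransformerEncoder(
            nn.TransformerEncoderLayer(dim, nhead=4, batch_first=True), num_layers=2)
        self.audio_dec = nn.Linear(dim, dim)  # toy "decoder" for masked reconstruction
        self.video_dec = nn.Linear(dim, dim)
        self.mask_token = nn.Parameter(torch.zeros(1, 1, dim))

    def mask(self, x):
        # Randomly replace a fraction of tokens with a learned mask token.
        b, n, d = x.shape
        keep = torch.rand(b, n, device=x.device) > self.mask_ratio
        masked = torch.where(keep.unsqueeze(-1), x, self.mask_token.expand(b, n, d))
        return masked, keep

    def forward(self, audio_tokens, video_tokens):
        a_in, a_keep = self.mask(audio_tokens)
        v_in, v_keep = self.mask(video_tokens)
        a_feat = self.audio_enc(a_in)
        v_feat = self.video_enc(v_in)
        # Reconstruction loss on the masked positions only.
        rec = (F.mse_loss(self.audio_dec(a_feat)[~a_keep], audio_tokens[~a_keep]) +
               F.mse_loss(self.video_dec(v_feat)[~v_keep], video_tokens[~v_keep]))
        # Cross-modal contrastive loss on mean-pooled clip-level features.
        a_clip = F.normalize(a_feat.mean(dim=1), dim=-1)
        v_clip = F.normalize(v_feat.mean(dim=1), dim=-1)
        logits = a_clip @ v_clip.t() / 0.07
        targets = torch.arange(a_clip.size(0), device=logits.device)
        con = (F.cross_entropy(logits, targets) + F.cross_entropy(logits.t(), targets)) / 2
        return rec + con
```

Calling `ToyAVMaskedAutoencoder()(torch.randn(8, 16, 64), torch.randn(8, 16, 64))` returns a single scalar loss. The hierarchical aspect referred to in the title (applying such objectives across multiple encoder levels, as the name suggests) is not reproduced in this sketch.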
Alternatives and similar repositories for HiCMAE
Users who are interested in HiCMAE are comparing it to the libraries listed below.
- GPT-4V with Emotion ☆95 · Updated last year
- This repository provides the code for MMA-DFER, a multimodal (audio-visual) emotion recognition method. This is an official implementation … ☆49 · Updated last year
- Toolkits for Multimodal Emotion Recognition ☆254 · Updated 5 months ago
- [ACM ICMR'25] Official repository for "eMotions: A Large-Scale Dataset for Emotion Recognition in Short Videos" ☆35 · Updated 3 months ago
- Explainable Multimodal Emotion Reasoning (EMER), OV-MER (ICML), and AffectGPT (ICML, Oral) ☆271 · Updated 2 months ago
- Emotion Recognition ToolKit (ERTK): tools for emotion recognition. Dataset processing, feature extraction, experiments. ☆55 · Updated last week
- ☆24 · Updated 6 months ago
- EmoLLM: Multimodal Emotional Understanding Meets Large Language Models ☆19 · Updated last year
- av-SALMONN: Speech-Enhanced Audio-Visual Large Language Models ☆13 · Updated last year
- [CVPR 2023] Code for "Learning Emotion Representations from Verbal and Nonverbal Communication" ☆53 · Updated 8 months ago
- [BMVC'23] Prompting Visual-Language Models for Dynamic Facial Expression Recognition ☆136 · Updated 11 months ago
- [CVPR 2023] Official code repository for "How you feelin'? Learning Emotions and Mental States in Movie Scenes". https://arxiv.org/abs/23… ☆58 · Updated last year
- [ECCV’24] Official Implementation for CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenario… ☆57 · Updated last year
- This is the official implementation of the ICCV 2023 paper "EmoSet: A large-scale visual emotion dataset with rich attributes". ☆54 · Updated last year
- Frame-Level Emotional State Alignment Method for Speech Emotion Recognition ☆23 · Updated 10 months ago
- A Unimodal Valence-Arousal Driven Contrastive Learning Framework for Multimodal Multi-Label Emotion Recognition (ACM MM 2024, Oral) ☆25 · Updated 11 months ago
- [CVPR 2024] EmoVIT: Revolutionizing Emotion Insights with Visual Instruction Tuning ☆36 · Updated 6 months ago
- ☆22 · Updated last year
- [CVPR 2023] This is the official implementation of "Weakly Supervised Video Emotion Detection and Prediction via Cross-Modal Temporal Era… ☆39 · Updated 9 months ago
- [EMNLP 2023] Conversation Understanding using Relational Temporal Graph Neural Networks with Auxiliary Cross-Modality Interaction ☆61 · Updated last year
- MIntRec: A New Dataset for Multimodal Intent Recognition (ACM MM 2022) ☆115 · Updated 5 months ago
- Official repository for our CVPR 2024 Workshop paper "Multi-Task Multi-Modal Self-Supervised Learning for Facial Expression Recognition". ☆25 · Updated 9 months ago
- The code repo for the ICASSP 2023 paper "MMCosine: Multi-Modal Cosine Loss Towards Balanced Audio-Visual Fine-Grained Learning" ☆24 · Updated 2 years ago
- We achieved the 2nd and 3rd places in ABAW3 and ABAW5, respectively. ☆31 · Updated last year
- A Facial Expression-Aware Multimodal Multi-task Learning Framework for Emotion Recognition in Multi-party Conversations (ACL 2023) ☆70 · Updated 11 months ago
- Code for the InterSpeech 2023 paper "MMER: Multimodal Multi-task Learning for Speech Emotion Recognition" ☆75 · Updated last year
- [ABAW6 (CVPR-W)] We achieved second place in the valence-arousal challenge of ABAW6. ☆30 · Updated last year
- ☆26 · Updated 2 years ago
- ☆22 · Updated 6 months ago
- SpeechFormer++ in PyTorch ☆49 · Updated 2 years ago