sunlicai / HiCMAELinks
[Information Fusion 2024] HiCMAE: Hierarchical Contrastive Masked Autoencoder for Self-Supervised Audio-Visual Emotion Recognition
☆114Updated last month
Alternatives and similar repositories for HiCMAE
Users that are interested in HiCMAE are comparing it to the libraries listed below
Sorting:
- GPT-4V with Emotion☆95Updated last year
- This repository provides the codes for MMA-DFER: multimodal (audiovisual) emotion recognition method. This is an official implementation …☆46Updated last year
- Toolkits for Multimodal Emotion Recognition☆250Updated 4 months ago
- Explainable Multimodal Emotion Reasoning (EMER), OV-MER (ICML), and AffectGPT (ICML, Oral)☆259Updated 2 months ago
- This is the official implementation of 2023 ICCV paper "EmoSet: A large-scale visual emotion dataset with rich attributes".☆54Updated last year
- ☆23Updated 5 months ago
- Emotion Recognition ToolKit (ERTK): tools for emotion recognition. Dataset processing, feature extraction, experiments,☆56Updated 11 months ago
- [EMNLP2023] Conversation Understanding using Relational Temporal Graph Neural Networks with Auxiliary Cross-Modality Interaction☆62Updated last year
- ☆22Updated 11 months ago
- [ACM ICMR'25]Official repository for "eMotions: A Large-Scale Dataset for Emotion Recognition in Short Videos"☆35Updated 2 months ago
- FRAME-LEVEL EMOTIONAL STATE ALIGNMENT METHOD FOR SPEECH EMOTION RECOGNITION☆23Updated 9 months ago
- A Unimodal Valence-Arousal Driven Contrastive Learning Framework for Multimodal Multi-Label Emotion Recognition (ACM MM 2024 oral)☆25Updated 11 months ago
- [CVPR 2024] EmoVIT: Revolutionizing Emotion Insights with Visual Instruction Tuning☆36Updated 5 months ago
- Code for the InterSpeech 2023 paper: MMER: Multimodal Multi-task learning for Speech Emotion Recognition☆75Updated last year
- SpeechFormer++ in PyTorch☆48Updated 2 years ago
- [CVPR 2023] Code for "Learning Emotion Representations from Verbal and Nonverbal Communication"☆52Updated 7 months ago
- Official repository for our CVPR 2024 Workshop paper "Multi-Task Multi-Modal Self-Supervised Learning for Facial Expression Recognition".☆24Updated 9 months ago
- av-SALMONN: Speech-Enhanced Audio-Visual Large Language Models☆13Updated last year
- DWFormer: Dynamic Window Transformer for Speech Emotion Recognition(ICASSP 2023 Oral)☆64Updated last year
- [CVPR 2023] Official code repository for "How you feelin'? Learning Emotions and Mental States in Movie Scenes". https://arxiv.org/abs/23…☆56Updated 11 months ago
- EmoLLM: Multimodal Emotional Understanding Meets Large Language Models☆17Updated last year
- ☆21Updated 5 months ago
- We achieved the 2nd and 3rd places in ABAW3 and ABAW5, respectively.☆31Updated last year
- [ECCV’24] Official Implementation for CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenario…☆56Updated last year
- NeurIPS'2023 official implementation code☆66Updated last year
- [BMVC'23] Prompting Visual-Language Models for Dynamic Facial Expression Recognition☆132Updated 10 months ago
- A Facial Expression-Aware Multimodal Multi-task Learning Framework for Emotion Recognition in Multi-party Conversations (ACL 2023)☆70Updated 11 months ago
- ABAW6 (CVPR-W) We achieved second place in the valence arousal challenge of ABAW6☆29Updated last year
- [Interspeech 2023] Intelligible Lip-to-Speech Synthesis with Speech Units☆44Updated 11 months ago
- FG2021: Cross Attentional AV Fusion for Dimensional Emotion Recognition☆32Updated 10 months ago