sunlicai / HiCMAE
[Information Fusion 2024] HiCMAE: Hierarchical Contrastive Masked Autoencoder for Self-Supervised Audio-Visual Emotion Recognition
☆118 · Updated 4 months ago
Alternatives and similar repositories for HiCMAE
Users interested in HiCMAE are comparing it to the repositories listed below.
- GPT-4V with Emotion ☆97 · Updated 2 years ago
- This repository provides the code for MMA-DFER, a multimodal (audiovisual) emotion recognition method. This is an official implementation … ☆48 · Updated last year
- Toolkits for Multimodal Emotion Recognition ☆270 · Updated 7 months ago
- Emotion Recognition ToolKit (ERTK): tools for emotion recognition. Dataset processing, feature extraction, experiments, ☆55 · Updated 2 months ago
- A Unimodal Valence-Arousal Driven Contrastive Learning Framework for Multimodal Multi-Label Emotion Recognition (ACM MM 2024 oral) ☆25 · Updated last year
- [ACM ICMR'25] Official repository for "eMotions: A Large-Scale Dataset for Emotion Recognition in Short Videos" ☆36 · Updated 5 months ago
- Explainable Multimodal Emotion Reasoning (EMER), OV-MER (ICML), and AffectGPT (ICML, Oral) ☆312 · Updated 4 months ago
- [CVPR 2024] EmoVIT: Revolutionizing Emotion Insights with Visual Instruction Tuning ☆38 · Updated 8 months ago
- This is the official implementation of the 2023 ICCV paper "EmoSet: A large-scale visual emotion dataset with rich attributes". ☆62 · Updated last year
- [CVPR 2023] Official code repository for "How you feelin'? Learning Emotions and Mental States in Movie Scenes". https://arxiv.org/abs/23… ☆58 · Updated last year
- [EMNLP2023] Conversation Understanding using Relational Temporal Graph Neural Networks with Auxiliary Cross-Modality Interaction ☆60 · Updated last year
- NeurIPS'2023 official implementation code ☆68 · Updated 2 years ago
- [BMVC'23] Prompting Visual-Language Models for Dynamic Facial Expression Recognition ☆139 · Updated last year
- [CVPR 2023] Code for "Learning Emotion Representations from Verbal and Nonverbal Communication" ☆54 · Updated 10 months ago
- ☆23 · Updated last year
- Official repository for our CVPR 2024 Workshop paper "Multi-Task Multi-Modal Self-Supervised Learning for Facial Expression Recognition". ☆25 · Updated 11 months ago
- ☆26 · Updated 8 months ago
- av-SALMONN: Speech-Enhanced Audio-Visual Large Language Models ☆13 · Updated last year
- Official PyTorch implementation for "MMS-LLaMA: Efficient LLM-based Audio-Visual Speech Recognition with Minimal Multimodal Speech Tokens… ☆43 · Updated 6 months ago
- The code repo for the ICASSP 2023 paper "MMCosine: Multi-Modal Cosine Loss Towards Balanced Audio-Visual Fine-Grained Learning" ☆24 · Updated 2 years ago
- FRAME-LEVEL EMOTIONAL STATE ALIGNMENT METHOD FOR SPEECH EMOTION RECOGNITION ☆23 · Updated last year
- EmoLLM: Multimodal Emotional Understanding Meets Large Language Models ☆19 · Updated last year
- ☆23 · Updated 8 months ago
- DWFormer: Dynamic Window Transformer for Speech Emotion Recognition (ICASSP 2023 Oral) ☆67 · Updated last year
- Code for the InterSpeech 2023 paper: MMER: Multimodal Multi-task learning for Speech Emotion Recognition ☆78 · Updated last year
- SpeechFormer++ in PyTorch ☆49 · Updated 2 years ago
- ICASSP 2023: "Recursive Joint Attention for Audio-Visual Fusion in Regression Based Emotion Recognition" ☆14 · Updated last year
- [ECCV’24] Official Implementation for CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenario… ☆57 · Updated last year
- [CVPR 2023] This is the official implementation of "Weakly Supervised Video Emotion Detection and Prediction via Cross-Modal Temporal Era… ☆40 · Updated 11 months ago
- We achieved the 2nd and 3rd places in ABAW3 and ABAW5, respectively. ☆31 · Updated last year