sunlicai / HiCMAE

[Information Fusion 2024] HiCMAE: Hierarchical Contrastive Masked Autoencoder for Self-Supervised Audio-Visual Emotion Recognition

☆118

Alternatives and similar repositories for HiCMAE

Users who are interested in HiCMAE are comparing it to the repositories listed below.

zeroQiaoba / gpt4v-emotion
GPT-4V with Emotion
☆97 Updated 2 years ago
katerynaCh / MMA-DFER
This repository provides the code for MMA-DFER, a multimodal (audiovisual) emotion recognition method. This is an official implementation …
☆48 Updated last year
zeroQiaoba / MERTools
Toolkits for Multimodal Emotion Recognition
☆270 Updated 7 months ago
Strong-AI-Lab / emotion
Emotion Recognition ToolKit (ERTK): tools for emotion recognition. Dataset processing, feature extraction, experiments,
☆55 Updated 2 months ago
NUSTM / UniVA
A Unimodal Valence-Arousal Driven Contrastive Learning Framework for Multimodal Multi-Label Emotion Recognition (ACM MM 2024 oral)
☆25 Updated last year
XuecWu / eMotions
[ACM ICMR'25] Official repository for "eMotions: A Large-Scale Dataset for Emotion Recognition in Short Videos"
☆36 Updated 5 months ago
zeroQiaoba / AffectGPT
Explainable Multimodal Emotion Reasoning (EMER), OV-MER (ICML), and AffectGPT (ICML, Oral)
☆312 Updated 4 months ago
aimmemotion / EmoVIT
[CVPR 2024] EmoVIT: Revolutionizing Emotion Insights with Visual Instruction Tuning
☆38 Updated 8 months ago
JingyuanYY / EmoSet
This is the official implementation of the ICCV 2023 paper "EmoSet: A large-scale visual emotion dataset with rich attributes".
☆62 Updated last year
katha-ai / EmoTx-CVPR2023
[CVPR 2023] Official code repository for "How you feelin'? Learning Emotions and Mental States in Movie Scenes". https://arxiv.org/abs/23…
☆58 Updated last year
leson502 / CORECT_EMNLP2023
[EMNLP2023] Conversation Understanding using Relational Temporal Graph Neural Networks with Auxiliary Cross-Modality Interaction
☆60 Updated last year
haoyi-duan / DG-SCT
NeurIPS'2023 official implementation code
☆68 Updated 2 years ago
zengqunzhao / DFER-CLIP
[BMVC'23] Prompting Visual-Language Models for Dynamic Facial Expression Recognition
☆139 Updated last year
Xeaver / EmotionCLIP
[CVPR 2023] Code for "Learning Emotion Representations from Verbal and Nonverbal Communication"
☆54 Updated 10 months ago
AI-S2-Lab / MEIJU2025-baseline
☆23 Updated last year
tub-cv-group / conclugen
Official repository for our CVPR 2024 Workshop paper "Multi-Task Multi-Modal Self-Supervised Learning for Facial Expression Recognition".
☆25 Updated 11 months ago
MC-EIU / MC-EIU
☆26 Updated 8 months ago
the-anonymous-bs / av-SALMONN
av-SALMONN: Speech-Enhanced Audio-Visual Large Language Models
☆13 Updated last year
JeongHun0716 / MMS-LLaMA
Official PyTorch implementation for "MMS-LLaMA: Efficient LLM-based Audio-Visual Speech Recognition with Minimal Multimodal Speech Tokens…
☆43 Updated 6 months ago
GeWu-Lab / MMCosine_ICASSP23
The code repo for ICASSP 2023 Paper "MMCosine: Multi-Modal Cosine Loss Towards Balanced Audio-Visual Fine-Grained Learning"
☆24 Updated 2 years ago
ASolitaryMan / HFLEA
FRAME-LEVEL EMOTIONAL STATE ALIGNMENT METHOD FOR SPEECH EMOTION RECOGNITION
☆23 Updated last year
yan9qu / EmoLLM
EmoLLM: Multimodal Emotional Understanding Meets Large Language Models
☆19 Updated last year
chengzju / CARAT
☆23 Updated 8 months ago
scutcsq / DWFormer
DWFormer: Dynamic Window Transformer for Speech Emotion Recognition (ICASSP 2023 Oral)
☆67 Updated last year
Sreyan88 / MMER
Code for the InterSpeech 2023 paper: MMER: Multimodal Multi-task learning for Speech Emotion Recognition
☆78 Updated last year
HappyColor / SpeechFormer2
SpeechFormer++ in PyTorch
☆49 Updated 2 years ago
praveena2j / RecurrentJointAttentionwithLSTMs
ICASSP 2023: "Recursive Joint Attention for Audio-Visual Fusion in Regression Based Emotion Recognition"
☆14 Updated last year
rikeilong / Bay-CAT
[ECCV’24] Official Implementation for CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenario…
☆57 Updated last year
nku-zhichengzhang / CTEN
[CVPR 2023] This is the official implementation of "Weakly Supervised Video Emotion Detection and Prediction via Cross-Modal Temporal Era…
☆40 Updated 11 months ago
sucv / ABAW3
We achieved the 2nd and 3rd places in ABAW3 and ABAW5, respectively.
☆31 Updated last year