fuyyyyy/SEPM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/fuyyyyy/SEPM)

fuyyyyy / SEPM

[ICML'25 Spotlight] Catch Your Emotion: Sharpening Emotion Perception in Multimodal Large Language Models

☆57

Alternatives and similar repositories for SEPM

Users that are interested in SEPM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

fuyyyyy / EMOE
View on GitHub
[CVPR'25] EMOE: Modality-Specific Enhanced Dynamic Emotion Experts
☆120Jul 12, 2025Updated last year
LiangJian24 / LoRASculpt
View on GitHub
[CVPR'25 Oral & TPAMI'26] LoRASculpt: Sculpting LoRA for Harmonizing General and Specialized Knowledge in Multimodal Large Language Model…
☆54Aug 28, 2025Updated 10 months ago
WenkeHuang / Awesome-MLLM-Tuning
View on GitHub
Multimodal Large Language Model (MLLM) Tuning Survey: Keeping Yourself is Important in Downstream Tuning Multimodal Large Language Model
☆101Aug 5, 2025Updated 11 months ago
chengzju / CARAT
View on GitHub
☆25Apr 16, 2025Updated last year
zeroQiaoba / AffectGPT
View on GitHub
EMER, OV-MER (ICML25), AffectGPT (ICML25, Oral), EmoPrefer (ICLR26)
☆410Feb 24, 2026Updated 4 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
201983290498 / lddu_mmer
View on GitHub
☆13Apr 2, 2025Updated last year
WenkeHuang / MAPO
View on GitHub
MAPO: MIXED ADVANTAGE POLICY OPTIMIZATION
☆35Sep 24, 2025Updated 9 months ago
thuiar / MMLA
View on GitHub
The first comprehensive multimodal language analysis benchmark for evaluating foundation models
☆32Sep 22, 2025Updated 10 months ago
ZhiyuanHan-Aaron / MoSEAR
View on GitHub
Benchmarking and Bridging Emotion Conflicts for Multimodal Emotion Reasoning (ACM MM 2025 Oral)
☆35Oct 11, 2025Updated 9 months ago
AZYoung233 / MSE-Adapter
View on GitHub
[AAAI-2025] The official Implement of MSE-Adapter
☆54Jul 16, 2025Updated last year
AvaMERG / AvaMERG-Pipeline
View on GitHub
☆19Jun 11, 2025Updated last year
yan9qu / CVPR25-MINE
View on GitHub
Repo for "Uncertain Multimodal Intention and Emotion Understanding in the Wild"
☆19Oct 20, 2025Updated 9 months ago
Cross-Innovation-Lab / PCDS
View on GitHub
source code for "Towards Speaker-Unknown Emotion Recognition in Conversation via Progressive Contrastive Deep Supervision"
☆11Nov 22, 2024Updated last year
KlingAIResearch / MODA
View on GitHub
[ICML 2025 Spotlight] MODA: MOdular Duplex Attention for Multimodal Perception, Cognition, and Emotion Understanding
☆68Jul 10, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
LuoMSen / KAN-MCP
View on GitHub
☆28Aug 3, 2025Updated 11 months ago
Lum1104 / EIBench
View on GitHub
(NeXD @ CVPR 2025) Why We Feel: Breaking Boundaries in Emotional Reasoning with Multimodal Large Language Models
☆32Sep 30, 2025Updated 9 months ago
Flame-Chasers / DiaNA
View on GitHub
【CVPR 2025】Chat-based Person Retrieval via Dialogue-Refined Cross-Modal Alignment
☆39Sep 17, 2025Updated 10 months ago
ZhangqiJiang07 / middle_layers_indicating_hallucinations
View on GitHub
[CVPR 2025] Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Att…
☆84Oct 9, 2025Updated 9 months ago
gw-zhong / CMC
View on GitHub
Codes for "Calibrating Multimodal Consensus for Emotion Recognition".
☆19Oct 24, 2025Updated 8 months ago
zeroQiaoba / MERTools
View on GitHub
Toolkits for Multimodal Emotion Recognition
☆325Jun 5, 2026Updated last month
XuankunRong / SafeGRPO
View on GitHub
[CVPR'26] SafeGRPO: Self-Rewarded Multimodal Safety Alignment via Rule-Governed Policy Optimization
☆21Feb 19, 2026Updated 5 months ago
JingbiaoMei / RGCL
View on GitHub
📄 ACL 2024: RGCL, Retrieval-Guided Contrastive Learning for Hateful Meme Detection 📄 EMNLP 2025 (Oral): RA-HMD, Robust Adaptation of La…
☆40Mar 1, 2026Updated 4 months ago
zehuiwu / SpeechCueLLM
View on GitHub
☆31Feb 27, 2025Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
XL2248 / SOV-MAS
View on GitHub
The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"
☆11May 16, 2023Updated 3 years ago
NEU-DataMining / awesome-affective-computing
View on GitHub
A comprehensive overview of affective computing research in the era of large language models (LLMs).
☆33Aug 7, 2024Updated last year
XuecWu / eMotions
View on GitHub
[ACM ICMR'25]Official repository for "eMotions: A Large-Scale Dataset for Emotion Recognition in Short Videos"
☆38Jun 11, 2026Updated last month
chuyq / MESC
View on GitHub
☆42Mar 24, 2025Updated last year
ASolitaryMan / HFLEA
View on GitHub
FRAME-LEVEL EMOTIONAL STATE ALIGNMENT METHOD FOR SPEECH EMOTION RECOGNITION
☆23Dec 22, 2024Updated last year
PanoSent / PanoSent
View on GitHub
This repository hosts the code, data and model weight of PanoSent.
☆61Jul 8, 2025Updated last year
MengboLi / MS-SENet
View on GitHub
☆11Jul 16, 2024Updated 2 years ago
HumanMLLM / HumanOmni
View on GitHub
HumanOmni
☆240Mar 10, 2025Updated last year
MC-EIU / MC-EIU
View on GitHub
☆27Apr 29, 2025Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
THU-BPM / ICT
View on GitHub
Official repo for ICT: Image-Object Cross-Level Trusted Intervention for Mitigating Object Hallucination in Large Vision-Language Models
☆28Mar 24, 2025Updated last year
thuiar / TCL-MAP
View on GitHub
TCL-MAP is a powerful method for multimodal intent recognition (AAAI 2024)
☆60Jan 25, 2024Updated 2 years ago
shaokai1209 / MDSA
View on GitHub
[IEEE, TASLP, 2023] The code of the paper "Multi-Source Discriminant Subspace Alignment for Cross-Domain Speech Emotion Recognition".
☆19Sep 27, 2024Updated last year
XuankunRong / Awesome-LVLM-Safety
View on GitHub
A curated list of resources dedicated to the safety of Large Vision-Language Models. This repository aligns with our survey titled A Surv…
☆213May 25, 2026Updated last month
MANLP-suda / HHMPN
View on GitHub
Multi-modal Multi-label Emotion Recognition with Heterogeneous Hierarchical Message Passing
☆18Sep 24, 2022Updated 3 years ago
yuntaoshou / Awesome-Emotion-Reasoning
View on GitHub
Awesome-Emotion-Reasoning is a collection of Emotion-Reasoning works, including papers, codes and datasets
☆95Dec 16, 2025Updated 7 months ago
WenkeHuang / MarsFL
View on GitHub
TPAMI 2024 - Federated Learning for Generalization, Robustness, Fairness: A Survey and Benchmark
☆114Mar 16, 2026Updated 4 months ago