KlingAIResearch/MODA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/KlingAIResearch/MODA)

KlingAIResearch / MODA

[ICML 2025 Spotlight] MODA: MOdular Duplex Attention for Multimodal Perception, Cognition, and Emotion Understanding

☆68

Alternatives and similar repositories for MODA

Users that are interested in MODA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

PanchengZhao / Concealed-Dense-Prediction
View on GitHub
The official code and dataset of paper: Deep Learning in Concealed Dense Prediction
☆22Apr 18, 2025Updated last year
nku-zhichengzhang / MPOT
View on GitHub
[ICCV 2023] This is the official implementation of "Multiple Planar Object Tracking"
☆24Aug 19, 2023Updated 2 years ago
tinglyfeng / figure_for_data_analysis
View on GitHub
☆10Apr 15, 2023Updated 3 years ago
NK-JittorCV / nk-det
View on GitHub
An open source codebase for object detection based on Jittor
☆20Dec 9, 2025Updated 7 months ago
nku-zhichengzhang / PlaneSeg
View on GitHub
[TNNLS 2023] This is official implementation of "PlaneSeg: Building a Plug-in for Boosting Planar Region Segmentation"
☆27Aug 27, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
nku-zhichengzhang / Awesome-emotion_llm_and_mllm
View on GitHub
Awesome papers for affective computing with llm and mllm
☆35Nov 26, 2025Updated 8 months ago
nku-zhichengzhang / MART
View on GitHub
[CVPR 2024] This is the official implementation of "MART: Masked Affective RepresenTation Learning via Masked Temporal Distribution Disti…
☆22Jun 14, 2025Updated last year
lyhisme / ETSN
View on GitHub
Efficient Two-Step Networks for Temporal Action Segmentation (Neurocomputing 2021)
☆17Dec 6, 2021Updated 4 years ago
KlingAIResearch / VidEmo
View on GitHub
[NeurIPS'25] VidEmo: Affective-Tree Reasoning for Emotion-Centric Video Foundation Models
☆15Dec 7, 2025Updated 7 months ago
exped1230 / S2-VER
View on GitHub
The official implement of paper S2-VER: Semi-Supervised Visual Emotion Recognition
☆11Apr 28, 2024Updated 2 years ago
wei-cheng777 / PS-Diffusion
View on GitHub
Official implementations for paper: PS-Diffusion: Photorealistic Subject-Driven Image Editing with Disentangled Control and Attention
☆24Oct 20, 2025Updated 9 months ago
HVision-NKU / DenseVLM
View on GitHub
[ICCV 2025] Unbiased Region-Language Alignment for Open-Vocabulary Dense Prediction
☆53Sep 22, 2025Updated 10 months ago
fuyyyyy / SEPM
View on GitHub
[ICML'25 Spotlight] Catch Your Emotion: Sharpening Emotion Perception in Multimodal Large Language Models
☆57Jan 21, 2026Updated 6 months ago
VINHYU / OpenSpatial
View on GitHub
☆94May 8, 2026Updated 2 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Lum1104 / EIBench
View on GitHub
(NeXD @ CVPR 2025) Why We Feel: Breaking Boundaries in Emotional Reasoning with Multimodal Large Language Models
☆32Sep 30, 2025Updated 9 months ago
Calvin11311 / Motion-Vector-Decomposition-Transformer
View on GitHub
MDT
☆25Mar 25, 2025Updated last year
exped1230 / BPM-GCN
View on GitHub
A new model for gait emotion recognition
☆15Mar 22, 2024Updated 2 years ago
lyhisme / DeST
View on GitHub
An official code for "A Decoupled Spatio-Temporal Framework for Skeleton-based Action Segmentation".
☆39Dec 15, 2023Updated 2 years ago
hukcc / Awesome-Video-Hallucination
View on GitHub
[ACL 2026] Paper list of Video LLM hallucination. Welcome to Star and Contribute!
☆36Jul 1, 2026Updated 3 weeks ago
linlany / MindtheGap
View on GitHub
ICCV25 highlight
☆59Jan 7, 2026Updated 6 months ago
nku-zhichengzhang / ExtDM
View on GitHub
[CVPR 2024] This is the official implementation of "ExtDM: Distribution Extrapolation Diffusion Model for Video Prediction"
☆58Jun 24, 2025Updated last year
aimagelab / MissRAG
View on GitHub
[ICCV 2025] MissRAG: Addressing the Missing Modality Challenge in Multimodal Large Language Models
☆26May 12, 2026Updated 2 months ago
mims-harvard / Qworld
View on GitHub
Qworld: Question-Specific Evaluation Criteria for LLMs
☆31Mar 26, 2026Updated 4 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
HVision-NKU / OneVAE
View on GitHub
☆55Sep 21, 2025Updated 10 months ago
fuyyyyy / EMOE
View on GitHub
[CVPR'25] EMOE: Modality-Specific Enhanced Dynamic Emotion Experts
☆120Jul 12, 2025Updated last year
xinliu29 / NCMNet
View on GitHub
[TPAMI2024] NCMNet: Neighbor Consistency Mining Network for Two-View Correspondence Pruning. [CVPR2023] Progress…
☆57Mar 20, 2025Updated last year
lzyhha / HSSL
View on GitHub
Enhancing Representations through Heterogeneous Self-Supervised Learning (TPAMI 2025)
☆15May 2, 2025Updated last year
HVision-NKU / AR123
View on GitHub
Official Code for 'AR-1-to-3: Single Image to Consistent 3D Object Generation via Next-View Prediction' (ICCV 2025)
☆64Nov 8, 2025Updated 8 months ago
FishAndWasabi / Real-LOD
View on GitHub
Offical implementation of "Re-Aligning Language to Visual Objects with an Agentic Workflow"
☆34Apr 20, 2025Updated last year
VUT-HFUT / MAC_2024_baseline
View on GitHub
[MAC 2024] The baseline code for MAC 2024.
☆12Jun 3, 2025Updated last year
jinghan1he / VHR
View on GitHub
[ACL 2025] Cracking the Code of Hallucination in LVLMs with Vision-aware Head Divergence
☆21Jun 10, 2025Updated last year
downdric / MSD
View on GitHub
The official implementation of the paper "DIP: Dual Incongruity Perceiving Network for Sarcasm Detection"
☆36Dec 6, 2024Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
scofield7419 / EmpathyEar
View on GitHub
Multimodal Empathetic Chatbot
☆55Jul 16, 2024Updated 2 years ago
HVision-NKU / ASID-Caption
View on GitHub
ASID-Caption: Attribute-Structured and Quality-Verified Audiovisual Instruction Dataset and Training Pipeline for Fine-Grained Video Unde…
☆68Mar 3, 2026Updated 4 months ago
ZhangLab-DeepNeuroCogLab / EmoEditor
View on GitHub
☆16Oct 13, 2025Updated 9 months ago
lvyiwei1 / DIME
View on GitHub
☆11Aug 20, 2024Updated last year
MCG-NKU / SERE
View on GitHub
Exploring Feature Self-relation for Self-supervised Transformer (TPAMI 2023)
☆21Apr 30, 2025Updated last year
Linxi-ZHAO / MARINE
View on GitHub
☆19Jun 6, 2025Updated last year
apple / ml-mobileclip-dr
View on GitHub
RayGen: Multi-Modal Dataset Reinforcement for MobileCLIP and MobileCLIP2
☆40Mar 12, 2026Updated 4 months ago