[ICML 2025 Spotlight] MODA: MOdular Duplex Attention for Multimodal Perception, Cognition, and Emotion Understanding
☆67Jul 10, 2025Updated 8 months ago
Alternatives and similar repositories for MODA
Users that are interested in MODA are comparing it to the libraries listed below
Sorting:
- ☆10Apr 15, 2023Updated 2 years ago
- [ICCV 2023] This is the official implementation of "Multiple Planar Object Tracking"☆24Aug 19, 2023Updated 2 years ago
- [ACM MM 2022] This is the official implementation of "Temporal Sentiment Localization: Listen and Look in Untrimmed Videos"☆18Feb 14, 2025Updated last year
- [TNNLS 2023] This is official implementation of "PlaneSeg: Building a Plug-in for Boosting Planar Region Segmentation"☆24Aug 27, 2023Updated 2 years ago
- [ACM MM 2022 Oral] This is the official implementation of "SER30K: A Large-Scale Dataset for Sticker Emotion Recognition"☆29Oct 18, 2022Updated 3 years ago
- ☆16Oct 13, 2025Updated 4 months ago
- A new model for gait emotion recognition☆15Mar 22, 2024Updated last year
- Overworld's local world client interface to run Waypoint world models☆46Updated this week
- Official code for ICML 2024 paper "Learning to Continually Learn with the Bayesian Principle"☆20May 27, 2024Updated last year
- [WACV 2024] Code release for "VEATIC: Video-based Emotion and Affect Tracking in Context Dataset"☆21Jan 14, 2026Updated last month
- Efficient Two-Step Networks for Temporal Action Segmentation (Neurocomputing 2021)☆17Dec 6, 2021Updated 4 years ago
- The official implementation of the paper "DIP: Dual Incongruity Perceiving Network for Sarcasm Detection"☆37Dec 6, 2024Updated last year
- End2End Virtual Try-on with Visual Reference, CVPR2026☆58Nov 19, 2025Updated 3 months ago
- (CVPR2024) Realigning Confidence with Temporal Saliency Information for Point-level Weakly-Supervised Temporal Action Localization☆20Jun 11, 2024Updated last year
- a unified and simple codebase for weakly-supervised temporal action localization☆19Sep 30, 2023Updated 2 years ago
- Code of our Neurips2020 paper "Auto Learning Attention", coming soon☆22Apr 14, 2021Updated 4 years ago
- [ICML'25 Spotlight] Catch Your Emotion: Sharpening Emotion Perception in Multimodal Large Language Models☆46Jan 21, 2026Updated last month
- [ICCV 2025] Official repository of DiffSim: Taming Diffusion Models for Evaluating Visual Similarity☆30Jul 14, 2025Updated 7 months ago
- Pose Extraction & Rendering for SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representat…☆184Dec 28, 2025Updated 2 months ago
- A large-scale dataset for classification and detection of apple leaf diseases☆12Apr 1, 2023Updated 2 years ago
- ICCV25 highlight☆51Jan 7, 2026Updated 2 months ago
- [ICLR26] GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning☆104Jan 27, 2026Updated last month
- [ICLR 2025] A Comprehensive Framework for Developing and Evaluating Multimodal Role-Playing Agents☆90Feb 2, 2026Updated last month
- MiniMax-Provider-Verifier offers a rigorous, vendor-agnostic way to verify whether third-party deployments of the Minimax M2 model are co…☆29Feb 18, 2026Updated 2 weeks ago
- ☆55Updated this week
- [ICRA 2026] StereoAdapter: Adapting Stereo Depth Estimation to Underwater Scenes☆20Feb 17, 2026Updated 3 weeks ago
- ☆40Apr 16, 2024Updated last year
- ☆66Feb 23, 2026Updated 2 weeks ago
- CVPR 2026 | Official Implementation of "MultiShotMaster: A Controllable Multi-Shot Video Generation Framework" 🔥☆102Feb 22, 2026Updated 2 weeks ago
- Open Source Neural Architecture Search Toolbox for Device-aware Image Dense Prediction & Official implementation of ICCV2021 "iNAS: Integ…☆84Apr 11, 2022Updated 3 years ago
- ☆18Feb 16, 2025Updated last year
- [ICCV 2025] Unbiased Region-Language Alignment for Open-Vocabulary Dense Prediction☆51Sep 22, 2025Updated 5 months ago
- Copilot with deepseek and more...☆13Mar 7, 2025Updated last year
- Community maintained hardware plugin for vLLM on AWS Neuron☆24Feb 26, 2026Updated last week
- 中国矿业大学本科毕业论文word模板2023版☆12Mar 29, 2023Updated 2 years ago
- The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"☆11May 16, 2023Updated 2 years ago
- ☆11Oct 31, 2024Updated last year
- ☆23Dec 11, 2025Updated 2 months ago
- Transferring Genshin PVs into a freehand style with Diffusion Model.☆10Jun 5, 2024Updated last year