Code for "Modeling Multimodal Social Interactions: New Challenges and Baselines with Densely Aligned Representations" (CVPR 2024 Oral)
☆18Jun 23, 2024Updated last year
Alternatives and similar repositories for MMSI
Users that are interested in MMSI are comparing it to the libraries listed below
Sorting:
- [ECCV2024] Nonverbal Interaction Detection☆29Oct 30, 2024Updated last year
- Why We Feel: Breaking Boundaries in Emotional Reasoning with Multimodal Large Language Models☆25Sep 30, 2025Updated 5 months ago
- Official pyTorch implementation of Transformer-based PAUP model for sequential recommentation, SIGIR 2022☆10Sep 8, 2022Updated 3 years ago
- [ICCV 2023] Official PyTorch Implementation for "Mitigating Adversarial Vulnerability through Causal Parameter Estimation by Adversarial …☆31Oct 13, 2023Updated 2 years ago
- Official repository for the paper “MME-Emotion: A Holistic Evaluation Benchmark for Emotional Intelligence in Multimodal Large Language M…☆21Jan 17, 2026Updated last month
- Official PyTorch Implementation Code for Developing Super Fast Adversarial Training with Distributed Data Parallel, Channel Last Memory F…☆33Mar 13, 2023Updated 2 years ago
- [NeurIPS 2025] Official Implementation of paper "Sherlock: Self-Correcting Reasoning in Vision-Language Models"☆28Sep 18, 2025Updated 5 months ago
- [ICLR 2025] Let Your Features Tell The Differences: Understanding Graph Convolution By Feature Splitting☆14Nov 24, 2025Updated 3 months ago
- [CVPR 2024] AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation☆45Sep 6, 2024Updated last year
- SVM classifiers built for emotion classification☆10Apr 27, 2016Updated 9 years ago
- This is the official implementation of the ICML 2023 paper "Fair yet Asymptotically Equal Collaborative Learning"☆10May 29, 2023Updated 2 years ago
- ☆12Jun 2, 2025Updated 9 months ago
- [CVPR 2022] Sequential Voting with Relational Box Fields for Active Object Detection☆10Jun 19, 2022Updated 3 years ago
- Pytorch implementation of "Towards Practical and Efficient Image-to-Speech Captioning with Vision-Language Pre-training and Multi-modal T…☆12Mar 9, 2024Updated last year
- the implementation of TMNN. The paper is Dynamic Cardiac MRI Reconstruction Using Combined Tensor Nuclear Norm and Casorati Matrix Nuclea…☆11May 31, 2022Updated 3 years ago
- ☆14Mar 15, 2025Updated 11 months ago
- Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"☆14Feb 24, 2025Updated last year
- Official implementation for "Enhancing Semantics in Multimodal Chain of Thought via Soft Negative Sampling"☆10May 21, 2024Updated last year
- Multi-Aspect Controllable Text Generation with Disentangled Counterfactual Augmentation, ACL 2024 (main)☆13Sep 23, 2024Updated last year
- ☆13May 21, 2024Updated last year
- The speaker-labeled information of LRW dataset, which is the outcome of the paper "Speaker-adaptive Lip Reading with User-dependent Paddi…☆10Oct 12, 2023Updated 2 years ago
- ☆12Nov 2, 2023Updated 2 years ago
- My slides and examples for bachelor deep learning course☆12Jun 2, 2022Updated 3 years ago
- splinter 中文文档☆11Dec 4, 2017Updated 8 years ago
- MAVERICS (Manually-vAlidated Vq^2a Examples fRom Image-Caption datasetS) is a suite of test-only benchmarks for visual question answering…☆13Feb 18, 2023Updated 3 years ago
- [ECCV 2024] Official PyTorch implementation of "Classification Matters: Improving Video Action Detection with Class-Specific Attention"