kiva12138 / MIMRLView external linksLinks
The implementation codes of paper: Multimodal Sentiment Analysis with Mutual Information-based Disentangled Representation Learning
☆18May 8, 2025Updated 9 months ago
Alternatives and similar repositories for MIMRL
Users that are interested in MIMRL are comparing it to the libraries listed below
Sorting:
- ☆13Jan 11, 2024Updated 2 years ago
- ☆13Apr 2, 2025Updated 10 months ago
- Code for MInD: Multimodal Information Disentanglement☆17Dec 17, 2025Updated last month
- ☆19Aug 22, 2024Updated last year
- ☆25Apr 16, 2025Updated 9 months ago
- The implementation of CubeMLP☆51May 8, 2023Updated 2 years ago
- The source code for the paper titled "Sentiment Knowledge Enhanced Attention Fusion Network (SKEAFN)".☆30Aug 17, 2023Updated 2 years ago
- "MULTIMODAL EMOTION RECOGNITION BASED ON DEEP TEMPORAL FEATURES USING CROSS-MODAL TRANSFORMER AND SELF-ATTENTION" ICASSP'23☆23Feb 26, 2023Updated 2 years ago
- ☆27Apr 29, 2025Updated 9 months ago
- MultiEMO: An Attention-Based Correlation-Aware Multimodal Fusion Framework for Emotion Recognition in Conversations (ACL 2023)☆92Nov 17, 2023Updated 2 years ago
- A vision-language model with bidirectional progressive fusion and global-local alignment for enhanced medical image segmentation.☆17Dec 25, 2025Updated last month
- ☆40Apr 16, 2024Updated last year
- 多模态情绪识别方法研究(Multimodal Emotion Recognition)☆23Dec 29, 2025Updated last month
- ☆70Jul 25, 2024Updated last year
- [Findings of NAACL 2024] Emotion-Anchored Contrastive Learning Framework for Emotion Recognition in Conversation☆39Nov 23, 2024Updated last year
- [CVPR 2024] MMSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos☆37Jan 29, 2025Updated last year
- [ACM MM2024] The code for HMLLM.☆11Oct 27, 2024Updated last year
- Official implementation of Bayes Conditional Distribution Estimation for Knowledge Distillation Based on Conditional Mutual Information☆11Sep 28, 2023Updated 2 years ago
- [IEEE TIP] Offical implementation for the work "BadCM: Invisible Backdoor Attack against Cross-Modal Learning".☆14Aug 30, 2024Updated last year
- DL Backtrace is a new explainablity technique for deep learning models that works for any modality and model type.☆20Updated this week
- The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"☆11May 16, 2023Updated 2 years ago
- A Fully End2End Multimodal System for Fast Yet Effective Video Emotion Recognition☆40Aug 12, 2024Updated last year
- An implementation of several unsupervised object discovery models (Slot Attention, SLATE, GNM) in PyTorch with pre-trained models.☆14May 26, 2025Updated 8 months ago
- ☆10Oct 16, 2025Updated 3 months ago
- ☆11Nov 11, 2022Updated 3 years ago
- Hypergraph Vision Transformers: Images are More than Nodes, More than Edges☆17Jul 25, 2025Updated 6 months ago
- Official implementation of the paper "M3CoTBench: Benchmark Chain-of-Thought of MLLMs in Medical Image Understanding"☆20Jan 14, 2026Updated 3 weeks ago
- Official Code for "Painting with Words: Elevating Detailed Image Captioning with Benchmark and Alignment Learning" (ICLR 2025)☆12Mar 6, 2025Updated 11 months ago
- Official codebase for FACMIC: Federated Adaptative CLIP Model for Medical Image Classification (Accepted at MICCAI 2024)☆14Jun 21, 2024Updated last year
- Official repository for ACM Multimedia'24 paper "MultiHateClip: A Multilingual Benchmark Dataset for Hateful Video Detection on YouTube a…☆17Aug 11, 2024Updated last year
- Awesome Multimodal Fusion in Speech Emotion Recognition☆13Nov 11, 2025Updated 3 months ago
- A lecture summarization tool that uses AI and computer vision to summarize and index videos☆11Dec 8, 2022Updated 3 years ago
- Substitute alternative spellings of special characters (e.g. German umlauts [ae, oe, ue] and [ss]) with their correct versions (ä, ö, ü, …☆11Nov 24, 2024Updated last year
- ☆16Aug 15, 2024Updated last year
- Code for paper "Cross-Domain Slot Filling as Machine Reading Comprehension" in IJCAI 2021☆11Aug 24, 2021Updated 4 years ago
- ☆13Oct 17, 2020Updated 5 years ago
- Multimodal SER Model meant to be trained on recognising emotions from speech (text + acoustic data). Fine-tuned the DeBERTaV3 model, resp…☆11Jun 19, 2024Updated last year
- This is the repo for "Adaptive Unimodal Regulation for Balanced Multimodal Information Acquisition", CVPR2025.☆20Dec 22, 2025Updated last month
- ☆10Jan 18, 2024Updated 2 years ago