MIntRec: A New Dataset for Multimodal Intent Recognition (ACM MM 2022)
☆128May 2, 2025Updated 9 months ago
Alternatives and similar repositories for MIntRec
Users that are interested in MIntRec are comparing it to the libraries listed below
Sorting:
- MIntRec2.0 is the first large-scale dataset for multimodal intent recognition and out-of-scope detection in multi-party conversations (IC…☆72Aug 13, 2025Updated 6 months ago
- TCL-MAP is a powerful method for multimodal intent recognition (AAAI 2024)☆56Jan 25, 2024Updated 2 years ago
- The first comprehensive multimodal language analysis benchmark for evaluating foundation models☆28Sep 22, 2025Updated 5 months ago
- Paper List for Dialogue and Interactive Systems☆15Jun 5, 2020Updated 5 years ago
- Deep Open Intent Classification with Adaptive Decision Boundary (AAAI 2021)☆78Feb 7, 2022Updated 4 years ago
- TEXTOIR is the first opensource toolkit for text open intent recognition. (ACL 2021)☆243Nov 26, 2025Updated 3 months ago
- Discovering New Intents via Constrained Deep Adaptive Clustering with Cluster Refinement (AAAI2020)☆46Dec 8, 2022Updated 3 years ago
- Papers for Open Knowledge Discovery☆120Dec 21, 2023Updated 2 years ago
- Code for CVPR 2021 paper Exploring Heterogeneous Clues for Weakly-Supervised Audio-Visual Video Parsing☆24Dec 29, 2021Updated 4 years ago
- ☆26Apr 25, 2022Updated 3 years ago
- MMSA is a unified framework for Multimodal Sentiment Analysis.☆958Jan 15, 2025Updated last year
- ☆27Apr 29, 2025Updated 10 months ago
- 💭 Intentonomy: towards Human Intent Understanding [CVPR 2021]☆41Jul 22, 2021Updated 4 years ago
- Pytorch implementation for Tailor Versatile Multi-modal Learning for Multi-label Emotion Recognition☆65Nov 16, 2022Updated 3 years ago
- Sapsucker Woods 60 Audiovisual Dataset☆17Oct 7, 2022Updated 3 years ago
- Official code for HeterMPC (ACL 22) & MADNet (EMNLP 23) for Response Generation in Multi-Party Conversations☆14May 14, 2024Updated last year
- A Tool for extracting multimodal features from videos.☆206Feb 11, 2023Updated 3 years ago
- cross modal background suppression for audio-visual event localization☆36Mar 18, 2022Updated 3 years ago
- "Can images help recognize entities? A study of the role of images for Multimodal NER" (W-NUT at EMNLP 2021)☆21Nov 14, 2021Updated 4 years ago
- Code for paper "Modeling Discriminative Representations for Out-of-Domain Detection with Supervised Contrastive Learning"☆20Sep 6, 2021Updated 4 years ago
- PyTorch and NNsight implementation of AtP* (Kramar et al 2024, DeepMind)☆20Jan 19, 2025Updated last year
- The repo for "On-the-fly Modulation for Balanced Multimodal Learning", T-PAMI 2024☆19Sep 29, 2024Updated last year
- ☆213Dec 5, 2021Updated 4 years ago
- This repository contains the implementation of the paper -- Bi-Bimodal Modality Fusion for Correlation-Controlled Multimodal Sentiment An…☆72Apr 16, 2023Updated 2 years ago
- [Paperlist] Awesome paper list of multimodal dialog, including methods, datasets and metrics☆37Jan 22, 2025Updated last year
- Pytorch implementation of Spiking Neural Networks for Human Activity Recognition.☆20Dec 11, 2022Updated 3 years ago
- Official Codebase of "A Closer Look at Weakly-Supervised Audio-Visual Source Localization" (NeurIPS 2022)☆20Dec 6, 2022Updated 3 years ago
- Code for mixup contrastive learning☆21Mar 19, 2021Updated 4 years ago
- This repository contains various models targetting multimodal representation learning, multimodal fusion for downstream tasks such as mul…☆905Mar 15, 2023Updated 2 years ago
- Towards Robust Multimodal Sentiment Analysis with Incomplete Data☆104Updated this week
- Open source code for EMNLP 2020 Findings Paper "AGIF: An Adaptive Graph-Interactive Framework for Joint Multiple Intent Detection and Slo…☆89Dec 17, 2021Updated 4 years ago
- MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in Conversation☆1,006Mar 10, 2024Updated last year
- Code for paper: “What Data Benefits My Classifier?” Enhancing Model Performance and Interpretability through Influence-Based Data Selecti…☆23May 17, 2024Updated last year
- This repo contains evaluation code for the paper "AV-Odyssey: Can Your Multimodal LLMs Really Understand Audio-Visual Information?"☆31Dec 23, 2024Updated last year
- Implementation of the research paper Consistent Representation Learning for Continual Relation Extraction (Findings of ACL 2022)☆26May 16, 2022Updated 3 years ago
- PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion☆59Feb 29, 2024Updated 2 years ago
- ☆96Nov 28, 2022Updated 3 years ago
- Emotion-LLaMA: Multimodal Emotion Recognition and Reasoning with Instruction Tuning☆525Nov 17, 2025Updated 3 months ago
- LogicCircuit is a program that helps build/simulate simple circuits using logic gates. It is meant to teach people the basics of how logi…☆10Feb 16, 2026Updated last week