This repository is build in association with our position paper on "Multimodality for NLP-Centered Applications: Resources, Advances and Frontiers". As a part of this release we share the information about recent multimodal datasets which are available for research purposes. We found that although 100+ multimodal language resources are availab…
☆334Jan 10, 2022Updated 4 years ago
Alternatives and similar repositories for Multimodal-datasets
Users that are interested in Multimodal-datasets are comparing it to the libraries listed below
Sorting:
- ☆183Mar 20, 2020Updated 6 years ago
- ☆17Oct 2, 2024Updated last year
- Fine grained Empathy Direction Detection☆16Dec 11, 2020Updated 5 years ago
- Reading list for research topics in multimodal machine learning☆6,843Aug 20, 2024Updated last year
- Multimodal Sarcasm Detection Dataset☆369Aug 22, 2024Updated last year
- code for EMNLP 2024 paper: Interpreting Arithmetic Mechanism in Large Language Models through Comparative Neuron Analysis☆12Nov 17, 2024Updated last year
- Scripts to evaluate various bias metrics for different NLG models + decoding algorithms☆16Dec 6, 2023Updated 2 years ago
- Official implementation of "MMNeuron: Discovering Neuron-Level Domain-Specific Interpretation in Multimodal Large Language Model". Our co…☆25Dec 20, 2024Updated last year
- code for "Supervised Prototypical Contrastive Learning for Emotion Recognition in Conversation, EMNLP 22"☆80Feb 9, 2023Updated 3 years ago
- Code Repository For ACL2021 Paper - DynaEval: Unifying Turn and Dialogue Level Evaluation☆13Sep 2, 2022Updated 3 years ago
- ☆15Mar 20, 2025Updated last year
- IRFL: Image Recognition of Figurative Language☆11Nov 30, 2023Updated 2 years ago
- Original PyTorch Implementation for the EMNLP 2023 Paper "Beyond Detection: A Defend-and-Summarize Strategy for Robust and Interpretable …☆16Dec 14, 2023Updated 2 years ago
- [ACL 2022] The source code of Multi-Modal Sarcasm Detection via Cross-Modal Graph Convolutional Network☆40Mar 20, 2023Updated 3 years ago
- Framework for controlling demographic biases in NLG (using adversarial prompts)☆21Jun 12, 2023Updated 2 years ago
- Code for "SCL-RAI: Span-based Contrastive Learning with Retrieval Augmented Inference for Unlabeled Entity Problem in NER" @COLING-2022☆11Aug 20, 2022Updated 3 years ago
- Multimodal Model for Memotion Dataset☆12May 17, 2021Updated 4 years ago
- This repository contains various models targetting multimodal representation learning, multimodal fusion for downstream tasks such as mul…☆908Mar 15, 2023Updated 3 years ago
- ☆15Dec 10, 2021Updated 4 years ago
- CPED: A Large-Scale Chinese Personalized and Emotional Dialogue Dataset for Conversational AI | 中文个性情感对话数据集☆273Nov 10, 2022Updated 3 years ago
- Source code of our paper "Focus on the Target’s Vocabulary: Masked Label Smoothing for Machine Translation" @ ACL 2022☆13Apr 13, 2022Updated 3 years ago
- HD-EPIC Python script to download the entire datasets or parts of it☆17Oct 7, 2025Updated 5 months ago
- ☆32Jan 28, 2026Updated last month
- [LREC] MMChat: Multi-Modal Chat Dataset on Social Media☆108Sep 25, 2022Updated 3 years ago
- This repository provides a comprehensive collection of research papers focused on multimodal representation learning, all of which have b…☆84Jun 16, 2025Updated 9 months ago
- Some papers about *diverse* image (a few videos) captioning☆26Apr 4, 2023Updated 2 years ago
- [KDD 2023] FedMultimodal: A Benchmark For Multimodal Federated Learning☆140May 24, 2025Updated 9 months ago
- Official pytorch implementation of MuST: Multi-Scale Transformers for Surgical Phase Recognition MICCAI 2024☆15Jan 13, 2025Updated last year
- A Universal Platform for Training and Evaluation of Mobile Interaction☆61Sep 24, 2025Updated 5 months ago
- MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in Conversation☆1,015Mar 10, 2024Updated 2 years ago
- The data and code for NumerSense (EMNLP2020)☆19May 8, 2023Updated 2 years ago
- Humor Knowledge Enriched Transformer☆32Oct 20, 2021Updated 4 years ago
- This is the code for my master thesis.☆19Aug 18, 2022Updated 3 years ago
- ☆25Aug 1, 2024Updated last year
- Question-Directed Graph Attention Network for Numerical Reasoning over Text☆10Aug 14, 2020Updated 5 years ago
- This repo contains code for the paper "Psychologically-informed chain-of-thought prompts for metaphor understanding in large language mod…☆14Apr 28, 2023Updated 2 years ago
- Source code and data used in the papers ViQuAE (Lerner et al., SIGIR'22), Multimodal ICT (Lerner et al., ECIR'23) and Cross-modal Retriev…☆38Dec 19, 2024Updated last year
- Latest Advances on Multimodal Large Language Models☆17,466Mar 12, 2026Updated last week
- PyTorch code for Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles (DANCE)☆23Nov 29, 2022Updated 3 years ago