drmuskangarg / Multimodal-datasetsView external linksLinks
This repository is build in association with our position paper on "Multimodality for NLP-Centered Applications: Resources, Advances and Frontiers". As a part of this release we share the information about recent multimodal datasets which are available for research purposes. We found that although 100+ multimodal language resources are availab…
☆330Jan 10, 2022Updated 4 years ago
Alternatives and similar repositories for Multimodal-datasets
Users that are interested in Multimodal-datasets are comparing it to the libraries listed below
Sorting:
- Course repository for the Spring 2023 COMP664 course "Deep Learning" at UNC☆14Apr 17, 2023Updated 2 years ago
- ☆17Oct 2, 2024Updated last year
- ☆182Mar 20, 2020Updated 5 years ago
- Reading list for research topics in multimodal machine learning☆6,809Aug 20, 2024Updated last year
- CPED: A Large-Scale Chinese Personalized and Emotional Dialogue Dataset for Conversational AI | 中文个性情感对话数据集☆268Nov 10, 2022Updated 3 years ago
- M3ED: Multi-modal Multi-scene Multi-label Emotional Dialogue Database. ACL 2022☆121Sep 24, 2022Updated 3 years ago
- Multimodal Model for Memotion Dataset☆12May 17, 2021Updated 4 years ago
- Recent Advances in Vision and Language Pre-training (VLP)☆295Jun 6, 2023Updated 2 years ago
- Multimodal Sarcasm Detection Dataset☆366Aug 22, 2024Updated last year
- ☆24Jan 28, 2026Updated 2 weeks ago
- This repository provides a comprehensive collection of research papers focused on multimodal representation learning, all of which have b…☆84Jun 16, 2025Updated 7 months ago
- code for "Supervised Prototypical Contrastive Learning for Emotion Recognition in Conversation, EMNLP 22"☆80Feb 9, 2023Updated 3 years ago
- Official implementation of "MMNeuron: Discovering Neuron-Level Domain-Specific Interpretation in Multimodal Large Language Model". Our co…☆25Dec 20, 2024Updated last year
- Object recognition with Pepper using a deep learning model☆10Sep 16, 2021Updated 4 years ago
- Question-Directed Graph Attention Network for Numerical Reasoning over Text☆10Aug 14, 2020Updated 5 years ago
- Code for "SCL-RAI: Span-based Contrastive Learning with Retrieval Augmented Inference for Unlabeled Entity Problem in NER" @COLING-2022☆11Aug 20, 2022Updated 3 years ago
- Code for the article "Shortcutted Commonsense: Data Spuriousness in Deep Learning of Commonsense Reasoning", Outstanding Paper at EMNLP20…☆10Nov 7, 2021Updated 4 years ago
- IRFL: Image Recognition of Figurative Language☆11Nov 30, 2023Updated 2 years ago
- Official repo of the paper Deep Regression Unlearning accepted in ICML 2023☆14Jun 14, 2023Updated 2 years ago
- Source code of our paper "Focus on the Target’s Vocabulary: Masked Label Smoothing for Machine Translation" @ ACL 2022☆13Apr 13, 2022Updated 3 years ago
- [ACL2023] Code and dataset for paper "MMSD2.0: Towards a Reliable Multi-modal Sarcasm Detection System"☆52Jan 2, 2024Updated 2 years ago
- Some papers about *diverse* image (a few videos) captioning☆26Apr 4, 2023Updated 2 years ago
- ☆25Aug 1, 2024Updated last year
- Summaries of machine learning papers☆12Aug 19, 2022Updated 3 years ago
- ☆10Aug 28, 2020Updated 5 years ago
- [LREC] MMChat: Multi-Modal Chat Dataset on Social Media☆108Sep 25, 2022Updated 3 years ago
- The code for the paper "Neutral Utterances are Also Causes: Enhancing Conversational Causal Emotion Entailment with Social Commonsense Kn…☆27May 22, 2022Updated 3 years ago
- Source code for our AAAI'22 paper 《From Dense to Sparse: Contrastive Pruning for Better Pre-trained Language Model Compression》☆25Dec 15, 2021Updated 4 years ago
- ORES: Open-vocabulary Responsible Visual Synthesis☆14Dec 12, 2023Updated 2 years ago
- Surrogate Model Extension (SME): A Fast and Accurate Weight Update Attack on Federated Learning [Accepted at ICML 2023]☆14Mar 31, 2024Updated last year
- Latest Advances on Multimodal Large Language Models☆17,337Updated this week
- A Universal Platform for Training and Evaluation of Mobile Interaction☆60Sep 24, 2025Updated 4 months ago
- This repository contains various models targetting multimodal representation learning, multimodal fusion for downstream tasks such as mul…☆905Mar 15, 2023Updated 2 years ago
- An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS(iDUS) implementation and experiment details.☆14Mar 20, 2024Updated last year
- Vision Large Language Models trained on M3IT instruction tuning dataset☆17Aug 16, 2023Updated 2 years ago
- This repo contains information about FeB4RAG collection☆17Feb 19, 2024Updated last year
- Sparse Autoencoders (SAE) vs CLIP fine-tuning fun.☆18Dec 19, 2024Updated last year
- ☆12May 6, 2024Updated last year
- Findings of EMNLP 2023: InfoCL: Alleviating Catastrophic Forgetting in Continual Text Classification from An Information Theoretic Perspe…☆14Aug 13, 2024Updated last year