Tim-101 / Text-and-Image-Classification
Classify image and text with ResNet and BERT models using Pytorch
☆13Updated 4 years ago
Alternatives and similar repositories for Text-and-Image-Classification:
Users that are interested in Text-and-Image-Classification are comparing it to the libraries listed below
- The code for "Does Head Label Help for Long-Tailed Multi-Label Text Classific"☆30Updated 4 years ago
- Feature Projection for Improved Text Classification☆45Updated 5 years ago
- ☆13Updated 2 years ago
- [ACM MM 2021 Oral] Exploiting BERT For Multimodal Target Sentiment Classification Through Input Space Translation"☆40Updated 3 years ago
- Implementation of the Benchmark Approaches for Medical Instructional Video Classification (MedVidCL) and Medical Video Question Answering…☆27Updated 2 years ago
- [Paper][ISWC 2021] Zero-shot Visual Question Answering using Knowledge Graph☆71Updated last year
- Multi-modal Graph Fusion for Named Entity Recognition with Targeted Visual Guidance☆67Updated 6 months ago
- It is the implementation of paper "Multi-Modal Sarcasm Detection in Twitter with Hierarchical Fusion Model"☆15Updated 2 years ago
- Recent Advances in Visual Dialog☆30Updated 2 years ago
- ☆59Updated 3 years ago
- Unofficial reimplementation of Dynamic Fusion with Intra- and Inter-modality Attention Flow for Visual Question Answering☆18Updated 5 years ago
- Multimodal classification solution for the SIGIR eCOM using Co-attention and transformer language models☆19Updated 4 years ago
- A Few-Shot Learning based Approach to Multimodal Social Relation Extraction☆13Updated 2 years ago
- ☆38Updated 2 years ago
- [ACL 2022] The source code of Multi-Modal Sarcasm Detection via Cross-Modal Graph Convolutional Network☆36Updated 2 years ago
- The code of IJCAI2022 paper, Declaration-based Prompt Tuning for Visual Question Answering☆19Updated 2 years ago
- Implementation of Mutan+ArticleNet on OKVQA☆10Updated 4 years ago
- ☆22Updated 3 years ago
- Official implementation for the MM'22 paper.☆12Updated 2 years ago
- MultiSentiNet-CIKM2017☆21Updated 7 years ago
- implementation for Mucko: Multi-Layer Cross-Modal Knowledge Reasoning for Fact-based Visual Question Answering☆10Updated 3 years ago
- ☆15Updated last year
- ☆28Updated 2 years ago
- ☆8Updated 2 years ago
- ☆28Updated 2 years ago
- A GCN based visual question generation model☆13Updated 5 years ago
- The source code of ACL 2020 paper: "Cross-Modality Relevance for Reasoning on Language and Vision"☆27Updated 3 years ago
- Source code for training Gated Multimodal Units on MM-IMDb dataset☆93Updated 2 years ago
- A repo for REMOD: relation extraction algorithm based on multimodality knowledge distillation☆28Updated 3 years ago
- Code for our ACL2021 paper: "Check It Again: Progressive Visual Question Answering via Visual Entailment"☆31Updated 3 years ago