artelab / Multi-modal-classification
This project contains the code of the implementation of the approach proposed in I. Gallo, A. Calefati, S. Nawaz and M.K. Janjua, "Image and Encoded Text Fusion for Multi-Modal Classification", DICTA2018, Canberra, Australia.
☆20Updated 5 years ago
Alternatives and similar repositories for Multi-modal-classification:
Users that are interested in Multi-modal-classification are comparing it to the libraries listed below
- Multimodal classification solution for the SIGIR eCOM using Co-attention and transformer language models☆19Updated 4 years ago
- ☆19Updated 3 years ago
- [ACMMM 2020] Code release for "Learning Deep Multimodal Feature Representation with Asymmetric Multi-layer Fusion"☆28Updated 3 years ago
- Philo: uniting modalities☆24Updated 4 years ago
- PyTorch implementation of Deep Semantic Dictionary Learning for Multi-label Image Classification, AAAI 2021.☆47Updated 3 years ago
- multimodal social media content (text, image) classification☆50Updated 2 years ago
- ☆58Updated 3 years ago
- Paper List about Radiology Report Generation and also some medical image captioning☆10Updated 3 years ago
- Scalable deep multimodal learning for cross-modal retrieval (SIGIR 2019, PyTorch Code)☆34Updated 4 years ago
- Multi-modal classifications of digits with image and audio modality. One shot learning with Siamese network is used to predict if the giv…☆15Updated last year
- The offical code for paper "Matching Images and Text with Multi-modal Tensor Fusion and Re-ranking", ACM Multimedia 2019 Oral☆68Updated 5 years ago
- Gluon implementation of channel-attention modules: SE, ECA, GCT☆38Updated 4 years ago
- Source code for training Gated Multimodal Units on MM-IMDb dataset☆92Updated last year
- ☆42Updated 3 years ago
- Multi-Label Classification☆17Updated 6 years ago
- 2021 AAAI Modular Graph Transformer Networks for Multi-Label Image Classification; Official GitHub: https://github.com/ReML-AI/MGTN☆21Updated 3 years ago
- Jhhuangkay / DeepOpht-Medical-Report-Generation-for-Retinal-Images-via-Deep-Models-and-Visual-Explanation☆34Updated this week
- [Reproduce] Code for the ACL2019 paper "Multimodal Transformer for Unaligned Multimodal Language Sequences".☆23Updated 5 years ago
- Code release for Grad-CAM Guided Attention Module for Fine-grained Visual Classification (MLSP 2022)☆12Updated 3 years ago
- ☆12Updated 3 years ago
- [TIP] Learning to Discover Multi-Class Attentional Regions for Multi-Label Image Recognition☆44Updated last year
- Multimodal Compact Bilinear Pooling class in Python☆11Updated 5 years ago
- Bimodal and Unimodal Sentiment Analysis of Internet Memes (Image+Text)☆16Updated 3 years ago
- This repo includes the CUB-GHA (Gaze-based Human Attention) dataset and code of the paper "Human Attention in Fine-grained Classification…☆30Updated 2 years ago
- ☆11Updated 4 years ago
- The source code of 'Categorical Relation-Preserving Contrastive Knowledge Distillation for Medical Image Classification' (MICCAI 2021)☆17Updated 3 years ago
- Implementation of CVPR 2019 paper "Mfas: Multimodal fusion architecture search"☆77Updated 4 years ago
- Pytorch implementation for Deep Self-Learning From Noisy Labels☆34Updated 4 years ago
- An implementation of the Visual Transformer Architecture introduced in the paper "Visual Transformers: Token-based Image Representation a…☆17Updated 3 years ago
- codes for: Modality to Modality Translation: An Adversarial Representation Learning and Graph Fusion Network for Multimodal Fusion☆47Updated 3 years ago