iCVTEAM / M3TR
M3TR: Multi-modal Multi-label Recognition with Transformer. ACM MM 2021
☆15, updated 3 years ago
Alternatives and similar repositories for M3TR
Users who are interested in M3TR are comparing it to the repositories listed below.
- The official implementation for ALOFT (CVPR 2023). ☆56, updated 2 years ago
- PyTorch Implementation of Deep Equilibrium Multimodal Fusion ☆21, updated 2 years ago
- [ACMMM 2020] Code release for "Learning Deep Multimodal Feature Representation with Asymmetric Multi-layer Fusion" ☆28, updated 4 years ago
- This repo includes the CUB-GHA (Gaze-based Human Attention) dataset and code of the paper "Human Attention in Fine-grained Classification… ☆31, updated 3 years ago
- [CVPR' 23] Adjustment and Alignment for Unbiased Open Set Domain Adaptation ☆21, updated 2 years ago
- A PyTorch implementation of CMT based on paper CMT: Convolutional Neural Networks Meet Vision Transformers. ☆71, updated 2 years ago
- [MIPR 2022 & TMM 2023] "Attentive Graph Neural Networks for Few-shot Learning" with its extension version ☆15, updated 2 years ago
- 2021 AAAI Modular Graph Transformer Networks for Multi-Label Image Classification; Official GitHub: https://github.com/ReML-AI/MGTN ☆21, updated 4 years ago
- Official implementation of CVPR2023 paper "Bi-directional distribution alignment for transductive zero-shot learning" ☆35, updated last year
- ☆10, updated 3 years ago
- End-to-End CLIP-driven Mamba Model for Multi-modal Fusion ☆21, updated 4 months ago
- ☆152, updated last year
- AugTarget data augmentation for infrared small target detection. ☆21, updated 2 years ago
- Vision Transformers with Hierarchical Attention ☆102, updated last month
- Transformer-based Dual Relation Graph for Multi-label Image Recognition. ICCV 2021 ☆49, updated 3 years ago
- ☆68, updated last year
- The official implementation for DomainDrop (ICCV 2023). ☆50, updated last year
- [TIP] Learning to Discover Multi-Class Attentional Regions for Multi-Label Image Recognition ☆45, updated 2 years ago
- ☆147, updated last year
- ☆85, updated 2 years ago
- Official code release of our paper "EViT: An Eagle Vision Transformer with Bi-Fovea Self-Attention" ☆21, updated last month
- ☆151, updated last year
- ☆32, updated 3 years ago
- The official repository of the paper "Learning Correlation Structures for Vision Transformers" accepted to CVPR 2024. ☆47, updated last year
- ☆26, updated 2 years ago
- ☆26, updated 2 years ago
- Official implementation of SPANet in ICCV2023 ☆23, updated 2 months ago
- Exploring Multi-Timestep Multi-Stage Diffusion Features for Hyperspectral Image Classification ☆13, updated last year
- The official repository implement of Res-VMamba: Fine-Grained Food Category Visual Classification Using Selective State Space Models with… ☆79, updated 2 months ago
- Implementation Code for paper "Efficient Multimodal Fusion via Interactive Prompting" in CVPR2023 ☆17, updated 2 years ago