iCVTEAM / M3TR
M3TR: Multi-modal Multi-label Recognition with Transformer. ACM MM 2021
☆16 Updated 4 years ago
Alternatives and similar repositories for M3TR
Users who are interested in M3TR are comparing it to the repositories listed below.
- PyTorch Implementation of Deep Equilibrium Multimodal Fusion ☆21 Updated 2 years ago
- ☆152 Updated last year
- The official implementation for ALOFT (CVPR 2023). ☆57 Updated 2 years ago
- A PyTorch implementation of CMT based on paper CMT: Convolutional Neural Networks Meet Vision Transformers. ☆72 Updated 2 years ago
- [MIPR 2022 & TMM 2023] "Attentive Graph Neural Networks for Few-shot Learning" with its extension version ☆15 Updated 2 years ago
- Official implementation of "CAT: Cross Attention in Vision Transformer". ☆168 Updated 3 years ago
- 2021 AAAI Modular Graph Transformer Networks for Multi-Label Image Classification; Official GitHub: https://github.com/ReML-AI/MGTN ☆21 Updated 4 years ago
- [ACMMM 2020] Code release for "Learning Deep Multimodal Feature Representation with Asymmetric Multi-layer Fusion" ☆28 Updated 4 years ago
- ☆46 Updated 2 years ago
- ☆69 Updated last year
- ☆50 Updated 4 years ago
- Scattering Vision Transformer ☆54 Updated last year
- Vision Transformers with Hierarchical Attention ☆102 Updated 4 months ago
- [BMVC 2021] The official PyTorch implementation of Feature Fusion Vision Transformer for Fine-Grained Visual Categorization ☆49 Updated 3 years ago
- This repo is the official implementation of "*[Adaptive Frequency Filters As Efficient Global Token Mixers](https://arxiv.org/abs/2307.14… ☆18 Updated last year
- ☆10 Updated 3 years ago
- ☆158 Updated last year
- This is an unofficial implementation of BOAT: Bilateral Local Attention Vision Transformer ☆55 Updated 3 years ago
- CMT: Convolutional Neural Networks Meet Vision Transformers ☆122 Updated 4 years ago
- ☆85 Updated 2 years ago
- Official code release of our paper "EViT: An Eagle Vision Transformer with Bi-Fovea Self-Attention" ☆21 Updated 4 months ago
- ☆28 Updated 2 years ago
- [ICCV2023] "Vision HGNN: An Image is More than a Graph of Nodes" by Yan Han, Peihao Wang, Souvik Kundu, Ying Ding, and Zhangyang Wang ☆61 Updated 6 months ago
- This repository is the code of the paper "Sparse Spatial Transformers for Few-Shot Learning" (SCIENCE CHINA Information Sciences). ☆49 Updated 2 years ago
- ☆148 Updated last year
- ☆31 Updated 3 years ago
- Modular Graph Transformer Networks ☆22 Updated 4 years ago
- PyTorch implementation of Deep Semantic Dictionary Learning for Multi-label Image Classification, AAAI 2021. ☆50 Updated 4 years ago
- How Much Position Information Do Convolutional Neural Networks Encode? ☆11 Updated 4 years ago
- ☆36 Updated 3 years ago