iCVTEAM / M3TRLinks
M3TR: Multi-modal Multi-label Recognition with Transformer. ACM MM 2021
☆16Updated 4 years ago
Alternatives and similar repositories for M3TR
Users that are interested in M3TR are comparing it to the libraries listed below
Sorting:
- The official implementation for ALOFT (CVPR 2023).☆57Updated 2 years ago
- ☆10Updated 3 years ago
- [PR 2022, Highly Cited Paper] Learning Attention-Guided Pyramidal Features for Few-shot Fine-grained Recognition☆17Updated 3 years ago
- ☆28Updated 3 years ago
- 2021 AAAI Modular Graph Transformer Networks for Multi-Label Image Classification; Official GitHub: https://github.com/ReML-AI/MGTN☆21Updated 4 years ago
- [BMVC 2021] The official PyTorch implementation of Feature Fusion Vision Transformer for Fine-Grained Visual Categorization☆49Updated 3 years ago
- A PyTorch implementation of CMT based on paper CMT: Convolutional Neural Networks Meet Vision Transformers.☆72Updated 2 years ago
- [ACMMM 2020] Code release for "Learning Deep Multimodal Feature Representation with Asymmetric Multi-layer Fusion"☆28Updated 4 years ago
- [MIPR 2022 & TMM 2023] "Attentive Graph Neural Networks for Few-shot Learning" with its extension version☆16Updated 2 years ago
- ☆152Updated last year
- PyTorch Implementation of Deep Equilibrium Multimodal Fusion☆21Updated 2 years ago
- This is an unofficial implementation of BOAT: Bilateral Local Attention Vision Transformer☆55Updated 3 years ago
- PyTorch implementation of Deep Semantic Dictionary Learning for Multi-label Image Classification, AAAI 2021.☆50Updated 4 years ago
- [TIP] Learning to Discover Multi-Class Attentional Regions for Multi-Label Image Recognition☆45Updated 2 years ago
- Official implement of "CAT: Cross Attention in Vision Transformer".☆168Updated 3 years ago
- CMT: Convolutional Neural Networks Meet Vision Transformers☆122Updated 4 years ago
- ☆157Updated last year
- [ECCV 2022] LAFF for Text-to-Video Retrieval☆46Updated 2 years ago
- Transformer-based Dual Relation Graph for Multi-label Image Recognition. ICCV 2021☆50Updated 3 years ago
- ☆31Updated 3 years ago
- This repo shows the source code of IEEE TGRS 2022 article: Sonar Images Classification While Facing Long-Tail and Few-Shot.☆17Updated 2 years ago
- ☆149Updated last year
- [IEEE TPAMI'23] Pyramid Pooling Transformer for Scene Understanding☆221Updated 7 months ago
- Convolutional Fine-Grained Classification with Self-Supervised Target Relation Regularization (TIP 2022)☆12Updated 3 years ago
- Visual Transformers with Primal Object Queries for Multi-Label Image Classification☆12Updated 3 years ago
- 对卷积神经网络提取的每一层特征用t-SNE进行降维可视化☆22Updated 4 years ago
- Modular Graph Transformer Networks☆23Updated 4 years ago
- Implementation of vision transformer. ⭐⭐⭐☆33Updated 4 years ago
- Vision Transformers with Hierarchical Attention☆102Updated 5 months ago
- [CVPR 2023 Highlight] Masked Image Modeling with Local Multi-Scale Reconstruction☆55Updated 2 years ago