iCVTEAM / M3TRLinks
M3TR: Multi-modal Multi-label Recognition with Transformer. ACM MM 2021
☆15Updated 3 years ago
Alternatives and similar repositories for M3TR
Users that are interested in M3TR are comparing it to the libraries listed below
Sorting:
- [MIPR 2022 & TMM 2023] "Attentive Graph Neural Networks for Few-shot Learning" with its extension version☆15Updated 2 years ago
- The official implementation for ALOFT (CVPR 2023).☆56Updated 2 years ago
- PyTorch Implementation of Deep Equilibrium Multimodal Fusion☆20Updated 2 years ago
- [ACMMM 2020] Code release for "Learning Deep Multimodal Feature Representation with Asymmetric Multi-layer Fusion"☆28Updated 4 years ago
- Implementation Code for paper "Efficient Multimodal Fusion via Interactive Prompting" in CVPR2023☆17Updated 2 years ago
- ☆10Updated 3 years ago
- Official implementation of CVPR2023 paper "Bi-directional distribution alignment for transductive zero-shot learning""☆35Updated last year
- ☆26Updated 2 years ago
- [PR 2022, Highly Cited Paper] Learning Attention-Guided Pyramidal Features for Few-shot Fine-grained Recognition☆17Updated 2 years ago
- End-to-End CLIP-driven Mamba Model for Multi-modal Fusion☆21Updated 3 months ago
- [ICCV2023] "Vision HGNN: An Image is More than a Graph of Nodes" by Yan Han, Peihao Wang, Souvik Kundu, Ying Ding, and Zhangyang Wang☆57Updated 3 months ago
- [CVPR' 23] Adjustment and Alignment for Unbiased Open Set Domain Adaptation☆21Updated 2 years ago
- PyTorch implementation of Deep Semantic Dictionary Learning for Multi-label Image Classification, AAAI 2021.☆49Updated 4 years ago
- This repo shows the source code of IEEE TGRS 2022 article: Sonar Images Classification While Facing Long-Tail and Few-Shot.☆17Updated last year
- The official implementation for DomainDrop (ICCV 2023).☆50Updated last year
- The official repository of the paper "Learning Correlation Structures for Vision Transformers" accepted to CVPR 2024.☆47Updated last year
- This repository is the code of the paper "Sparse Spatial Transformers for Few-Shot Learning" (SCIENCE CHINA Information Sciences).☆49Updated 2 years ago
- ☆25Updated 2 years ago
- ☆151Updated last year
- ☆151Updated last year
- [TIP] Learning to Discover Multi-Class Attentional Regions for Multi-Label Image Recognition☆45Updated 2 years ago
- ☆85Updated 2 years ago
- A PyTorch implementation of CMT based on paper CMT: Convolutional Neural Networks Meet Vision Transformers.☆71Updated 2 years ago
- This repo includes the CUB-GHA (Gaze-based Human Attention) dataset and code of the paper "Human Attention in Fine-grained Classification…☆31Updated 3 years ago
- Code release for Your “An Erudite Fine-Grained Visual Classification Model (CVPR 2023)"☆16Updated 2 years ago
- [CVPR 2023 Highlight] Masked Image Modeling with Local Multi-Scale Reconstruction☆52Updated 2 years ago
- Official PyTorch Repository of "Task Discrepancy Maximization for Fine-grained Few-Shot Classification" (TDM, CVPR 2022 Oral Paper)☆44Updated last year
- [BMVC 2021] The official PyTorch implementation of Feature Fusion Vision Transformer for Fine-Grained Visual Categorization☆49Updated 2 years ago
- PyTorch code for Diffusion Mechanism in Neural Network: Theory and Applications☆40Updated last year
- ☆68Updated last year