iCVTEAM / M3TRLinks
M3TR: Multi-modal Multi-label Recognition with Transformer. ACM MM 2021
☆15Updated 3 years ago
Alternatives and similar repositories for M3TR
Users that are interested in M3TR are comparing it to the libraries listed below
Sorting:
- [ACMMM 2020] Code release for "Learning Deep Multimodal Feature Representation with Asymmetric Multi-layer Fusion"☆28Updated 4 years ago
- PyTorch Implementation of Deep Equilibrium Multimodal Fusion☆21Updated 2 years ago
- The official implementation for ALOFT (CVPR 2023).☆55Updated 2 years ago
- ☆10Updated 3 years ago
- [MIPR 2022 & TMM 2023] "Attentive Graph Neural Networks for Few-shot Learning" with its extension version☆15Updated 2 years ago
- The results and code of our IEEE TCYB 2022 paper, titled "Global-and-Local Collaborative Learning for Co-Salient Object Detection"☆12Updated 3 years ago
- ☆26Updated 2 years ago
- Implementation Code for paper "Efficient Multimodal Fusion via Interactive Prompting" in CVPR2023☆17Updated 2 years ago
- Official implementation of CVPR2023 paper "Bi-directional distribution alignment for transductive zero-shot learning""☆35Updated last year
- [ECCV 2022] LAFF for Text-to-Video Retrieval☆45Updated last year
- Distribution Prototype Diffusion Learning for Open-set Supervised Anomaly Detection CVPR 2025☆16Updated 6 months ago
- End-to-End CLIP-driven Mamba Model for Multi-modal Fusion☆19Updated 2 months ago
- This repo shows the source code of IEEE TGRS 2022 article: Sonar Images Classification While Facing Long-Tail and Few-Shot.☆17Updated last year
- [PR 2022, Highly Cited Paper] Learning Attention-Guided Pyramidal Features for Few-shot Fine-grained Recognition☆17Updated 2 years ago
- Metal Surface Defect for Few-shot Classification Using Graph Embedding and Optimal Transport(IEEE TIM)☆16Updated 2 years ago
- The official repository of the paper "Learning Correlation Structures for Vision Transformers" accepted to CVPR 2024.☆47Updated last year
- ☆150Updated last year
- [CVPR' 23] Adjustment and Alignment for Unbiased Open Set Domain Adaptation☆21Updated 2 years ago
- This repo includes the CUB-GHA (Gaze-based Human Attention) dataset and code of the paper "Human Attention in Fine-grained Classification…☆31Updated 3 years ago
- Transformer-based Dual Relation Graph for Multi-label Image Recognition. ICCV 2021☆47Updated 3 years ago
- Unsupervised Domain Adaptive Salient Object Detection Through Uncertainty-Aware Pseudo-Label Learning, AAAI Conference on Artificial Inte…☆28Updated 2 years ago
- Scattering Vision Transformer☆53Updated last year
- ☆49Updated 3 years ago
- ☆152Updated last year
- The official implementation for DomainDrop (ICCV 2023).☆49Updated last year
- Official PyTorch Repository of "Task Discrepancy Maximization for Fine-grained Few-Shot Classification" (TDM, CVPR 2022 Oral Paper)☆44Updated last year
- AugTarget data augmentation for infrared small target detection.☆21Updated 2 years ago
- [ICCV2023] "Vision HGNN: An Image is More than a Graph of Nodes" by Yan Han, Peihao Wang, Souvik Kundu, Ying Ding, and Zhangyang Wang☆57Updated 2 months ago
- [CVPR 2023 Highlight] Masked Image Modeling with Local Multi-Scale Reconstruction☆52Updated 2 years ago
- [TIP] Learning to Discover Multi-Class Attentional Regions for Multi-Label Image Recognition☆45Updated 2 years ago