iCVTEAM / M3TR
M3TR: Multi-modal Multi-label Recognition with Transformer. ACM MM 2021
☆15 · Updated 3 years ago
Alternatives and similar repositories for M3TR
Users who are interested in M3TR are comparing it to the repositories listed below.
- PyTorch Implementation of Deep Equilibrium Multimodal Fusion ☆20 · Updated last year
- The official implementation for ALOFT (CVPR 2023). ☆54 · Updated last year
- [ACMMM 2020] Code release for "Learning Deep Multimodal Feature Representation with Asymmetric Multi-layer Fusion" ☆28 · Updated 3 years ago
- Implementation Code for paper "Efficient Multimodal Fusion via Interactive Prompting" in CVPR2023 ☆17 · Updated last year
- ☆9 · Updated 3 years ago
- [MIPR 2022 & TMM 2023] "Attentive Graph Neural Networks for Few-shot Learning" with its extension version ☆15 · Updated 2 years ago
- [CVPR'23] Adjustment and Alignment for Unbiased Open Set Domain Adaptation ☆19 · Updated 2 years ago
- The repo for "Diagnosing and Re-learning for Balanced Multi-modal Learning", ECCV 2024 ☆24 · Updated 10 months ago
- ReViT - Residual Attention Vision Transformer ☆31 · Updated last year
- The official implementation for DomainDrop (ICCV 2023). ☆48 · Updated last year
- IJCAI 2024, InfoMatch: Entropy neural estimation for semi-supervised image classification ☆31 · Updated last year
- Code release for "An Erudite Fine-Grained Visual Classification Model" (CVPR 2023) ☆16 · Updated 2 years ago
- Official implementation of CVPR2023 paper "Bi-directional distribution alignment for transductive zero-shot learning" ☆35 · Updated last year
- This repo includes the CUB-GHA (Gaze-based Human Attention) dataset and code of the paper "Human Attention in Fine-grained Classification… ☆31 · Updated 2 years ago
- [PR 2022, Highly Cited Paper] Learning Attention-Guided Pyramidal Features for Few-shot Fine-grained Recognition ☆17 · Updated 2 years ago
- ☆24 · Updated 2 years ago
- A PyTorch implementation of CMT based on paper CMT: Convolutional Neural Networks Meet Vision Transformers. ☆70 · Updated 2 years ago
- Official code release of our paper "EViT: An Eagle Vision Transformer with Bi-Fovea Self-Attention" ☆21 · Updated 7 months ago
- ☆19 · Updated 11 months ago
- Dual Adaptive Representation Alignment for Cross-domain Few-shot Learning TPAMI 2023 ☆20 · Updated last year
- Code for "From Instance to Metric Calibration: A Unified Framework for Open-World Few-Shot Learning" in TPAMI 2023. ☆11 · Updated last year
- ☆26 · Updated 2 years ago
- The core code of AGCA: An Adaptive Graph Channel Attention Module for Steel Surface Defect Detection ☆29 · Updated last year
- Code release for Scribble-attention Hierarchical Network for Weakly Supervised Salient Object Detection in Optical Remote Sensing Images. ☆13 · Updated last year
- This repository is the code of the paper "Sparse Spatial Transformers for Few-Shot Learning" (SCIENCE CHINA Information Sciences). ☆49 · Updated last year
- Code release for Bi-Directional Ensemble Network for Few-Shot Fine-Grained Classification. ☆11 · Updated 2 years ago
- [CVPR 2023 Highlight] Masked Image Modeling with Local Multi-Scale Reconstruction ☆49 · Updated last year
- AAAI-24 Decoupled Contrastive Learning for Long-Tailed Recognition ☆27 · Updated last year
- Transformer-based Dual Relation Graph for Multi-label Image Recognition. ICCV 2021 ☆47 · Updated 2 years ago
- ☆33 · Updated 7 months ago