iCVTEAM / M3TRLinks
M3TR: Multi-modal Multi-label Recognition with Transformer. ACM MM 2021
☆15Updated 4 years ago
Alternatives and similar repositories for M3TR
Users that are interested in M3TR are comparing it to the libraries listed below
Sorting:
- The official implementation for ALOFT (CVPR 2023).☆57Updated 2 years ago
- PyTorch Implementation of Deep Equilibrium Multimodal Fusion☆21Updated 2 years ago
- [MIPR 2022 & TMM 2023] "Attentive Graph Neural Networks for Few-shot Learning" with its extension version☆15Updated 2 years ago
- ☆10Updated 3 years ago
- [ACMMM 2020] Code release for "Learning Deep Multimodal Feature Representation with Asymmetric Multi-layer Fusion"☆28Updated 4 years ago
- ☆26Updated 2 years ago
- End-to-End CLIP-driven Mamba Model for Multi-modal Fusion☆21Updated 5 months ago
- Official implementation of CVPR2023 paper "Bi-directional distribution alignment for transductive zero-shot learning""☆35Updated last year
- Code of ["Spectral Prompt Tuning: Unveiling Unseen Classes for Zero-Shot Semantic Segmentation"]☆14Updated last year
- [ICCV2023] "Vision HGNN: An Image is More than a Graph of Nodes" by Yan Han, Peihao Wang, Souvik Kundu, Ying Ding, and Zhangyang Wang☆59Updated 4 months ago
- ☆154Updated last year
- ☆151Updated last year
- ☆85Updated 2 years ago
- This repo includes the CUB-GHA (Gaze-based Human Attention) dataset and code of the paper "Human Attention in Fine-grained Classification…☆31Updated 3 years ago
- Implementation Code for paper "Efficient Multimodal Fusion via Interactive Prompting" in CVPR2023☆17Updated 2 years ago
- ☆148Updated last year
- The official repository of the paper "Learning Correlation Structures for Vision Transformers" accepted to CVPR 2024.☆47Updated last year
- ☆26Updated 2 years ago
- [CVPR 2023 Highlight] Masked Image Modeling with Local Multi-Scale Reconstruction☆55Updated 2 years ago
- ReViT - Residual Attention Vision Transformer☆33Updated last year
- Official code release of our paper "EViT: An Eagle Vision Transformer with Bi-Fovea Self-Attention"☆21Updated 2 months ago
- ☆68Updated last year
- ☆50Updated 3 years ago
- Unsupervised Domain Adaptive Salient Object Detection Through Uncertainty-Aware Pseudo-Label Learning, AAAI Conference on Artificial Inte…☆29Updated 2 years ago
- Code release for Scribble-attention Hierarchical Network for Weakly Supervised Salient Object Detection in Optical Remote Sensing Images.☆13Updated 2 years ago
- This repository is the code of the paper "Sparse Spatial Transformers for Few-Shot Learning" (SCIENCE CHINA Information Sciences).☆49Updated 2 years ago
- Towards Local Visual Modeling for Image Captioning☆29Updated 2 years ago
- Python code to implement DeIL, a CLIP based approach for open-world few-shot learning.☆19Updated last year
- AugTarget data augmentation for infrared small target detection.☆21Updated 2 years ago
- PyTorch code for Diffusion Mechanism in Neural Network: Theory and Applications☆40Updated last year