mobulan / MPSALinks
Source code of the paper Multi-Granularity Part Sampling Attention for Fine-Grained Visual Classification
☆34Updated 8 months ago
Alternatives and similar repositories for MPSA
Users that are interested in MPSA are comparing it to the libraries listed below
Sorting:
- Source code of the paper Fine-Grained Visual Classification via Internal Ensemble Learning Transformer☆52Updated last year
- The official repository of the paper "Learning Correlation Structures for Vision Transformers" accepted to CVPR 2024.☆47Updated last year
- Pytorch implementation of "Fine-grained Visual Classification with High-temperature Refinement and Background Suppression"☆108Updated last year
- ☆65Updated last year
- (CVPR2023/TPAMI2024) Integrally Pre-Trained Transformer Pyramid Networks -- A Hierarchical Vision Transformer for Masked Image Modeling☆204Updated last year
- [Pattern Recognition] Mix-ViT: Mixing Attentive Vision Transformer for Ultra-Fine-Grained Visual Categorization.☆21Updated 2 years ago
- [TNNLS 2025] TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition☆217Updated 3 months ago
- CVPR2024☆89Updated 6 months ago
- [ECCV 2024] Context-Guided Spatial Feature Reconstruction for Efficient Semantic Segmentation☆119Updated 2 months ago
- [AAAI 2025] SparX: A Sparse Cross-Layer Connection Mechanism for Hierarchical Vision Mamba and Transformer Networks☆39Updated 3 months ago
- ☆147Updated last year
- ☆80Updated last year
- [ICCV2023] This is an official implementation for "Scale-Aware Modulation Meet Transformer".☆211Updated 2 years ago
- Code release for Your “An Erudite Fine-Grained Visual Classification Model (CVPR 2023)"☆16Updated 2 years ago
- [CVPR 2025] SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation☆151Updated last month
- [CVPR 2024] PEM: Prototype-based Efficient MaskFormer for Image Segmentation☆118Updated 7 months ago
- The official code of "Rethinking Local Perception in Lightweight Vision Transformer"☆90Updated 2 years ago
- [TPAMI 2025] Generalized Semantic Contrastive Learning via Embedding Side Information for Few-Shot Object Detection☆34Updated 3 months ago
- ☆12Updated last year
- The official implementation for ICCV'23 paper "Small Object Detection via Coarse-to-fine Proposal Generation and Imitation Learning"☆154Updated last year
- The official repository implement of Res-VMamba: Fine-Grained Food Category Visual Classification Using Selective State Space Models with…☆77Updated 2 months ago
- 可视化特征图教程☆75Updated 3 years ago
- GroupMixAttention and GroupMixFormer☆116Updated last year
- This is the official implementation for our CVPR2024 paper "Rethinking Prior Information Generation with CLIP for Few-Shot Segmentation".…☆46Updated last year
- Official implementation of the CVPR 2024 paper ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense…☆328Updated 8 months ago
- ☆152Updated last year
- [IGARSS 2024] Code for "CLIP-Guided Source-Free Object Detection in Aerial Images"☆27Updated 10 months ago
- [arXiv 2025] LWGANet: A Lightweight Group Attention Backbone for Remote Sensing Visual Tasks☆79Updated 2 months ago
- Official repo for “Unlocking Attributes' Contribution to Successful Camouflage: A Combined Textual and Visual Analysis Strategy”☆14Updated 10 months ago
- AFFNet-Unofficial Implementation☆15Updated 2 years ago