mobulan / MPSA
Source code of the paper Multi-Granularity Part Sampling Attention for Fine-Grained Visual Classification
☆31Updated 7 months ago
Alternatives and similar repositories for MPSA
Users interested in MPSA are comparing it to the repositories listed below
- Source code of the paper Fine-Grained Visual Classification via Internal Ensemble Learning Transformer☆51Updated last year
- Pytorch implementation of "Fine-grained Visual Classification with High-temperature Refinement and Background Suppression"☆108Updated last year
- [Pattern Recognition] Mix-ViT: Mixing Attentive Vision Transformer for Ultra-Fine-Grained Visual Categorization.☆21Updated last year
- ☆77Updated last year
- The official repository of the paper "Learning Correlation Structures for Vision Transformers" accepted to CVPR 2024.☆47Updated last year
- (CVPR2023/TPAMI2024) Integrally Pre-Trained Transformer Pyramid Networks -- A Hierarchical Vision Transformer for Masked Image Modeling☆198Updated last year
- [ECCV 2024] Context-Guided Spatial Feature Reconstruction for Efficient Semantic Segmentation☆116Updated last month
- ☆145Updated last year
- The official implementation for ICCV'23 paper "Small Object Detection via Coarse-to-fine Proposal Generation and Imitation Learning"☆152Updated last year
- [TPAMI 2025] Generalized Semantic Contrastive Learning via Embedding Side Information for Few-Shot Object Detection☆33Updated last month
- ☆52Updated last year
- ☆27Updated last year
- A benchmark for cross-domain few-shot object detection (ECCV24 paper: Cross-Domain Few-Shot Object Detection via Enhanced Open-Set Object…☆161Updated 5 months ago
- The official implementation repository of Res-VMamba: Fine-Grained Food Category Visual Classification Using Selective State Space Models with…☆73Updated 3 weeks ago
- [TNNLS 2025] TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition☆214Updated 2 months ago
- [IGARSS 2024] Code for "CLIP-Guided Source-Free Object Detection in Aerial Images"☆27Updated 8 months ago
- Official implementation of the CVPR 2024 paper ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense…☆321Updated 7 months ago
- [CVPR 2024] PEM: Prototype-based Efficient MaskFormer for Image Segmentation☆114Updated 5 months ago
- ☆12Updated last year
- [CVPR 2025] SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation☆136Updated this week
- CVPR2024☆85Updated 5 months ago
- [CVPR 2025 🔥] MI-DETR: An Object Detection Model with Multi-time Inquiries Mechanism☆31Updated 3 months ago
- [CVPR 2023] CrowdCLIP: Unsupervised Crowd Counting via Vision-Language Model☆89Updated 2 years ago
- [CVPR2024] Distribution-aware Knowledge Prototyping for Non-exemplar Lifelong Person Re-identification☆15Updated last month
- [AAAI 2025] SparX: A Sparse Cross-Layer Connection Mechanism for Hierarchical Vision Mamba and Transformer Networks☆38Updated 2 months ago
- Official implementation of the CVPR 2025 paper "T2ICount: Enhancing Cross-modal Understanding for zero-shot Counting"☆18Updated 4 months ago
- Code release for "An Erudite Fine-Grained Visual Classification Model" (CVPR 2023)☆16Updated 2 years ago
- ☆19Updated 9 months ago
- GroupMixAttention and GroupMixFormer☆117Updated last year
- [ICCV2023] This is an official implementation for "Scale-Aware Modulation Meet Transformer".☆210Updated 2 years ago