Markin-Wang / MixViT
[Pattern Recognition] Mix-ViT: Mixing Attentive Vision Transformer for Ultra-Fine-Grained Visual Categorization.
☆22Updated 2 years ago
Alternatives and similar repositories for MixViT
Users who are interested in MixViT are comparing it to the repositories listed below
- Source code of the paper Fine-Grained Visual Classification via Internal Ensemble Learning Transformer☆55Updated last year
- CVPR2024☆104Updated 10 months ago
- CLIP-Mamba: CLIP Pretrained Mamba Models with OOD and Hessian Evaluation☆78Updated last year
- PA-SAM: Prompt Adapter SAM for High-quality Image Segmentation☆97Updated last year
- This repository lists some awesome public projects about Zero-shot/Few-shot Learning based on CLIP (Contrastive Language-Image Pre-Traini…☆29Updated last year
- The official repository of the paper "Learning Correlation Structures for Vision Transformers" accepted to CVPR 2024.☆46Updated last year
- [CVPR 2023] Token Contrast for Weakly-Supervised Semantic Segmentation☆178Updated 2 years ago
- Hunting Attributes: Context Prototype-Aware Learning for Weakly Supervised Semantic Segmentation (CVPR 2024)☆46Updated last year
- Pytorch Implementation for CVPR 2024 paper: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation☆56Updated 5 months ago
- Pytorch implementation of "Fine-grained Visual Classification with High-temperature Refinement and Background Suppression"☆114Updated 2 years ago
- ☆112Updated last year
- Official code for the NeurIPS 2023 paper "Switching Temporary Teachers for Semi-Supervised Semantic Segmentation"☆51Updated 2 years ago
- [CVPR 2024] PEM: Prototype-based Efficient MaskFormer for Image Segmentation☆130Updated 10 months ago
- ☆149Updated last year
- ☆83Updated 2 years ago
- Code release for "An Erudite Fine-Grained Visual Classification Model" (CVPR 2023)☆17Updated 2 years ago
- ☆27Updated last year
- [CVPR 2023] CLIP is Also an Efficient Segmenter: A Text-Driven Approach for Weakly Supervised Semantic Segmentation☆210Updated last year
- Code for Part-Guided Relational Transformers for Fine-Grained Visual Recognition, appeared in TIP 2021☆22Updated 2 years ago
- Official PyTorch Implementation of DIaM in "A Strong Baseline for Generalized Few-Shot Semantic Segmentation" (CVPR 2023)☆73Updated last year
- ☆30Updated 2 years ago
- ☆45Updated 2 years ago
- Official Code of SATS: Self-Attention Transfer for Continual Semantic Segmentation☆25Updated 2 years ago
- Hybrid Mamba for Few-Shot Segmentation (NIPS 2024)☆41Updated last year
- Vision and Language Reference Prompt into SAM for Few-shot Segmentation☆29Updated 9 months ago
- Official PyTorch implementation of ZiRa, a method for incremental vision language object detection (IVLOD), which has been accepted by Neu…☆29Updated last year
- This is an official implementation for [ICLR'24] INTR: Interpretable Transformer for Fine-grained Image Classification.☆57Updated last year
- ☆36Updated 2 years ago
- (CVPR2023/TPAMI2024) Integrally Pre-Trained Transformer Pyramid Networks -- A Hierarchical Vision Transformer for Masked Image Modeling☆211Updated last year
- [CVPR 2023] CrowdCLIP: Unsupervised Crowd Counting via Vision-Language Model☆91Updated 2 years ago