Markin-Wang / MixViTLinks
[Pattern Recognition] Mix-ViT: Mixing Attentive Vision Transformer for Ultra-Fine-Grained Visual Categorization.
☆21Updated last year
Alternatives and similar repositories for MixViT
Users that are interested in MixViT are comparing it to the libraries listed below
Sorting:
- Source code of the paper Fine-Grained Visual Classification via Internal Ensemble Learning Transformer☆48Updated last year
- Pytorch implementation of "Fine-grained Visual Classification with High-temperature Refinement and Background Suppression"☆106Updated last year
- CVPR2024☆85Updated 4 months ago
- Code release for Your “An Erudite Fine-Grained Visual Classification Model (CVPR 2023)"☆16Updated 2 years ago
- Hunting Attributes: Context Prototype-Aware Learning for Weakly Supervised Semantic Segmentation (CVPR 2024)☆46Updated 9 months ago
- CLIP-Mamba: CLIP Pretrained Mamba Models with OOD and Hessian Evaluation☆75Updated 11 months ago
- [CVPR 2024] PEM: Prototype-based Efficient MaskFormer for Image Segmentation☆112Updated 4 months ago
- ☆77Updated last year
- [CVPR 2023] Token Contrast for Weakly-Supervised Semantic Segmentation☆169Updated 2 years ago
- PA-SAM: Prompt Adapter SAM for High-quality Image Segmentation☆87Updated last year
- ECCV24 "ReMamber: Referring Image Segmentation with Mamba Twister" official repository.☆39Updated last year
- The official repository of the paper "Learning Correlation Structures for Vision Transformers" accepted to CVPR 2024.☆48Updated last year
- ☆19Updated 8 months ago
- ☆26Updated 2 years ago
- [CVPR 2023 Highlight] Masked Image Modeling with Local Multi-Scale Reconstruction☆50Updated 2 years ago
- This repo collects the research resources based on SAM(Segment Anything Model) proposed by Meta AI. If you would like to contribute, plea…☆46Updated last year
- Hybrid Mamba for Few-Shot Segmentation (NIPS 2024)☆31Updated 9 months ago
- Official code for the NeurIPS 2023 paper "Switching Temporary Teachers for Semi-Supervised Semantic Segmentation"☆50Updated last year
- Code for Part-Guided Relational Transformers for Fine-Grained Visual Recognition, appeared in TIP 2021☆22Updated last year
- ☆51Updated last year
- ☆104Updated last year
- Pytorch Implementation for CVPR 2024 paper: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation☆48Updated this week
- [CVPR 2023] CLIP is Also an Efficient Segmenter: A Text-Driven Approach for Weakly Supervised Semantic Segmentation☆197Updated 10 months ago
- This repository lists some awesome public projects about Zero-shot/Few-shot Learning based on CLIP (Contrastive Language-Image Pre-Traini…☆24Updated 7 months ago
- ☆29Updated 2 years ago
- [CVPR 2023] CrowdCLIP: Unsupervised Crowd Counting via Vision-Language Model☆88Updated last year
- ☆143Updated last year
- ☆36Updated 2 years ago
- (CVPR2023/TPAMI2024) Integrally Pre-Trained Transformer Pyramid Networks -- A Hierarchical Vision Transformer for Masked Image Modeling☆195Updated 11 months ago
- [CVPR'23] Augmentation Matters: A Simple-yet-Effective Approach to Semi-supervised Semantic Segmentation☆121Updated last year