Imageomics / INTR
This is an official implementation for [ICLR'24] INTR: Interpretable Transformer for Fine-grained Image Classification.
☆33Updated 5 months ago
Related projects: ⓘ
- [CVPR 2024] Validation-free few-shot adaptation of CLIP, using a well-initialized Linear Probe (ZSLP) and class-adaptive constraints (CLA…☆53Updated 3 months ago
- Union-set Multi-source Model Adaptation for Semantic Segmentation☆12Updated last year
- ☆17Updated last year
- [CVPR 2022] "The Principle of Diversity: Training Stronger Vision Transformers Calls for Reducing All Levels of Redundancy" by Tianlong C…☆25Updated 2 years ago
- ☆37Updated 10 months ago
- Official Implementation of Attentive Mask CLIP (ICCV2023, https://arxiv.org/abs/2212.08653)☆19Updated 3 months ago
- 🔥MixPro: Data Augmentation with MaskMix and Progressive Attention Labeling for Vision Transformer [Official, ICLR 2023]☆18Updated 10 months ago
- [CVPR 2023] Bridging Precision and Confidence: A Train-Time Loss for Calibrating Object Detection☆32Updated last year
- [CVPR 2024] Code for our Paper "DeiT-LT: Distillation Strikes Back for Vision Transformer training on Long-Tailed Datasets"☆33Updated 3 weeks ago
- MaskCon: Masked Contrastive Learning for Coarse-Labeled Dataset (CVPR2023)☆31Updated 7 months ago
- Generating Image Specific Text☆21Updated last year
- [CVPR'23] Hard Patches Mining for Masked Image Modeling☆86Updated 8 months ago
- LaFTer: Label-Free Tuning of Zero-shot Classifier using Language and Unlabeled Image Collections (NeurIPS 2023)☆24Updated 8 months ago
- TRT for WSOL☆29Updated 10 months ago
- [NeurIPS 2022] code for the paper, SemMAE: Semantic-guided masking for learning masked autoencoders☆32Updated last year
- [WACV 2024] Instruct Me More! Random Prompting for Visual In-Context Learning☆13Updated 5 months ago
- 【ICCV 2023】Diverse Data Augmentation with Diffusions for Effective Test-time Prompt Tuning☆54Updated last month
- [ICLR 2023] “ Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Better Representations”, Ziyu Jian…☆23Updated last year
- Validating image classification benchmark results on ViTs and ResNets (v2)☆12Updated last year
- This repository includes the official project of L2B, from our paper "Learning to Bootstrap for Combating Label Noise".☆28Updated 2 months ago
- Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training☆15Updated last year
- ☆40Updated last year
- LeemSaebom / Attention-Guided-CAM-Visual-Explanations-of-Vision-Transformer-Guided-by-Self-AttentionThe official code for Attention Guided CAM: Visual Explanations of Vision Transformer Guided by Self-Attention☆11Updated 7 months ago
- Domain Generalization through Distilling CLIP with Language Guidance☆25Updated 11 months ago
- ☆54Updated last year
- [CVPR 2024] PriViLege: Pre-trained Vision and Language Transformers Are Few-Shot Incremental Learners☆28Updated 2 weeks ago
- ☆16Updated 3 weeks ago
- Code for Label Propagation for Zero-shot Classification with Vision-Language Models (CVPR2024)☆31Updated last month
- Exploring Hierarchical Graph Representation for Large-Scale Zero-Shot Image Classification. ECCV 2022.☆19Updated 2 years ago
- [arXiv'23] HGCLIP: Exploring Vision-Language Models with Graph Representations for Hierarchical Understanding☆30Updated last month