Imageomics / INTRLinks
This is an official implementation for [ICLR'24] INTR: Interpretable Transformer for Fine-grained Image Classification.
☆51Updated last year
Alternatives and similar repositories for INTR
Users that are interested in INTR are comparing it to the libraries listed below
Sorting:
- [CVPR'23 & TPAMI'25] Hard Patches Mining for Masked Image Modeling☆100Updated 3 months ago
- PyTorch reimplementation of FlexiViT: One Model for All Patch Sizes☆61Updated last year
- Pytorch Implementation for CVPR 2024 paper: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation☆50Updated last week
- [CVPR 2024] Code for our Paper "DeiT-LT: Distillation Strikes Back for Vision Transformer training on Long-Tailed Datasets"☆41Updated 7 months ago
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training☆77Updated 2 years ago
- ☆42Updated last year
- [CVPR'24] Validation-free few-shot adaptation of CLIP, using a well-initialized Linear Probe (ZSLP) and class-adaptive constraints (CLAP)…☆75Updated 2 months ago
- ☆41Updated 6 months ago
- ☆61Updated 2 years ago
- ☆71Updated 5 months ago
- This repository contains the pytorch code for our work IEEE ISBI 2024 paper "ConvLoRA and AdaBN Based Domain Adaptation via Self-Training…☆80Updated 9 months ago
- Official Implementation of Attentive Mask CLIP (ICCV2023, https://arxiv.org/abs/2212.08653)☆32Updated last year
- [ICML 2023] Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN☆27Updated 11 months ago
- A Contrastive Learning Boost from Intermediate Pre-Trained Representations☆42Updated 10 months ago
- [NeurIPS 2022] code for the paper, SemMAE: Semantic-guided masking for learning masked autoencoders☆38Updated 2 years ago
- 🔥MixPro: Data Augmentation with MaskMix and Progressive Attention Labeling for Vision Transformer [Official, ICLR 2023]☆21Updated last year
- CLIP-Mamba: CLIP Pretrained Mamba Models with OOD and Hessian Evaluation☆75Updated 11 months ago
- [CVPR 2023] Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners☆44Updated 2 years ago
- [CVPR'23] AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning with Masked Autoencoders☆79Updated last year
- [CVPR 2023] This repository includes the official implementation our paper "Masked Autoencoders Enable Efficient Knowledge Distillers"☆107Updated 2 years ago
- CVPR2024☆85Updated 4 months ago
- LaFTer: Label-Free Tuning of Zero-shot Classifier using Language and Unlabeled Image Collections (NeurIPS 2023)☆29Updated last year
- Generating Image Specific Text☆28Updated last year
- PyTorch implementation of Semi-supervised Vision Transformers☆59Updated 2 years ago
- This is the repository for the BioCLIP model and the TreeOfLife-10M dataset [CVPR'24 Oral, Best Student Paper].☆213Updated last month
- Evaluation and dataset construction code for the CVPR 2025 paper "Vision-Language Models Do Not Understand Negation"☆27Updated 3 months ago
- [CVPR 2023 Highlight] Masked Image Modeling with Local Multi-Scale Reconstruction☆51Updated 2 years ago
- [COLING'25] HGCLIP: Exploring Vision-Language Models with Graph Representations for Hierarchical Understanding☆42Updated 8 months ago
- The most impactful papers related to contrastive pretraining for multimodal models!☆68Updated last year
- [ECCV 2022] What to Hide from Your Students: Attention-Guided Masked Image Modeling☆71Updated last year