Imageomics / INTR

This is an official implementation for [ICLR'24] INTR: Interpretable Transformer for Fine-grained Image Classification.

☆51

Alternatives and similar repositories for INTR

Users who are interested in INTR are comparing it to the repositories listed below.

Haochen-Wang409 / HPM
[CVPR'23 & TPAMI'25] Hard Patches Mining for Masked Image Modeling
☆100 Updated 3 months ago
bwconrad / flexivit
PyTorch reimplementation of FlexiViT: One Model for All Patch Sizes
☆61 Updated last year
dogehhh / ReCLIP
PyTorch implementation of the CVPR 2024 paper: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation
☆50 Updated last week
val-iisc / DeiT-LT
[CVPR 2024] Code for our Paper "DeiT-LT: Distillation Strikes Back for Vision Transformer training on Long-Tailed Datasets"
☆41 Updated 7 months ago
Jiahao000 / MFM
[ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training
☆77 Updated 2 years ago
akhtarvision / cal-detr
☆42 Updated last year
jusiro / CLAP
[CVPR'24] Validation-free few-shot adaptation of CLIP, using a well-initialized Linear Probe (ZSLP) and class-adaptive constraints (CLAP)…
☆75 Updated 2 months ago
cvl-umass / AdaptCLIPZS
☆41 Updated 6 months ago
AbrahamYabo / SdAE
☆61 Updated 2 years ago
wangf3014 / Mamba-Reg
☆71 Updated 5 months ago
aleemsidra / ConvLoRA
This repository contains the PyTorch code for our IEEE ISBI 2024 paper "ConvLoRA and AdaBN Based Domain Adaptation via Self-Training…
☆80 Updated 9 months ago
microsoft / A-CLIP
Official Implementation of Attentive Mask CLIP (ICCV2023, https://arxiv.org/abs/2212.08653)
☆32 Updated last year
Westlake-AI / A2MIM
[ICML 2023] Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN
☆27 Updated 11 months ago
ml-jku / MIM-Refiner
A Contrastive Learning Boost from Intermediate Pre-Trained Representations
☆42 Updated 10 months ago
ucasligang / SemMAE
[NeurIPS 2022] code for the paper, SemMAE: Semantic-guided masking for learning masked autoencoders
☆38 Updated 2 years ago
fistyee / MixPro
🔥MixPro: Data Augmentation with MaskMix and Progressive Attention Labeling for Vision Transformer [Official, ICLR 2023]
☆21 Updated last year
raytrun / mamba-clip
CLIP-Mamba: CLIP Pretrained Mamba Models with OOD and Hessian Evaluation
☆75 Updated 11 months ago
ZrrSkywalker / CaFo
[CVPR 2023] Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners
☆44 Updated 2 years ago
wgcban / adamae
[CVPR'23] AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning with Masked Autoencoders
☆79 Updated last year
UCSC-VLAA / DMAE
[CVPR 2023] This repository includes the official implementation of our paper "Masked Autoencoders Enable Efficient Knowledge Distillers"
☆107 Updated 2 years ago
zbf1991 / WeCLIP
CVPR 2024
☆85 Updated 4 months ago
jmiemirza / LaFTer
LaFTer: Label-Free Tuning of Zero-shot Classifier using Language and Unlabeled Image Collections (NeurIPS 2023)
☆29 Updated last year
emu1729 / GIST
Generating Image Specific Text
☆28 Updated last year
amazon-science / semi-vit
PyTorch implementation of Semi-supervised Vision Transformers
☆59 Updated 2 years ago
Imageomics / bioclip
This is the repository for the BioCLIP model and the TreeOfLife-10M dataset [CVPR'24 Oral, Best Student Paper].
☆213 Updated last month
m1k2zoo / negbench
Evaluation and dataset construction code for the CVPR 2025 paper "Vision-Language Models Do Not Understand Negation"
☆27 Updated 3 months ago
Haoqing-Wang / LocalMIM
[CVPR 2023 Highlight] Masked Image Modeling with Local Multi-Scale Reconstruction
☆51 Updated 2 years ago
richard-peng-xia / HGCLIP
[COLING'25] HGCLIP: Exploring Vision-Language Models with Graph Representations for Hierarchical Understanding
☆42 Updated 8 months ago
jacobmarks / awesome-clip-papers
The most impactful papers related to contrastive pretraining for multimodal models!
☆68 Updated last year
gkakogeorgiou / attmask
[ECCV 2022] What to Hide from Your Students: Attention-Guided Masked Image Modeling
☆71 Updated last year