Henrymachiyu / ProtoViTLinks
This code implements ProtoViT, a novel approach that combines Vision Transformers with prototype-based learning to create interpretable image classification models. Our implementation provides both high accuracy and explainability through learned prototypes.
☆35Updated 6 months ago
Alternatives and similar repositories for ProtoViT
Users that are interested in ProtoViT are comparing it to the libraries listed below
Sorting:
- Code for the paper Visual Explanations of Image–Text Representations via Multi-Modal Information Bottleneck Attribution☆63Updated last year
- The official pytorch implemention of our CVPR-2024 paper "MMA: Multi-Modal Adapter for Vision-Language Models".☆89Updated 7 months ago
- Code for the paper "Compositional Entailment Learning for Hyperbolic Vision-Language Models".☆91Updated 5 months ago
- [ICML 2024] Official implementation for "Predictive Dynamic Fusion."☆67Updated 10 months ago
- Adaptation of vision-language models (CLIP) to downstream tasks using local and global prompts.☆49Updated 4 months ago
- ☆45Updated last year
- [CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models"☆339Updated last week
- ☆60Updated last month
- Advances in Multimodal Adaptation and Generalization: From Traditional Approaches to Foundation Models☆145Updated this week
- An easy way to apply LoRA to CLIP. Implementation of the paper "Low-Rank Few-Shot Adaptation of Vision-Language Models" (CLIP-LoRA) [CVPR…☆267Updated 5 months ago
- Official implementation of ResCLIP: Residual Attention for Training-free Dense Vision-language Inference☆53Updated last month
- ☆77Updated 9 months ago
- The code of "Logits DeConfusion with CLIP for Few-Shot Learning" (CVPR 2025)☆61Updated 5 months ago
- The official code repository of ShaSpec model from CVPR 2023 [paper](https://arxiv.org/pdf/2307.14126) "Multi-modal Learning with Missing…☆81Updated 7 months ago
- Pytorch implementation of "Test-time Adaption against Multi-modal Reliability Bias".☆44Updated 11 months ago
- [MICCAI 2023][Early Accept] Official code repository of paper titled "Cross-modulated Few-shot Image Generation for Colorectal Tissue Cla…☆47Updated 2 years ago
- [CVPR 2024] FairCLIP: Harnessing Fairness in Vision-Language Learning☆94Updated 4 months ago
- [ECCV 2024] Soft Prompt Generation for Domain Generalization☆28Updated last year
- Official PyTorch implementation of DiffuseMix : Label-Preserving Data Augmentation with Diffusion Models (CVPR'2024)☆128Updated 8 months ago
- Low-Rank Rescaled Vision Transformer Fine-Tuning: A Residual Design Approach, CVPR 2024☆22Updated last year
- About Official PyTorch(MMCV) implementation of “SUMix: Mixup with Semantic and Uncertain Information” (ECCV 2024)☆12Updated last year
- [ICLR 2024 Oral] Less is More: Fewer Interpretable Region via Submodular Subset Selection☆86Updated last month
- Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model☆22Updated 2 weeks ago
- [CVPR 2024] TEA: Test-time Energy Adaptation☆71Updated last year
- This is an official implementation for PROMPT-CAM: A Simpler Interpretable Transformer for Fine-Grained Analysis (CVPR'25)☆57Updated 8 months ago
- [ICCV'23 Main Track, WECIA'23 Oral] Official repository of paper titled "Self-regulating Prompts: Foundational Model Adaptation without F…☆281Updated 2 years ago
- Masked Autoencoder meets GANs☆28Updated last year
- The official implementation for MTLoRA: A Low-Rank Adaptation Approach for Efficient Multi-Task Learning (CVPR '24)☆69Updated 4 months ago
- ☆56Updated 5 months ago
- [ICLR 2025] - Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion☆56Updated 7 months ago