Henrymachiyu / ProtoViT
This code implements ProtoViT, a novel approach that combines Vision Transformers with prototype-based learning to create interpretable image classification models. Our implementation provides both high accuracy and explainability through learned prototypes.
☆15Updated 3 weeks ago
Alternatives and similar repositories for ProtoViT:
Users that are interested in ProtoViT are comparing it to the libraries listed below
- [COLING'25] HGCLIP: Exploring Vision-Language Models with Graph Representations for Hierarchical Understanding☆34Updated 2 months ago
- CVPR 2023: Language in a Bottle: Language Model Guided Concept Bottlenecks for Interpretable Image Classification☆86Updated 8 months ago
- Official code for ICLR 2024 paper, "A Hard-to-Beat Baseline for Training-free CLIP-based Adaptation"☆73Updated 9 months ago
- Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models (AAAI 2024)☆67Updated 3 weeks ago
- [NeurIPS 2023] Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization☆103Updated 11 months ago
- ☆61Updated this week
- The official pytorch implemention of our CVPR-2024 paper "MMA: Multi-Modal Adapter for Vision-Language Models".☆50Updated 6 months ago
- [NeurIPS'24] CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models☆64Updated last month
- [MICCAI 2024] Official code repository of paper titled "BAPLe: Backdoor Attacks on Medical Foundation Models using Prompt Learning" accep…☆53Updated 3 months ago
- [CVPR-2024] Official implementations of CLIP-KD: An Empirical Study of CLIP Model Distillation☆94Updated 6 months ago
- ☆60Updated 6 months ago
- [NeurIPS2023] LoCoOp: Few-Shot Out-of-Distribution Detection via Prompt Learning☆91Updated 5 months ago
- [NeurIPS 2024] Code for Dual Prototype Evolving for Test-Time Generalization of Vision-Language Models☆30Updated last month
- Self-training LLaVA for medical☆14Updated 2 months ago
- [ACLW'24] LMPT: Prompt Tuning with Class-Specific Embedding Loss for Long-tailed Multi-Label Visual Recognition☆50Updated 5 months ago
- ☆16Updated 3 months ago
- This is the official repository for "LoRKD: Low-Rank Knowledge Decomposition for Medical Foundation Models"☆10Updated 3 months ago
- [NeurIPS 2024 Spotlight] Code for the paper "Flex-MoE: Modeling Arbitrary Modality Combination via the Flexible Mixture-of-Experts"☆27Updated 3 months ago
- [CVPR 2024] FairCLIP: Harnessing Fairness in Vision-Language Learning☆61Updated 3 weeks ago
- The official implementation for MTLoRA: A Low-Rank Adaptation Approach for Efficient Multi-Task Learning☆39Updated 6 months ago
- [ICLR2023] PLOT: Prompt Learning with Optimal Transport for Vision-Language Models☆153Updated last year
- Code for the paper "Explain Any Concept: Segment Anything Meets Concept-Based Explanation". Poster @ NeurIPS 2023☆41Updated last year
- Official pytorch implementation of "RITUAL: Random Image Transformations as a Universal Anti-hallucination Lever in Large Vision Language…☆11Updated last month
- Implementation of the paper "PerSense: Personalized Instance Segmentation in Dense Images"☆24Updated 2 weeks ago
- The official implementation of CVPR 24' Paper "Learning Transferable Negative Prompts for Out-of-Distribution Detection"☆48Updated 9 months ago
- The official implementation of the paper **Learning Concise and Descriptive Attributes for Visual Recognition**☆42Updated last year
- MMedPO: Aligning Medical Vision-Language Models with Clinical-Aware Multimodal Preference Optimization☆15Updated last month
- Official code for ICCV 2023 paper, "Improving Zero-Shot Generalization for CLIP with Synthesized Prompts"☆96Updated 10 months ago
- [AAAI'25, CVPRW 2024] Official repository of paper titled "Learning to Prompt with Text Only Supervision for Vision-Language Models".☆97Updated last month
- Code and Dataset for the paper "LAMM: Label Alignment for Multi-Modal Prompt Learning" AAAI 2024☆31Updated last year