robert-mcdermott / LLM-Image-Classification
Image Classification Testing with LLMs
☆62Updated last year
Alternatives and similar repositories for LLM-Image-Classification:
Users who are interested in LLM-Image-Classification are comparing it to the repositories listed below.
- [CVPR 2024] FairCLIP: Harnessing Fairness in Vision-Language Learning☆69Updated last week
- Task Residual for Tuning Vision-Language Models (CVPR 2023)☆72Updated last year
- ☆52Updated 8 months ago
- The most impactful papers related to contrastive pretraining for multimodal models!☆62Updated last year
- Code for Label Propagation for Zero-shot Classification with Vision-Language Models (CVPR2024)☆36Updated 8 months ago
- [ICCV'23 Main Track, WECIA'23 Oral] Official repository of paper titled "Self-regulating Prompts: Foundational Model Adaptation without F…☆260Updated last year
- Code implementation of our NeurIPS 2023 paper: Vocabulary-free Image Classification☆107Updated last year
- ☆16Updated last year
- Official repository for "Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition" [ICCV 2023]☆96Updated 11 months ago
- Finetuning CLIP for Few Shot Learning☆40Updated 3 years ago
- Official PyTorch Implementation for Active Prompt Learning in Vision Language Models☆29Updated 9 months ago
- ☆37Updated 2 months ago
- Code for paper "Boosting Continual Learning of Vision-Language Models via Mixture-of-Experts Adapters" CVPR2024☆200Updated 4 months ago
- code for studying OpenAI's CLIP explainability☆30Updated 3 years ago
- [CVPR'24] Validation-free few-shot adaptation of CLIP, using a well-initialized Linear Probe (ZSLP) and class-adaptive constraints (CLAP)…☆68Updated 9 months ago
- Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models (AAAI 2024)☆68Updated last month
- The code for paper: PeFoM-Med: Parameter Efficient Fine-tuning on Multi-modal Large Language Models for Medical Visual Question Answering☆43Updated 4 months ago
- Awesome Fine-Grained Image Classification☆78Updated 7 months ago
- Code for our ICLR 2024 paper "PerceptionCLIP: Visual Classification by Inferring and Conditioning on Contexts"☆77Updated 10 months ago
- The official pytorch implementation of our CVPR-2024 paper "MMA: Multi-Modal Adapter for Vision-Language Models".☆56Updated last month
- Exploring Structured Semantic Prior for Multi Label Recognition with Incomplete Labels [CVPR 2023]☆12Updated last year
- ☆64Updated 2 months ago
- [NeurIPS'24] CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models☆66Updated 3 months ago
- Generating Image Specific Text☆27Updated last year
- Official Implementation of Attentive Mask CLIP (ICCV2023, https://arxiv.org/abs/2212.08653)☆29Updated 10 months ago
- ☆30Updated 3 months ago
- Official implementation of "Delving into CLIP latent space for Video Anomaly Recognition", CVIU 2024☆53Updated 4 months ago
- Visual self-questioning for large vision-language assistant.☆40Updated 5 months ago
- Validating image classification benchmark results on ViTs and ResNets (v2)☆12Updated 2 years ago
- ☆19Updated last year