robert-mcdermott / LLM-Image-Classification
Image Classification Testing with LLMs
☆57Updated last year
Alternatives and similar repositories for LLM-Image-Classification:
Users that are interested in LLM-Image-Classification are comparing it to the libraries listed below
- Code implementation of our NeurIPS 2023 paper: Vocabulary-free Image Classification☆106Updated 11 months ago
- ☆61Updated 6 months ago
- ☆61Updated this week
- [CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models"☆263Updated last month
- [CVPR 2024] FairCLIP: Harnessing Fairness in Vision-Language Learning☆61Updated 3 weeks ago
- [ICML 2024] Let Go of Your Labels with Unsupervised Transfer☆52Updated 7 months ago
- [ICLR'25] MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models☆90Updated last week
- Language Grounded Single Source Domain Generalization in Medical Image Segmentation [ISBI2024]☆30Updated 3 months ago
- DreamDA: Generative Data Augmentation with Diffusion Models (Official Implementation)☆24Updated 3 months ago
- Implementation of "VL-Mamba: Exploring State Space Models for Multimodal Learning"☆80Updated 10 months ago
- [CVPR'24] Validation-free few-shot adaptation of CLIP, using a well-initialized Linear Probe (ZSLP) and class-adaptive constraints (CLAP)…☆64Updated 8 months ago
- The official repository implement of Res-VMamba: Fine-Grained Food Category Visual Classification Using Selective State Space Models with…☆61Updated 2 months ago
- [ICLR'24] Consistency-guided Prompt Learning for Vision-Language Models☆65Updated 8 months ago
- [ICCV'23 Main Track, WECIA'23 Oral] Official repository of paper titled "Self-regulating Prompts: Foundational Model Adaptation without F…☆249Updated last year
- [EMNLP'23] ClimateGPT: a specialized LLM for conversations related to Climate Change and Sustainability topics in both English and Arabi…☆77Updated 4 months ago
- [ECCV 2024] API: Attention Prompting on Image for Large Vision-Language Models☆60Updated 3 months ago
- [CVPR 2024] Code for our Paper "DeiT-LT: Distillation Strikes Back for Vision Transformer training on Long-Tailed Datasets"☆39Updated last month
- [ICLR 2024] Official code for the paper "LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts"☆71Updated 8 months ago
- ☆16Updated last year
- ICCV 2023: CLIPN for Zero-Shot OOD Detection: Teaching CLIP to Say No☆132Updated last year
- Bilingual Medical Mixture of Experts LLM☆28Updated 2 months ago
- Towards Evaluating the Robustness of Visual State Space Models☆24Updated 4 months ago
- [NeurIPS 2023] Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization☆103Updated 11 months ago
- [CVPR 2024] Official Code for the Paper "Compositional Chain-of-Thought Prompting for Large Multimodal Models"☆104Updated 7 months ago
- [ACCV 2024] ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes 🚀🚀🚀☆35Updated last week
- [ECCV 2024] Soft Prompt Generation for Domain Generalization☆18Updated 3 months ago
- Holds code for our CVPR'23 tutorial: All Things ViTs: Understanding and Interpreting Attention in Vision.☆180Updated last year
- The code for paper: PeFoM-Med: Parameter Efficient Fine-tuning on Multi-modal Large Language Models for Medical Visual Question Answering☆37Updated 2 months ago
- The official implementation of the paper "MMFuser: Multimodal Multi-Layer Feature Fuser for Fine-Grained Vision-Language Understanding". …☆45Updated 2 months ago
- Plotting heatmaps with the self-attention of the [CLS] tokens in the last layer.☆41Updated 2 years ago