Computer-Vision-in-the-Wild / Elevater_Toolkit_IC
Toolkit for Elevater Benchmark
☆65 Updated 11 months ago
Related projects:
- Repository for the paper: Teaching Structured Vision & Language Concepts to Vision & Language Models ☆44 Updated 11 months ago
- Compress conventional Vision-Language Pre-training data ☆49 Updated 11 months ago
- 📍 Official PyTorch implementation of the paper "ProtoCLIP: Prototypical Contrastive Language Image Pretraining" (IEEE TNNLS) ☆45 Updated 10 months ago
- Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training ☆130 Updated last year
- Dataset pruning for ImageNet and LAION-2B. ☆62 Updated 2 months ago
- PyTorch implementation of the paper "MILAN: Masked Image Pretraining on Language Assisted Representation" https://arxiv.org/pdf/2208.0604… ☆79 Updated 2 years ago
- This repo is the official implementation of UPL (Unsupervised Prompt Learning for Vision-Language Models). ☆103 Updated 2 years ago
- [NeurIPS2023] Official implementation and model release of the paper "What Makes Good Examples for Visual In-Context Learning?" ☆160 Updated 6 months ago
- [EMNLP'23] The official GitHub page for "Evaluating Object Hallucination in Large Vision-Language Models" ☆67 Updated 5 months ago
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes! ☆22 Updated 3 months ago
- [CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally? ☆31 Updated last year
- Code for the paper: "SuS-X: Training-Free Name-Only Transfer of Vision-Language Models" [ICCV'23] ☆91 Updated last year
- Official code for "What Makes for Good Visual Tokenizers for Large Language Models?". ☆54 Updated last year
- Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023) ☆31 Updated last year
- [ICLR2024] The official implementation of the paper "UniAdapter: Unified Parameter-Efficient Transfer Learning for Cross-modal Modeling", by … ☆68 Updated 7 months ago
- [CVPR2023] The code for "Position-guided Text Prompt for Vision-Language Pre-training" ☆148 Updated last year
- This is the official repo for Debiasing Large Visual Language Models, including a Post-Hoc debias method and Visual Debias Decoding strat… ☆66 Updated 5 months ago
- VL-LTR: Learning Class-wise Visual-Linguistic Representation for Long-Tailed Visual Recognition ☆62 Updated last year
- NegCLIP. ☆23 Updated last year
- [arXiv] Cross-Modal Adapter for Text-Video Retrieval ☆53 Updated last year
- [CVPR2022] PyTorch re-implementation of Prompt Distribution Learning ☆14 Updated last year
- [ICCV 2023] Prompt-aligned Gradient for Prompt Tuning ☆146 Updated last year