Computer-Vision-in-the-Wild / Elevater_Toolkit_ICView external linksLinks
Toolkit for Elevater Benchmark
☆76Oct 17, 2023Updated 2 years ago
Alternatives and similar repositories for Elevater_Toolkit_IC
Users that are interested in Elevater_Toolkit_IC are comparing it to the libraries listed below
Sorting:
- ☆27Aug 28, 2023Updated 2 years ago
- Code for 'Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality', EMNLP 2022☆31May 29, 2023Updated 2 years ago
- 📍 Official repository of paper "ProtoCLIP: Prototypical Contrastive Language Image Pretraining" (IEEE TNNLS 2023)☆55Nov 8, 2023Updated 2 years ago
- Official implementation of AAAI 2023 paper "Parameter-efficient Model Adaptation for Vision Transformers"☆106Aug 7, 2023Updated 2 years ago
- [NeurIPS 2023] A faithful benchmark for vision-language compositionality☆89Feb 13, 2024Updated 2 years ago
- This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs.☆19Jun 27, 2024Updated last year
- On-Device Domain Generalization☆46Nov 9, 2022Updated 3 years ago
- PyTorch code for MUST☆108May 1, 2025Updated 9 months ago
- [ECCV2022] New benchmark for evaluating pre-trained model; New supervised contrastive learning framework.☆110Dec 8, 2023Updated 2 years ago
- ☆124Feb 21, 2023Updated 2 years ago
- [ECCV 2022] Tackling Long-Tailed Category Distribution Under Domain Shifts☆25Nov 29, 2022Updated 3 years ago
- Create generated datasets and train robust classifiers☆36Sep 1, 2023Updated 2 years ago
- [CVPR 2022] Official code for "Unified Contrastive Learning in Image-Text-Label Space"☆405Nov 10, 2023Updated 2 years ago
- A collection of papers on the topic of ``Computer Vision in the Wild (CVinW)''☆1,354Mar 14, 2024Updated last year
- (IJCV 2023) Offical implementation of "SCT: A Simple Baseline for Parameter-Efficient Fine-Tuning via Salient Channels"☆13Mar 20, 2025Updated 10 months ago
- ☆10Jul 5, 2024Updated last year
- KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a…☆12Mar 24, 2023Updated 2 years ago
- Code for the EMNLP 2021 Oral paper "Are Gender-Neutral Queries Really Gender-Neutral? Mitigating Gender Bias in Image Search" https://arx…☆12Feb 6, 2023Updated 3 years ago
- [ACL 2023] Code and data for our paper "Measuring Progress in Fine-grained Vision-and-Language Understanding"☆13Jun 11, 2023Updated 2 years ago
- SimMatchV2: Semi-Supervised Learning with Graph Consistency☆22Dec 26, 2023Updated 2 years ago
- Experiments and data for the paper "When and why vision-language models behave like bags-of-words, and what to do about it?" Oral @ ICLR …☆292Jun 7, 2023Updated 2 years ago
- REACT (CVPR 2023, Highlight 2.5%)☆142Apr 7, 2023Updated 2 years ago
- Code for the paper: "SuS-X: Training-Free Name-Only Transfer of Vision-Language Models" [ICCV'23]☆106Aug 22, 2023Updated 2 years ago
- [TPAMI] Searching prompt modules for parameter-efficient transfer learning.☆238Dec 8, 2023Updated 2 years ago
- Non-local Modeling for Image Quality Assessment☆13Dec 20, 2023Updated 2 years ago
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!☆25Nov 23, 2024Updated last year
- VisualGPTScore for visio-linguistic reasoning☆27Oct 7, 2023Updated 2 years ago
- Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"☆37Aug 18, 2024Updated last year
- [CVPR 2024] Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Fine-grained Understanding☆55Apr 7, 2025Updated 10 months ago
- ☆12Jan 25, 2023Updated 3 years ago
- CVPR 2021 "Camera Pose Matters: Improving Depth Prediction by Mitigating Pose Distribution Bias"☆13Sep 19, 2021Updated 4 years ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆11May 14, 2025Updated 8 months ago
- Repository for the paper: dense and aligned captions (dac) promote compositional reasoning in vl models☆27Nov 29, 2023Updated 2 years ago
- Implementation of "DIME-FM: DIstilling Multimodal and Efficient Foundation Models"☆15Oct 12, 2023Updated 2 years ago
- Test-Time Distribution Normalization For Contrastively Learned Vision-language Models☆27Jan 15, 2024Updated 2 years ago
- PyTorch implementation of the paper "MILAN: Masked Image Pretraining on Language Assisted Representation" https://arxiv.org/pdf/2208.0604…☆84Aug 16, 2022Updated 3 years ago
- Official implementation of T2Vs Meet VLMs: A Scalable Multimodal Dataset for Visual Harmfulness Recognition☆21Oct 23, 2024Updated last year
- ☆15Mar 31, 2022Updated 3 years ago
- 示范faster-RCNN-pytorch如何生成20000个anchor box及可视化最终的图像☆13Jul 6, 2019Updated 6 years ago