☆27Aug 28, 2023Updated 2 years ago
Alternatives and similar repositories for DataDownload
Users that are interested in DataDownload are comparing it to the libraries listed below
- Toolkit for Elevater Benchmark☆77Oct 17, 2023Updated 2 years ago
- Benchmarking and Analyzing Generative Data for Visual Recognition☆26Jul 25, 2023Updated 2 years ago
- A codebase for flexible and efficient Image Text Representation Alignment☆23Jun 20, 2023Updated 2 years ago
- Code for 'Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality', EMNLP 2022☆31May 29, 2023Updated 2 years ago
- ☆13Jul 2, 2025Updated 8 months ago
- How well can Text-to-Image Generative Models understand Ethical Natural Language Interventions?☆13Aug 16, 2023Updated 2 years ago
- Paper Reading of IMCC groups.☆17Oct 22, 2025Updated 5 months ago
- [NeurIPS 2022] code for "K-LITE: Learning Transferable Visual Models with External Knowledge" https://arxiv.org/abs/2204.09222☆53Jun 12, 2023Updated 2 years ago
- 🦩 Official repository of paper "Visual Instruction Tuning with Polite Flamingo" (AAAI-24 Oral)☆65Dec 9, 2023Updated 2 years ago
- An Examination of the Compositionality of Large Generative Vision-Language Models☆19Apr 9, 2024Updated last year
- Repository for the paper: Teaching Structured Vision & Language Concepts to Vision & Language Models☆47Sep 25, 2023Updated 2 years ago
- ☆17Mar 6, 2023Updated 3 years ago
- On-Device Domain Generalization☆46Nov 9, 2022Updated 3 years ago
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!☆25Nov 23, 2024Updated last year
- Official implementation of AAAI 2023 paper "Parameter-efficient Model Adaptation for Vision Transformers"☆106Aug 7, 2023Updated 2 years ago
- ☆27Jul 20, 2024Updated last year
- VisualGPTScore for visio-linguistic reasoning☆27Oct 7, 2023Updated 2 years ago
- Code accompanying paper "Fine-Grained Visual Entailment" [ECCV 2022].☆11Oct 31, 2022Updated 3 years ago
- AlphaFace: High Fidelity and Real-time Face Swapper Robust to Facial Pose☆36Jan 23, 2026Updated last month
- [NeurIPS 2023] Zero-shot Visual Relation Detection via Composite Visual Cues from Large Language Models☆22Oct 21, 2025Updated 5 months ago
- ☆17Dec 13, 2023Updated 2 years ago
- Reproducible scaling laws for contrastive language-image learning (https://arxiv.org/abs/2212.07143)☆188Jun 21, 2025Updated 9 months ago
- ☆125Feb 21, 2023Updated 3 years ago
- ☆29Jan 23, 2024Updated 2 years ago
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization☆31Jan 31, 2023Updated 3 years ago
- point-light photometric stereo for metal surface reconstruction☆17Mar 15, 2023Updated 3 years ago
- ☆45Jun 2, 2025Updated 9 months ago
- Code for the paper Joint Discovery of Object States and Manipulation Actions, ICCV 2017☆14Aug 7, 2018Updated 7 years ago
- Source Code for NeurIPS 2023 Publication: <ChatGPT-Powered Hierarchical Comparisons for Image Classification>☆26Jul 10, 2024Updated last year
- ☆30May 9, 2024Updated last year
- [CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally?☆35Apr 27, 2023Updated 2 years ago
- NegCLIP.☆39Feb 6, 2023Updated 3 years ago
- ☆18Jul 10, 2024Updated last year
- ☆19Oct 22, 2023Updated 2 years ago
- Code for ICCV2021: Discovering Human Interactions with Large-Vocabulary Objects via Query and Multi-Scale Detection☆28Oct 12, 2021Updated 4 years ago
- Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training☆141Dec 16, 2025Updated 3 months ago
- Experiments and data for the paper "When and why vision-language models behave like bags-of-words, and what to do about it?" Oral @ ICLR …☆292Jun 7, 2023Updated 2 years ago
- NICE challenge 2023 Track2 2nd result (total 4th) (CVPR 2023) sponsored by LG AI/Shutterstock/SNU☆11Jun 22, 2023Updated 2 years ago
- How Good is Google Bard's Visual Understanding? An Empirical Study on Open Challenges☆30Sep 24, 2023Updated 2 years ago