naver-ai / eccv-captionLinks
Extended COCO Validation (ECCV) Caption dataset (ECCV 2022)
☆55Updated last year
Alternatives and similar repositories for eccv-caption
Users that are interested in eccv-caption are comparing it to the libraries listed below
Sorting:
- ☆46Updated last year
- ☆51Updated 2 years ago
- L-Verse: Bidirectional Generation Between Image and Text☆108Updated 3 months ago
- Official Pytorch implementation of "Probabilistic Cross-Modal Embedding" (CVPR 2021)☆132Updated last year
- Code and data for ImageCoDe, a contextual vison-and-language benchmark☆40Updated last year
- The SVO-Probes Dataset for Verb Understanding☆31Updated 3 years ago
- This is an official implementation of GRIT-VLP☆21Updated 2 years ago
- ☆120Updated 2 years ago
- Command-line tool for downloading and extending the RedCaps dataset.☆48Updated last year
- NLX-GPT: A Model for Natural Language Explanations in Vision and Vision-Language Tasks, CVPR 2022 (Oral)☆48Updated last year
- [ECCV2024][ICCV2023] Official PyTorch implementation of SeiT++ and SeiT☆55Updated 11 months ago
- CapDec: SOTA Zero Shot Image Captioning Using CLIP and GPT2, EMNLP 2022 (findings)☆197Updated last year
- ☆38Updated last year
- Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training☆138Updated 2 years ago
- PyTorch code for "VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks" (CVPR2022)☆205Updated 2 years ago
- Official Implementation of CoSMo: Content-Style Modulation for Image Retrieval with Text Feedback presented in CVPR 2021.☆66Updated 2 years ago
- [ECCV-2022]Grounding Visual Representations with Texts for Domain Generalization☆31Updated 2 years ago
- Large-Scale Bidirectional Training for Zero-Shot Image Captioning☆21Updated 2 years ago
- Data repository for the VALSE benchmark.☆37Updated last year
- [ECCV 2024] Official PyTorch implementation of "HYPE: Hyperbolic Entailment Filtering for Underspecified Images and Texts"☆18Updated 7 months ago
- Visual Language Transformer Interpreter - An interactive visualization tool for interpreting vision-language transformers☆94Updated last year
- Toolkit for Elevater Benchmark☆73Updated last year
- PyTorch code for MUST☆108Updated 2 months ago
- [ICML 2022] Code and data for our paper "IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages"☆49Updated 2 years ago
- Reliably download millions of images efficiently☆116Updated 4 years ago
- ☆29Updated 2 years ago
- Official Code Release for "Diagnosing and Rectifying Vision Models using Language" (ICLR 2023)☆34Updated 2 years ago
- A PyTorch implementation of Multimodal Few-Shot Learning with Frozen Language Models with OPT.☆43Updated 2 years ago
- Implementation for the paper "Reliable Visual Question Answering Abstain Rather Than Answer Incorrectly" (ECCV 2022: https//arxiv.org/abs…☆34Updated 2 years ago
- Official repository for the ICCV 2023 paper: "Waffling around for Performance: Visual Classification with Random Words and Broad Concepts…☆57Updated 2 years ago