dimipapa / cookingprograms
Code and data for "Learning Program Representations for Food Images and Cooking Recipes" (oral at CVPR 2022)
☆15 Updated 3 years ago
Alternatives and similar repositories for cookingprograms
Users who are interested in cookingprograms are comparing it to the libraries listed below.
- VisualCOMET: Reasoning about the Dynamic Context of a Still Image ☆87 Updated 2 years ago
- ☆40 Updated 2 years ago
- Pytorch code for Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners ☆115 Updated 2 years ago
- PyTorch code for "Unifying Vision-and-Language Tasks via Text Generation" (ICML 2021) ☆373 Updated 2 years ago
- [ICML 2022] This is the pytorch implementation of "Rethinking Attention-Model Explainability through Faithfulness Violation Test" (https:… ☆19 Updated 3 years ago
- Extended Intramodal and Intermodal Semantic Similarity Judgments for MS-COCO ☆52 Updated 5 years ago
- Official Pytorch implementation of "Probabilistic Cross-Modal Embedding" (CVPR 2021) ☆133 Updated last year
- [CVPR 2021] Counterfactual VQA: A Cause-Effect Look at Language Bias ☆125 Updated 3 years ago
- ☆106 Updated 3 years ago
- MERLOT: Multimodal Neural Script Knowledge Models ☆224 Updated 3 years ago
- Code and data for "Broaden the Vision: Geo-Diverse Visual Commonsense Reasoning" (EMNLP 2021). ☆29 Updated 4 years ago
- [CVPR 2022] A large-scale public benchmark dataset for video question-answering, especially about evidence and commonsense reasoning. The… ☆71 Updated 2 months ago
- Dataset and starting code for visual entailment dataset ☆111 Updated 3 years ago
- Official implementation of our EMNLP 2022 paper "CPL: Counterfactual Prompt Learning for Vision and Language Models" ☆34 Updated 2 years ago
- PyTorch code for "VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks" (CVPR2022) ☆206 Updated 2 years ago
- An Empirical Study of GPT-3 for Few-Shot Knowledge-Based VQA, AAAI 2022 (Oral) ☆85 Updated 3 years ago
- ☆23 Updated 2 years ago
- Code for the ICCV'21 paper "Context-aware Scene Graph Generation with Seq2Seq Transformers" ☆43 Updated 3 years ago
- [ICML 2022] Code and data for our paper "IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages" ☆49 Updated 2 years ago
- CapDec: SOTA Zero Shot Image Captioning Using CLIP and GPT2, EMNLP 2022 (findings) ☆199 Updated last year
- Train Scene Graph Generation for Visual Genome and GQA in PyTorch >= 1.2 with improved zero and few-shot generalization. ☆138 Updated 2 years ago
- Bridging Knowledge Graphs to Generate Scene Graphs, ECCV 2020 ☆70 Updated last year
- [EMNLP'21] Visual News: Benchmark and Challenges in News Image Captioning ☆99 Updated last year
- ☆120 Updated 2 years ago
- NLX-GPT: A Model for Natural Language Explanations in Vision and Vision-Language Tasks, CVPR 2022 (Oral) ☆48 Updated last year
- ☆12 Updated 5 years ago
- The SVO-Probes Dataset for Verb Understanding ☆31 Updated 3 years ago
- Repo for ICCV 2021 paper: Beyond Question-Based Biases: Assessing Multimodal Shortcut Learning in Visual Question Answering ☆27 Updated last year
- GraphVQA: Language-Guided Graph Neural Networks for Scene Graph Question Answering ☆65 Updated 4 years ago
- Align and Prompt: Video-and-Language Pre-training with Entity Prompts ☆188 Updated 4 months ago