Data and code for NeurIPS 2021 Paper "IconQA: A New Benchmark for Abstract Diagram Understanding and Visual Language Reasoning".
☆55Jan 28, 2024Updated 2 years ago
Alternatives and similar repositories for IconQA
Users that are interested in IconQA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repo contains code for our ICML 2023 paper: MEWL: Few-shot multimodal word learning with referential uncertainty☆15Jun 10, 2023Updated 2 years ago
- ACRE: Abstract Causal REasoning Beyond Covariation☆19Dec 7, 2021Updated 4 years ago
- Learning Algebraic Representation for Systematic Generalization in Abstract Reasoning☆11Jul 20, 2022Updated 3 years ago
- ☆14Jun 1, 2022Updated 3 years ago
- Code for paper "Point and Ask: Incorporating Pointing into Visual Question Answering"☆19Oct 4, 2022Updated 3 years ago
- What Can You Learn from Your Muscles? Learning Visual Representation from Human Interactions (https://arxiv.org/pdf/2010.08539.pdf)☆39Mar 30, 2021Updated 4 years ago
- ☆24Jun 18, 2025Updated 9 months ago
- Data and code for NeurIPS 2022 Paper "Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering".☆728Sep 19, 2024Updated last year
- Data and code for the ICLR 2023 paper "Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning".☆165Dec 27, 2023Updated 2 years ago
- Code for "SlotLifter: Slot-guided Feature Lifting for Learning Object-centric Radiance Fields" (ECCV 2024)☆12Oct 30, 2024Updated last year
- ☆39Oct 5, 2022Updated 3 years ago
- MathVista: data, code, and evaluation for Mathematical Reasoning in Visual Contexts☆355Sep 29, 2025Updated 5 months ago
- This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs.☆19Jun 27, 2024Updated last year
- PyTorch code for "VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks" (CVPR2022)☆211Dec 18, 2022Updated 3 years ago
- [ICML 2023] "Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?" by Ruisi Cai, Zhenyu Zhang, Zhangyang Wang☆16May 4, 2023Updated 2 years ago
- PyTorch code for Reasoning Visual Dialogs with Structural and Partial Observations☆42Jun 30, 2021Updated 4 years ago
- ☆37Oct 7, 2023Updated 2 years ago
- EPIC-Kitchens-100 Action Recognition baselines: TSN, TRN, TSM☆33Mar 15, 2022Updated 4 years ago
- Code and data for "Broaden the Vision: Geo-Diverse Visual Commonsense Reasoning" (EMNLP 2021).☆29Sep 4, 2021Updated 4 years ago
- ☆19Jan 9, 2023Updated 3 years ago
- ☆21Oct 10, 2023Updated 2 years ago
- Download Web-10K data by querying Bing Image Search☆10Feb 1, 2022Updated 4 years ago
- [TACL'23] VSR: A probing benchmark for spatial undersranding of vision-language models.☆140Mar 25, 2023Updated 2 years ago
- Code and Data for our CVPR 2021 paper "Structured Scene Memory for Vision-Language Navigation"☆43Jul 31, 2021Updated 4 years ago
- This repository is the official implementation of Improving Object-centric Learning With Query Optimization☆51May 30, 2023Updated 2 years ago
- Code for CVPR2018 - Human-centric Indoor Scene Synthesis Using Stochastic Grammar.☆88Apr 15, 2018Updated 7 years ago
- ☆23Aug 26, 2024Updated last year
- opentqa is a open framework of the textbook question answering, which includes xtqa, mcan, cmr, mfb, mutan.☆11Mar 27, 2021Updated 4 years ago
- PyTorch re-implementation of Multi-Object Representation Learning with Iterative Variational Inference☆59Sep 3, 2022Updated 3 years ago
- A pytorch implemetation of data augmentation method for visual question answering☆21May 25, 2023Updated 2 years ago
- [NeurIPS2024] Official code for (IMA) Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs☆23Oct 15, 2024Updated last year
- Supplementary material for the ISMIR 2020 paper: “Deconstruct, Analyse, Reconstruct: how to improve tempo, beat, and downbeat estimation”…☆11Mar 2, 2021Updated 5 years ago
- Neural-Grammar-Symbolic Learning with Back-Search☆55Jul 25, 2024Updated last year
- ☆19Nov 25, 2022Updated 3 years ago
- PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging"☆37Oct 11, 2023Updated 2 years ago
- Code for ICCV2021 paper: Calibrating Concepts and Operations: Towards Symbolic Reasoning on Real Images☆15Jan 24, 2023Updated 3 years ago
- Codes of CVPR2022 paper: Fixing Malfunctional Objects With Learned Physical Simulation and Functional Prediction☆32Aug 23, 2022Updated 3 years ago
- Learning Precise Affordances from Egocentric Videos for Robotic Manipulation (ICCV 2025)☆19Jan 30, 2026Updated last month
- MMBERT: Multimodal BERT Pretraining for Improved Medical VQA☆39Mar 22, 2021Updated 5 years ago