Data and code for NeurIPS 2021 Paper "IconQA: A New Benchmark for Abstract Diagram Understanding and Visual Language Reasoning".
☆55Jan 28, 2024Updated 2 years ago
Alternatives and similar repositories for IconQA
Users that are interested in IconQA are comparing it to the libraries listed below.
- ☆15Mar 3, 2022Updated 4 years ago
- This repo contains code for our ICML 2023 paper: MEWL: Few-shot multimodal word learning with referential uncertainty☆15Jun 10, 2023Updated 2 years ago
- ☆14Jun 1, 2022Updated 3 years ago
- Code for paper "Point and Ask: Incorporating Pointing into Visual Question Answering"☆19Oct 4, 2022Updated 3 years ago
- ☆24Jun 18, 2025Updated 10 months ago
- Data and code for NeurIPS 2022 Paper "Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering".☆733Sep 19, 2024Updated last year
- MathVista: data, code, and evaluation for Mathematical Reasoning in Visual Contexts☆358Sep 29, 2025Updated 7 months ago
- Official Repository of NeurIPS2021 paper: PTR☆32Dec 17, 2021Updated 4 years ago
- This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs.☆19Jun 27, 2024Updated last year
- PyTorch code for "VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks" (CVPR2022)☆211Dec 18, 2022Updated 3 years ago
- [ICML 2023] "Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?" by Ruisi Cai, Zhenyu Zhang, Zhangyang Wang☆16May 4, 2023Updated 2 years ago
- Use FAtiMA-Toolkit to create agents for Don't Starve Together.☆20Nov 21, 2018Updated 7 years ago
- [NeurIPS 2023] Learning Energy-Based Prior Model with Diffusion-Amortized MCMC☆13Mar 1, 2026Updated 2 months ago
- Code for ICML2018 - Generalized Earley Parser: Bridging Symbolic Grammars and Sequence Data for Future Prediction.☆36Apr 20, 2019Updated 7 years ago
- ☆37Oct 7, 2023Updated 2 years ago
- EPIC-Kitchens-100 Action Recognition baselines: TSN, TRN, TSM☆33Mar 15, 2022Updated 4 years ago
- Code and data for "Broaden the Vision: Geo-Diverse Visual Commonsense Reasoning" (EMNLP 2021).☆29Sep 4, 2021Updated 4 years ago
- ☆19Jan 9, 2023Updated 3 years ago
- ☆21Oct 10, 2023Updated 2 years ago
- Download Web-10K data by querying Bing Image Search☆10Feb 1, 2022Updated 4 years ago
- Submission to the inverse scaling prize☆23Jul 23, 2023Updated 2 years ago
- [TACL'23] VSR: A probing benchmark for spatial understanding of vision-language models.☆143Mar 25, 2023Updated 3 years ago
- Code and Data for our CVPR 2021 paper "Structured Scene Memory for Vision-Language Navigation"☆43Jul 31, 2021Updated 4 years ago
- OpenLock Environment for OpenAI Gym☆19Feb 16, 2021Updated 5 years ago
- ☆23Aug 26, 2024Updated last year
- opentqa is an open framework for textbook question answering, which includes xtqa, mcan, cmr, mfb, mutan.☆11Mar 27, 2021Updated 5 years ago
- PyTorch re-implementation of Multi-Object Representation Learning with Iterative Variational Inference☆59Sep 3, 2022Updated 3 years ago
- A PyTorch implementation of a data augmentation method for visual question answering☆21May 25, 2023Updated 2 years ago
- Supplementary material for the ISMIR 2020 paper: “Deconstruct, Analyse, Reconstruct: how to improve tempo, beat, and downbeat estimation”…☆12Mar 2, 2021Updated 5 years ago
- [NeurIPS2024] Official code for (IMA) Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs☆23Oct 15, 2024Updated last year
- Neural-Grammar-Symbolic Learning with Back-Search☆55Jul 25, 2024Updated last year
- ☆19Nov 25, 2022Updated 3 years ago
- PyTorch code for Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles (DANCE)☆23Nov 29, 2022Updated 3 years ago
- Official code for our EMNLP2021 Outstanding Paper MindCraft: Theory of Mind Modeling for Situated Dialogue in Collaborative Tasks☆21May 18, 2023Updated 2 years ago
- PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging"☆37Oct 11, 2023Updated 2 years ago
- Abstract Spatial-Temporal Reasoning via Probabilistic Abduction and Execution☆27Mar 18, 2021Updated 5 years ago
- ☆20Apr 5, 2026Updated 3 weeks ago
- Code for ICCV2021 paper: Calibrating Concepts and Operations: Towards Symbolic Reasoning on Real Images☆15Jan 24, 2023Updated 3 years ago
- Codes of CVPR2022 paper: Fixing Malfunctional Objects With Learned Physical Simulation and Functional Prediction☆32Aug 23, 2022Updated 3 years ago