Data and code for NeurIPS 2021 Paper "IconQA: A New Benchmark for Abstract Diagram Understanding and Visual Language Reasoning".
☆55Jan 28, 2024Updated 2 years ago
Alternatives and similar repositories for IconQA
Users that are interested in IconQA are comparing it to the libraries listed below.
- ☆15Mar 3, 2022Updated 4 years ago
- This repo contains code for our ICML 2023 paper: MEWL: Few-shot multimodal word learning with referential uncertainty☆15Jun 10, 2023Updated 2 years ago
- Learning Algebraic Representation for Systematic Generalization in Abstract Reasoning☆11Jul 20, 2022Updated 3 years ago
- What Can You Learn from Your Muscles? Learning Visual Representation from Human Interactions (https://arxiv.org/pdf/2010.08539.pdf)☆39Mar 30, 2021Updated 5 years ago
- Data and code for NeurIPS 2022 Paper "Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering".☆733Sep 19, 2024Updated last year
- Code for "SlotLifter: Slot-guided Feature Lifting for Learning Object-centric Radiance Fields" (ECCV 2024)☆12Oct 30, 2024Updated last year
- MathVista: data, code, and evaluation for Mathematical Reasoning in Visual Contexts☆359Sep 29, 2025Updated 7 months ago
- This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs.☆19Jun 27, 2024Updated last year
- [ICML 2023] "Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?" by Ruisi Cai, Zhenyu Zhang, Zhangyang Wang☆16May 4, 2023Updated 3 years ago
- Official Release of NeurIPS 2020 Spotlight paper "Generative Neurosymbolic Machines"☆37Mar 9, 2024Updated 2 years ago
- Code for ICML2018 - Generalized Earley Parser: Bridging Symbolic Grammars and Sequence Data for Future Prediction.☆36Apr 20, 2019Updated 7 years ago
- Code and data for "Broaden the Vision: Geo-Diverse Visual Commonsense Reasoning" (EMNLP 2021).☆29Sep 4, 2021Updated 4 years ago
- ☆19Jan 9, 2023Updated 3 years ago
- ☆21Oct 10, 2023Updated 2 years ago
- Download Web-10K data by querying Bing Image Search☆10Feb 1, 2022Updated 4 years ago
- Submission to the inverse scaling prize☆23Jul 23, 2023Updated 2 years ago
- [TACL'23] VSR: A probing benchmark for spatial understanding of vision-language models.☆146Mar 25, 2023Updated 3 years ago
- Code and Data for our CVPR 2021 paper "Structured Scene Memory for Vision-Language Navigation"☆43Jul 31, 2021Updated 4 years ago
- OpenLock Environment for OpenAI Gym☆19Feb 16, 2021Updated 5 years ago
- This repository is the official implementation of Improving Object-centric Learning With Query Optimization☆51May 30, 2023Updated 2 years ago
- Code for CVPR2018 - Human-centric Indoor Scene Synthesis Using Stochastic Grammar.☆88Apr 15, 2018Updated 8 years ago
- [IJCAI-2024] The official code of Self-Supervised Pre-training with Symmetric Superimposition Modeling for Scene Text Recognition☆10Aug 10, 2025Updated 9 months ago
- opentqa is an open framework for textbook question answering, which includes xtqa, mcan, cmr, mfb, mutan.☆11Mar 27, 2021Updated 5 years ago
- A PyTorch implementation of a data augmentation method for visual question answering☆21May 25, 2023Updated 2 years ago
- [NeurIPS2024] Official code for (IMA) Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs☆23Oct 15, 2024Updated last year
- Neural-Grammar-Symbolic Learning with Back-Search☆55Jul 25, 2024Updated last year
- ☆19Nov 25, 2022Updated 3 years ago
- Official code for our EMNLP2021 Outstanding Paper MindCraft: Theory of Mind Modeling for Situated Dialogue in Collaborative Tasks☆21May 18, 2023Updated 3 years ago
- PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging"☆37Oct 11, 2023Updated 2 years ago
- Abstract Spatial-Temporal Reasoning via Probabilistic Abduction and Execution☆27Mar 18, 2021Updated 5 years ago
- Code for ICCV2021 paper: Calibrating Concepts and Operations: Towards Symbolic Reasoning on Real Images☆15Jan 24, 2023Updated 3 years ago
- ☆10Oct 1, 2020Updated 5 years ago
- MetaStyle: Three-Way Trade-Off Among Speed, Flexibility, and Quality in Neural Style Transfer☆70Jun 17, 2024Updated last year
- [ICML 2022] Latent Diffusion Energy-Based Model for Interpretable Text Modeling☆67Mar 1, 2026Updated 2 months ago
- [CVPR2023] The code for 《Position-guided Text Prompt for Vision-Language Pre-training》☆150Jun 7, 2023Updated 2 years ago
- Official code for "What Makes for Good Visual Tokenizers for Large Language Models?".☆60Jun 27, 2023Updated 2 years ago
- ☆133Jul 8, 2024Updated last year
- Repository for Multilingual-VQA task created during HuggingFace JAX/Flax community week.☆34Jul 27, 2021Updated 4 years ago
- ☆33Jul 8, 2024Updated last year