Data and code for NeurIPS 2021 Paper "IconQA: A New Benchmark for Abstract Diagram Understanding and Visual Language Reasoning".
☆55Jan 28, 2024Updated 2 years ago
Alternatives and similar repositories for IconQA
Users that are interested in IconQA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆15Mar 3, 2022Updated 4 years ago
- This repo contains code for our ICML 2023 paper: MEWL: Few-shot multimodal word learning with referential uncertainty☆15Jun 10, 2023Updated 2 years ago
- ACRE: Abstract Causal REasoning Beyond Covariation☆19Dec 7, 2021Updated 4 years ago
- ☆14Jun 1, 2022Updated 3 years ago
- Code for paper "Point and Ask: Incorporating Pointing into Visual Question Answering"☆19Oct 4, 2022Updated 3 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆24Jun 18, 2025Updated 9 months ago
- Data and code for NeurIPS 2022 Paper "Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering".☆731Sep 19, 2024Updated last year
- Data and code for the ICLR 2023 paper "Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning".☆165Dec 27, 2023Updated 2 years ago
- ☆19Jul 5, 2023Updated 2 years ago
- MathVista: data, code, and evaluation for Mathematical Reasoning in Visual Contexts☆358Sep 29, 2025Updated 6 months ago
- Official Repository of NeurIPS2021 paper: PTR☆32Dec 17, 2021Updated 4 years ago
- Translation of the databricks-dolly-15k dataset to Chinese for commercial use.☆19Apr 17, 2023Updated 2 years ago
- This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs.☆19Jun 27, 2024Updated last year
- PyTorch code for "VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks" (CVPR2022)☆211Dec 18, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ICML 2023] "Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?" by Ruisi Cai, Zhenyu Zhang, Zhangyang Wang☆16May 4, 2023Updated 2 years ago
- [NeurIPS 2023] Learning Energy-Based Prior Model with Diffusion-Amortized MCMC☆13Mar 1, 2026Updated last month
- ☆37Oct 7, 2023Updated 2 years ago
- Code and data for "Broaden the Vision: Geo-Diverse Visual Commonsense Reasoning" (EMNLP 2021).☆29Sep 4, 2021Updated 4 years ago
- ☆21Oct 10, 2023Updated 2 years ago
- Download Web-10K data by querying Bing Image Search☆10Feb 1, 2022Updated 4 years ago
- Submission to the inverse scaling prize☆23Jul 23, 2023Updated 2 years ago
- [TACL'23] VSR: A probing benchmark for spatial undersranding of vision-language models.☆143Mar 25, 2023Updated 3 years ago
- OpenLock Environment for OpenAI Gym☆19Feb 16, 2021Updated 5 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆23Aug 26, 2024Updated last year
- [IJCAI-2024] The official code of Self-Supervised Pre-training with Symmetric Superimposition Modeling for Scene Text Recognition☆10Aug 10, 2025Updated 8 months ago
- PyTorch re-implementation of Multi-Object Representation Learning with Iterative Variational Inference☆59Sep 3, 2022Updated 3 years ago
- [NeurIPS2024] Official code for (IMA) Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs☆23Oct 15, 2024Updated last year
- Neural-Grammar-Symbolic Learning with Back-Search☆55Jul 25, 2024Updated last year
- ☆19Nov 25, 2022Updated 3 years ago
- PyTorch code for Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles (DANCE)☆23Nov 29, 2022Updated 3 years ago
- Official code for our EMNLP2021 Outstanding Paper MindCraft: Theory of Mind Modeling for Situated Dialogue in Collaborative Tasks☆21May 18, 2023Updated 2 years ago
- PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging"☆37Oct 11, 2023Updated 2 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Abstract Spatial-Temporal Reasoning via Probabilistic Abduction and Execution☆27Mar 18, 2021Updated 5 years ago
- Codes of CVPR2022 paper: Fixing Malfunctional Objects With Learned Physical Simulation and Functional Prediction☆32Aug 23, 2022Updated 3 years ago
- MMBERT: Multimodal BERT Pretraining for Improved Medical VQA☆39Mar 22, 2021Updated 5 years ago
- MetaStyle: Three-Way Trade-Off Among Speed, Flexibility, and Quality in Neural Style Transfer☆71Jun 17, 2024Updated last year
- [ICML 2022] Latent Diffusion Energy-Based Model for Interpretable Text Modeling☆67Mar 1, 2026Updated last month
- [CVPR2023] The code for 《Position-guided Text Prompt for Vision-Language Pre-training》☆151Jun 7, 2023Updated 2 years ago
- Official code for "What Makes for Good Visual Tokenizers for Large Language Models?".☆59Jun 27, 2023Updated 2 years ago