Code for paper "Super-CLEVR: A Virtual Benchmark to Diagnose Domain Robustness in Visual Reasoning"
☆47Feb 19, 2026Updated 2 months ago
Alternatives and similar repositories for Super-CLEVR
Users that are interested in Super-CLEVR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Code for the NeurIPS'23 paper "3D-Aware Visual Question Answering about Parts, Poses and Occlusions"☆21Oct 17, 2024Updated last year
- ImageNet3D: Towards General-Purpose Object-Level 3D Understanding☆21Dec 6, 2024Updated last year
- [ICLR2024] (EvALign-ICL Benchmark) Beyond Task Performance: Evaluating and Reducing the Flaws of Large Multimodal Models with In-Context …☆22Mar 1, 2024Updated 2 years ago
- ☆22Aug 7, 2023Updated 2 years ago
- Official Code for Neural Systematic Binder☆34Mar 27, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Matryoshka Multimodal Models☆123Jan 22, 2025Updated last year
- VideoNIAH: A Flexible Synthetic Method for Benchmarking Video MLLMs☆57Mar 9, 2025Updated last year
- ☆17Dec 13, 2023Updated 2 years ago
- code for Imagination-Policy☆15Dec 1, 2024Updated last year
- ☆23Aug 26, 2023Updated 2 years ago
- Code Release of "3D Concept Grounding on Neural Fields (NeurIPS2022)"☆15Feb 13, 2023Updated 3 years ago
- Initial commit☆13Aug 14, 2023Updated 2 years ago
- Code to reproduce the experiments in the paper: Does CLIP Bind Concepts? Probing Compositionality in Large Image Models.☆16Oct 14, 2023Updated 2 years ago
- [NeurIPS 2024] MATH-Vision dataset and code to measure multimodal mathematical reasoning capabilities.☆135May 16, 2025Updated 11 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Official implementation of GROOT, CoRL 2023☆69Nov 4, 2023Updated 2 years ago
- [CVPR'25 Highlight] A VQA benchmark for 6D spatial reasoning.☆20Apr 29, 2026Updated last week
- This repository provides the code for training the position constrained generative grasp sampler from the paper Constrained Generative Sa…☆22Dec 4, 2024Updated last year
- This is the official code implementation of Bongard-OpenWorld (ICLR 2024).☆14Jan 6, 2025Updated last year
- VideoCC is a dataset containing (video-URL, caption) pairs for training video-text machine learning models. It is created using an automa…☆78Dec 5, 2022Updated 3 years ago
- Local self-attention in Transformer for visual question answering☆13Mar 17, 2024Updated 2 years ago
- Constraint Satisfaction Visual Grounding☆15Aug 10, 2025Updated 8 months ago
- ☆16Oct 11, 2021Updated 4 years ago
- A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning☆646Aug 30, 2021Updated 4 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- [AAAI2023] Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task (Oral)☆42Mar 23, 2024Updated 2 years ago
- Vision-Language-Action Optimization with Trajectory Ensemble Voting☆26Feb 18, 2026Updated 2 months ago
- VHTest☆16Oct 31, 2024Updated last year
- 3DCoMPaT++: An improved large-scale 3D vision dataset for compositional recognition☆98Oct 14, 2025Updated 6 months ago
- VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models☆79Jul 13, 2024Updated last year
- ☆13Feb 3, 2020Updated 6 years ago
- Repo for the EMNLP 2023 paper "A Simple Knowledge-Based Visual Question Answering"☆25Dec 14, 2023Updated 2 years ago
- [ICRA 2024] SG-Bot: Object Rearrangement via Coarse-to-Fine Robotic Imagination on Scene Graphs☆20Jun 1, 2024Updated last year
- [NeurIPS'24] This repository is the implementation of "SpatialRGPT: Grounded Spatial Reasoning in Vision Language Models"☆320Dec 14, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Korean large emotion labeled dataset (EmoNSMC)☆14Mar 5, 2020Updated 6 years ago
- Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation…☆23Nov 8, 2023Updated 2 years ago
- KUDA: Keypoints to Unify Dynamics Learning and Visual Prompting for Open-Vocabulary Robotic Manipulation☆22Apr 23, 2025Updated last year
- A bug-free and improved implementation of LLaVA-UHD, based on the code from the official repo☆35Aug 12, 2024Updated last year
- ☆18Aug 1, 2024Updated last year
- Official code for our COLING 2022 paper: In-Context Learning for Empathetic Dialogue Generation☆20Mar 1, 2023Updated 3 years ago
- Official Implementation for CVPR 2023 paper "Divide and Conquer: Answering Questions with Object Factorization and Compositional Reasonin…☆10Jun 16, 2024Updated last year