Code for paper "Super-CLEVR: A Virtual Benchmark to Diagnose Domain Robustness in Visual Reasoning"
☆47 · Feb 19, 2026 · Updated 3 months ago
Alternatives and similar repositories for Super-CLEVR
Users that are interested in Super-CLEVR are comparing it to the libraries listed below.
- Official Code for the NeurIPS'23 paper "3D-Aware Visual Question Answering about Parts, Poses and Occlusions" ☆21 · Oct 17, 2024 · Updated last year
- ImageNet3D: Towards General-Purpose Object-Level 3D Understanding ☆21 · Dec 6, 2024 · Updated last year
- [ICLR2024] (EvALign-ICL Benchmark) Beyond Task Performance: Evaluating and Reducing the Flaws of Large Multimodal Models with In-Context … ☆22 · Mar 1, 2024 · Updated 2 years ago
- [ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data ☆13 · Sep 30, 2023 · Updated 2 years ago
- Official Code for Neural Systematic Binder ☆34 · Mar 27, 2023 · Updated 3 years ago
- Matryoshka Multimodal Models ☆123 · Jan 22, 2025 · Updated last year
- VideoNIAH: A Flexible Synthetic Method for Benchmarking Video MLLMs ☆57 · Mar 9, 2025 · Updated last year
- ☆17 · Dec 13, 2023 · Updated 2 years ago
- ☆23 · Aug 26, 2023 · Updated 2 years ago
- Code Release of "3D Concept Grounding on Neural Fields (NeurIPS2022)" ☆15 · Feb 13, 2023 · Updated 3 years ago
- Initial commit ☆13 · Aug 14, 2023 · Updated 2 years ago
- Code to reproduce the experiments in the paper: Does CLIP Bind Concepts? Probing Compositionality in Large Image Models. ☆16 · Oct 14, 2023 · Updated 2 years ago
- ☆18 · Jul 10, 2024 · Updated last year
- ☆13 · May 9, 2023 · Updated 3 years ago
- [NeurIPS 2024] MATH-Vision dataset and code to measure multimodal mathematical reasoning capabilities. ☆138 · May 16, 2025 · Updated last year
- Official implementation of GROOT, CoRL 2023 ☆71 · Nov 4, 2023 · Updated 2 years ago
- This is the official code implementation of Bongard-OpenWorld (ICLR 2024). ☆14 · Jan 6, 2025 · Updated last year
- [CVPR'25 Highlight] A VQA benchmark for 6D spatial reasoning. ☆20 · Apr 29, 2026 · Updated last month
- This repository provides the code for training the position constrained generative grasp sampler from the paper Constrained Generative Sa… ☆22 · Dec 4, 2024 · Updated last year
- Local self-attention in Transformer for visual question answering ☆13 · Mar 17, 2024 · Updated 2 years ago
- Constraint Satisfaction Visual Grounding ☆15 · Aug 10, 2025 · Updated 10 months ago
- A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning ☆650 · Aug 30, 2021 · Updated 4 years ago
- Vision-Language-Action Optimization with Trajectory Ensemble Voting ☆26 · Feb 18, 2026 · Updated 3 months ago
- VHTest ☆16 · Oct 31, 2024 · Updated last year
- VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models ☆79 · Jul 13, 2024 · Updated last year
- Repo for the EMNLP 2023 paper "A Simple Knowledge-Based Visual Question Answering" ☆25 · Dec 14, 2023 · Updated 2 years ago
- [ICRA 2024] SG-Bot: Object Rearrangement via Coarse-to-Fine Robotic Imagination on Scene Graphs ☆21 · Jun 1, 2024 · Updated 2 years ago
- [NeurIPS'24] This repository is the implementation of "SpatialRGPT: Grounded Spatial Reasoning in Vision Language Models" ☆327 · Dec 14, 2024 · Updated last year
- Korean large emotion labeled dataset (EmoNSMC) ☆14 · Mar 5, 2020 · Updated 6 years ago
- Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation… ☆23 · Nov 8, 2023 · Updated 2 years ago
- KUDA: Keypoints to Unify Dynamics Learning and Visual Prompting for Open-Vocabulary Robotic Manipulation ☆22 · Apr 23, 2025 · Updated last year
- A bug-free and improved implementation of LLaVA-UHD, based on the code from the official repo ☆35 · Aug 12, 2024 · Updated last year
- ☆14 · May 23, 2022 · Updated 4 years ago
- Evaluation codes of "From Images to Textual Prompts: Zero-shot VQA with Frozen Large Language Models". ☆17 · May 15, 2023 · Updated 3 years ago
- CoADNet: Collaborative Aggregation-and-Distribution Networks for Co-Salient Object Detection ☆19 · Jan 8, 2021 · Updated 5 years ago
- ☆33 · Aug 21, 2025 · Updated 9 months ago
- Code for Galgali et al, 2023 ☆14 · Jan 11, 2023 · Updated 3 years ago
- Emergent Visual Grounding in Large Multimodal Models Without Grounding Supervision ☆46 · Oct 19, 2025 · Updated 7 months ago
- The official PyTorch Implementation of the Paper "Adversarial Visual Robustness by Causal Intervention" ☆18 · Oct 6, 2021 · Updated 4 years ago