Code for paper "Super-CLEVR: A Virtual Benchmark to Diagnose Domain Robustness in Visual Reasoning"
☆47Feb 19, 2026Updated last month
Alternatives and similar repositories for Super-CLEVR
Users that are interested in Super-CLEVR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ImageNet3D: Towards General-Purpose Object-Level 3D Understanding☆21Dec 6, 2024Updated last year
- [ICLR2024] (EvALign-ICL Benchmark) Beyond Task Performance: Evaluating and Reducing the Flaws of Large Multimodal Models with In-Context …☆22Mar 1, 2024Updated 2 years ago
- ☆22Aug 7, 2023Updated 2 years ago
- [ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data☆13Sep 30, 2023Updated 2 years ago
- VideoNIAH: A Flexible Synthetic Method for Benchmarking Video MLLMs☆55Mar 9, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆17Dec 13, 2023Updated 2 years ago
- code for Imagination-Policy☆15Dec 1, 2024Updated last year
- ☆23Aug 26, 2023Updated 2 years ago
- Code Release of "3D Concept Grounding on Neural Fields (NeurIPS2022)"☆15Feb 13, 2023Updated 3 years ago
- [EMNLP 2021] Code and data for our paper "Vision-and-Language or Vision-for-Language? On Cross-Modal Influence in Multimodal Transformers…☆20Jan 17, 2022Updated 4 years ago
- Initial commit☆13Aug 14, 2023Updated 2 years ago
- Code to reproduce the experiments in the paper: Does CLIP Bind Concepts? Probing Compositionality in Large Image Models.☆16Oct 14, 2023Updated 2 years ago
- ☆18Jul 10, 2024Updated last year
- ☆13May 9, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- This repository provides the code for training the position constrained generative grasp sampler from the paper Constrained Generative Sa…☆22Dec 4, 2024Updated last year
- [CVPR'25] A vision question answering (VQA) benchmark for 6D spatial reasoning.☆20Mar 8, 2026Updated 2 weeks ago
- This is the official code implementation of Bongard-OpenWorld (ICLR 2024).☆14Jan 6, 2025Updated last year
- VideoCC is a dataset containing (video-URL, caption) pairs for training video-text machine learning models. It is created using an automa…☆78Dec 5, 2022Updated 3 years ago
- Local self-attention in Transformer for visual question answering☆13Mar 17, 2024Updated 2 years ago
- ☆16Oct 11, 2021Updated 4 years ago
- A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning☆645Aug 30, 2021Updated 4 years ago
- [AAAI2023] Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task (Oral)☆42Mar 23, 2024Updated 2 years ago
- VHTest☆16Oct 31, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 3DCoMPaT++: An improved large-scale 3D vision dataset for compositional recognition☆98Oct 14, 2025Updated 5 months ago
- VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models☆77Jul 13, 2024Updated last year
- Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation…☆22Nov 8, 2023Updated 2 years ago
- Repo for the EMNLP 2023 paper "A Simple Knowledge-Based Visual Question Answering"☆25Dec 14, 2023Updated 2 years ago
- [ICRA 2024] SG-Bot: Object Rearrangement via Coarse-to-Fine Robotic Imagination on Scene Graphs☆20Jun 1, 2024Updated last year
- [NeurIPS'24] This repository is the implementation of "SpatialRGPT: Grounded Spatial Reasoning in Vision Language Models"☆317Dec 14, 2024Updated last year
- Korean large emotion labeled dataset (EmoNSMC)☆14Mar 5, 2020Updated 6 years ago
- A bug-free and improved implementation of LLaVA-UHD, based on the code from the official repo☆35Aug 12, 2024Updated last year
- KUDA: Keypoints to Unify Dynamics Learning and Visual Prompting for Open-Vocabulary Robotic Manipulation☆22Apr 23, 2025Updated 11 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆18Aug 1, 2024Updated last year
- Official code for our COLING 2022 paper: In-Context Learning for Empathetic Dialogue Generation☆20Mar 1, 2023Updated 3 years ago
- Official Implementation for CVPR 2023 paper "Divide and Conquer: Answering Questions with Object Factorization and Compositional Reasonin…☆10Jun 16, 2024Updated last year
- ☆15May 23, 2022Updated 3 years ago
- Evaluation codes of "From Images to Textual Prompts: Zero-shot VQA with Frozen Large Language Models".☆16May 15, 2023Updated 2 years ago
- [ICLR 2024] Analyzing and Mitigating Object Hallucination in Large Vision-Language Models☆155Apr 30, 2024Updated last year
- Code for Galgali et al, 2023☆14Jan 11, 2023Updated 3 years ago