chuangg / CLEVRER
PyTorch implementation of ICLR 2020 paper "CLEVRER: CoLlision Events for Video REpresentation and Reasoning"
☆114Updated 4 years ago
Alternatives and similar repositories for CLEVRER:
Users who are interested in CLEVRER are comparing it to the libraries listed below
- Neural-symbolic visual question answering☆262Updated last year
- CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning☆103Updated 4 years ago
- PyTorch implementation of paper "Visual Concept-Metaconcept Learner", NeurIPS 2019☆48Updated 5 years ago
- Learning Long-term Visual Dynamics with Region Proposal Interaction Networks (ICLR 2021)☆112Updated 2 years ago
- Neural State Machine implemented in PyTorch☆70Updated 5 years ago
- ☆38Updated 2 years ago
- Code and models of MOCA (Modular Object-Centric Approach) proposed in "Factorizing Perception and Policy for Interactive Instruction Foll…☆37Updated 6 months ago
- This repo contains the PyTorch implementation of Dynamic Concept Learner (accepted at ICLR 2021).☆37Updated 6 months ago
- ☆40Updated 11 months ago
- [NeurIPS 2021 Spotlight] Learning to Compose Visual Relations☆101Updated last year
- This is the official source code for SLATE. We provide the code for the model, the training code, and a dataset loader for the 3D Shapes …☆84Updated 2 years ago
- Code for the paper Learning the Predictability of the Future (CVPR 2021)☆163Updated last year
- Repository to generate CLEVR-Dialog: A diagnostic dataset for Visual Dialog☆44Updated 4 years ago
- Library for the training and evaluation of object-centric models (ICML 2022)☆65Updated last year
- Official PyTorch implementation of "Improving Generative Imagination in Object-Centric World Models"☆35Updated 2 years ago
- Code for Look for the Change paper published at CVPR 2022☆35Updated 2 years ago
- Code, data and benchmark from the paper "Unmasking the Inductive Biases of Unsupervised Object Representations for Video Sequences".☆36Updated 3 years ago
- Episodic Transformer (E.T.) is a novel attention-based architecture for vision-and-language navigation. E.T. is based on a multimodal tra…☆88Updated last year
- [NeurIPS 2021] Dynamic Visual Reasoning by Learning Differentiable Physics Models from Video and Language☆45Updated last year
- An implementation of the MONet model for unsupervised scene decomposition in PyTorch☆58Updated 2 years ago
- Train Scene Graph Generation for Visual Genome and GQA in PyTorch >= 1.2 with improved zero and few-shot generalization.☆132Updated last year
- Neural Networks that convert input movies into Physical Scene Graphs (PSGs)☆62Updated 3 years ago
- ☆86Updated 2 years ago
- Official repository of the NeurIPS 2021 paper: PTR☆33Updated 3 years ago
- PyTorch re-implementation of Multi-Object Representation Learning with Iterative Variational Inference☆58Updated 2 years ago
- Code accompanying EGO-TOPO: Environment Affordances from Egocentric Video (CVPR 2020)☆29Updated 2 years ago
- Multi-object image datasets with ground-truth segmentation masks and generative factors.☆260Updated 3 years ago
- ☆66Updated last year
- Differentiable First-Order Logic Reasoning for Visual Question Answering☆39Updated 3 years ago
- Cornell Touchdown natural language navigation and spatial reasoning dataset.☆99Updated 4 years ago