PyTorch implementation of ICLR 2020 paper "CLEVRER: CoLlision Events for Video REpresentation and Reasoning"
☆128Nov 6, 2020Updated 5 years ago
Alternatives and similar repositories for CLEVRER
Users that are interested in CLEVRER are comparing it to the libraries listed below
Sorting:
- This repo contains the pytorch implementation for Dynamic Concept Learner (accepted by ICLR 2021).☆37Jul 8, 2024Updated last year
- Neural-symbolic visual question answering☆280Mar 27, 2023Updated 2 years ago
- [NeurIPS 2021] Dynamic Visual Reasoning by Learning Differentiable Physics Models from Video and Language☆47Apr 11, 2023Updated 2 years ago
- [ICRA 2019] Propagation Networks for Model-based Control Under Partial Observation☆48Apr 18, 2019Updated 6 years ago
- ☆40Jul 19, 2022Updated 3 years ago
- Burgess et al. "MONet: Unsupervised Scene Decomposition and Representation"☆89Dec 3, 2022Updated 3 years ago
- Repo for "Physion: Evaluating Physical Prediction from Vision in Humans and Machines", presented at NeurIPS 2021 (Datasets & Benchmarks t…☆85Feb 9, 2023Updated 3 years ago
- A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning☆643Aug 30, 2021Updated 4 years ago
- An implementation of DIP-VAE from the paper "Variational Inference of Disentangled Latent Concepts from Unlabelled Observations" by Kumar…☆26Apr 20, 2018Updated 7 years ago
- [NeurIPS 2020] Causal Discovery in Physical Systems from Videos☆82Oct 31, 2022Updated 3 years ago
- [ICLR 2019] ]Unsupervised Discovery of Parts, Structure, and Dynamics☆46Dec 26, 2022Updated 3 years ago
- Implementation of our PR 2020 paper:Unsupervised Text-to-Image Synthesis☆13Jul 9, 2020Updated 5 years ago
- ☆21Nov 5, 2024Updated last year
- PyTorch implementation of paper "Visual Concept-Metaconcept Learner", NeruIPS 2019☆47Dec 3, 2019Updated 6 years ago
- [ACL 2020] PyTorch code for TVQA+: Spatio-Temporal Grounding for Video Question Answering☆133Oct 25, 2022Updated 3 years ago
- [CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606)☆69Jun 10, 2020Updated 5 years ago
- Code for the paper: Semantic Conditioned Dynamic Modulation for Temporal Sentence Grounding in Videos☆71Sep 7, 2021Updated 4 years ago
- ☆38Apr 18, 2024Updated last year
- This is the official source code for SLATE. We provide the code for the model, the training code, and a dataset loader for the 3D Shapes …☆88Jan 9, 2023Updated 3 years ago
- Cooperative Vision-and-Dialog Navigation☆72Nov 22, 2022Updated 3 years ago
- Code and data for the project "Visually grounded continual learning of compositional semantics"☆22Dec 27, 2022Updated 3 years ago
- Heterogeneous Memory Enhanced Multimodal Attention Model for VideoQA☆54Sep 13, 2021Updated 4 years ago
- Codebase for "Exploring the Landscape of Spatial Robustness" (ICML'19, https://arxiv.org/abs/1712.02779).☆25Sep 16, 2019Updated 6 years ago
- ☆10Jan 20, 2021Updated 5 years ago
- An unofficial re-implementation of Graph Structure of Neural Networks (Jiaxuan You · Kaiming He · Jure Leskovec · Saining Xie) ICML 2020☆10Jul 27, 2020Updated 5 years ago
- Code for our project CROWN (Conversational Passage Ranking by Reasoning over Word Networks)☆10Jan 11, 2024Updated 2 years ago
- The code repository for "Cross-Modal and Hierarchical Modeling of Video and Text" in PyTorch☆20Apr 26, 2020Updated 5 years ago
- Implementation for the paper "Hierarchical Conditional Relation Networks for Video Question Answering" (Le et al., CVPR 2020, Oral)☆134Jul 25, 2024Updated last year
- CLEVR-Robot: a reinforcement learning environment combining vision, language and control.☆138Aug 4, 2024Updated last year
- ☆44Mar 8, 2021Updated 4 years ago
- Adversarial Structure Matching for Structured Prediction Tasks☆11Jun 4, 2024Updated last year
- Weakly Supervised Video Moment Retrieval from Text Queries☆43Jul 20, 2020Updated 5 years ago
- ☆14Jul 13, 2021Updated 4 years ago
- Code for the "Overcoming Sparsity Artifacts in Crosscoders to Interpret Chat-Tuning" paper.☆16Nov 21, 2025Updated 3 months ago
- 中文原生等级化代码能力测试基准☆15Apr 11, 2024Updated last year
- Official PyTorch implementation of "SPACE: Unsupervised Object-Oriented Scene Representation via Spatial Attention and Decomposition"☆104Oct 3, 2023Updated 2 years ago
- ☆26Aug 4, 2020Updated 5 years ago
- Use the Force Luke! Learning to Predict Physical Forces by Simulating Effects [CVPR2020] (https://arxiv.org/pdf/2003.12045.pdf)☆74Oct 3, 2023Updated 2 years ago
- Official implementation of ICCV19 oral paper Zero-Shot grounding of Objects from Natural Language Queries (https://arxiv.org/abs/1908.071…☆71Apr 22, 2020Updated 5 years ago