catalina17 / VideoNavQA
An alternative EQA paradigm and informative benchmark + models (BMVC 2019, ViGIL 2019 spotlight)
☆24Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for VideoNavQA
- cordial-sync is a software package than can be used to reproduce the results from the paper "A Cordial Sync: Going Beyond Marginal Polici…☆37Updated 3 years ago
- "CoPhy: Counterfactual Learning of Physical Dynamics", F. Baradel, N. Neverova, J. Mille, G. Mori, C. Wolf, ICLR'2020☆33Updated 4 years ago
- Code Repository for Regression Planning Networks☆59Updated 3 months ago
- Code, data and benchmark from the paper "Unmasking the Inductive Biases of Unsupervised Object Representations for Video Sequences".☆36Updated 3 years ago
- Code for the "Relational Neural Expectation Maximization: Unsupervised Discovery of Objects and their Interactions" paper.☆73Updated last year
- Official code for the paper "Learning Transition Policies for Composing Complex Skills" (ICLR 2019)☆74Updated 5 years ago
- Code for "Unsupervised Visuomotor Control through Distributional Planning Networks"☆10Updated 5 years ago
- An implementation of the MONet model for unsupervised scene decomposition in PyTorch☆58Updated 2 years ago
- Official Release of NeurIPS 2020 Spotlight paper "Generative Neurosymbolic Machines"☆35Updated 8 months ago
- Using Natural Language for Reward Shaping in Reinforcement Learning☆23Updated 10 months ago
- ☆32Updated 6 years ago
- Entity Abstraction in Visual Model-Based Reinforcement Learning☆55Updated 3 years ago
- EfficientMORL (ICML'21)☆22Updated 3 years ago
- Burgess et al. "MONet: Unsupervised Scene Decomposition and Representation"☆89Updated last year
- SCAN: Learning Abstract Hierarchical Compositional Visual Concepts☆55Updated 7 years ago
- Official PyTorch implementation of "Improving Generative Imagination in Object-Centric World Models"☆34Updated last year
- Implementation of Grounded Language Learning in a 3D Simulated World (DeepMind)☆34Updated 7 years ago
- Solving reinforcement learning tasks which require language and vision☆32Updated last year
- PyTorch implementation of paper "Visual Concept-Metaconcept Learner", NeruIPS 2019☆49Updated 4 years ago
- This is the dataset generation code for ADEPT (Approximate Derenderer, Extended Physics, and Tracking). http://physadept.csail.mit.edu/☆16Updated 2 years ago
- Code for SplitNet paper☆60Updated 4 years ago
- Code for "Tactical Rewind: Self-Correction via Backtracking in Vision-and-Language Navigation"☆62Updated 5 years ago
- Implementation of Random Expert Distillation☆29Updated 5 years ago
- Structured Object-Aware Physics Prediction for Video Modeling and Planning☆32Updated 4 years ago
- Tensorflow models and simulation code for 'ShapeStacks: Learning Vision-Based Physical Intuition for Generalised Object Stacking'☆46Updated last year
- [NeurIPS 2022] Compositional Generalization in Unsupervised Compositional Representation Learning: A Study on Disentanglement and Emergen…☆13Updated 2 years ago
- [ICRA 2019] Propagation Networks for Model-based Control Under Partial Observation☆45Updated 5 years ago
- Cornell Instruction Following Framework☆33Updated 3 years ago