rohitgirdhar / CATER
CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning
☆103Updated 4 years ago
Alternatives and similar repositories for CATER:
Users that are interested in CATER are comparing it to the libraries listed below
- Learning Long-term Visual Dynamics with Region Proposal Interaction Networks (ICLR 2021)☆112Updated 2 years ago
- ☆89Updated 3 years ago
- An implementation of the MONet model for unsupervised scene decomposition in PyTorch☆58Updated 2 years ago
- Burgess et al. "MONet: Unsupervised Scene Decomposition and Representation"☆88Updated 2 years ago
- Attribute-Object Visual Composition using Attributes as Operators☆65Updated 2 years ago
- Video Noise Contrastive Estimation☆66Updated last year
- Code for Learning to Learn Language from Narrated Video☆33Updated last year
- Official PyTorch implementation of "Improving Generative Imagination in Object-Centric World Models"☆35Updated 2 years ago
- Video Representation Learning by Dense Predictive Coding. Tengda Han, Weidi Xie, Andrew Zisserman.☆251Updated 3 years ago
- ☆41Updated last year
- Starter kit for working with the EPIC-KITCHENS-55 dataset for action recognition or anticipation☆43Updated 4 years ago
- This repo contains the pytorch implementation for Dynamic Concept Learner (accepted by ICLR 2021).☆37Updated 7 months ago
- RareAct: A video dataset of unusual interactions☆32Updated 4 years ago
- Official code for NeurRIPS 2020 paper "Rel3D: A Minimally Contrastive Benchmark for Grounding Spatial Relations in 3D"☆27Updated 2 months ago
- [ECCV'20 Spotlight] Memory-augmented Dense Predictive Coding for Video Representation Learning. Tengda Han, Weidi Xie, Andrew Zisserman.☆164Updated 3 years ago
- PyTorch re-implementation of Multi-Object Representation Learning with Iterative Variational Inference☆59Updated 2 years ago
- PyTorch implementation of paper "Visual Concept-Metaconcept Learner", NeruIPS 2019☆48Updated 5 years ago
- Code, data and benchmark from the paper "Unmasking the Inductive Biases of Unsupervised Object Representations for Video Sequences".☆36Updated 3 years ago
- Self-supervised learning through the eyes of a child☆139Updated 3 years ago
- Learning Spatial Common Sense with Geometry-Aware Recurrent Networks☆55Updated 5 years ago
- Official PyTorch implementation of "SPACE: Unsupervised Object-Oriented Scene Representation via Spatial Attention and Decomposition"☆102Updated last year
- ☆38Updated 2 years ago
- PyTorch implementation of ICLR 2020 paper "CLEVRER: CoLlision Events for Video REpresentation and Reasoning"☆115Updated 4 years ago
- Learning to Decompose and Disentangle Representations for Video Prediction, NIPS 2018☆135Updated 3 years ago
- Tensorflow implementation for "Local Aggregation for Unsupervised Learning of Visual Embeddings"☆58Updated 4 years ago
- A set of neural network modules, which are small fully connected layers operating in semantic concept space. These modules are configured…☆58Updated 3 years ago
- EfficientMORL (ICML'21)☆22Updated 3 years ago
- Code for the CVPR 2020 paper 'Action Modifiers: Learning from Adverbs in Instructional Videos'☆22Updated 3 years ago
- Code accompanying EGO-TOPO: Environment Affordances from Egocentric Video (CVPR 2020)☆29Updated 2 years ago
- Library for the training and evaluation of object-centric models (ICML 2022)☆68Updated last year