rohitgirdhar / CATER
CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning
☆103 Updated 3 years ago
Related projects
Alternatives and complementary repositories for CATER
- Learning Long-term Visual Dynamics with Region Proposal Interaction Networks (ICLR 2021) ☆112 Updated 2 years ago
- Burgess et al. "MONet: Unsupervised Scene Decomposition and Representation" ☆89 Updated last year
- An implementation of the MONet model for unsupervised scene decomposition in PyTorch ☆58 Updated 2 years ago
- Official PyTorch implementation of GENESIS and GENESIS-V2 ☆97 Updated 2 years ago
- Video Noise Contrastive Estimation ☆65 Updated last year
- Code for Learning to Learn Language from Narrated Video ☆33 Updated last year
- Video Representation Learning by Dense Predictive Coding. Tengda Han, Weidi Xie, Andrew Zisserman. ☆251 Updated 3 years ago
- Attribute-Object Visual Composition using Attributes as Operators ☆64 Updated last year
- Official PyTorch implementation of "Improving Generative Imagination in Object-Centric World Models" ☆34 Updated last year
- RareAct: A video dataset of unusual interactions ☆32 Updated 4 years ago
- A set of neural network modules, which are small fully connected layers operating in semantic concept space. These modules are configured… ☆58 Updated 3 years ago
- Official PyTorch implementation of "SPACE: Unsupervised Object-Oriented Scene Representation via Spatial Attention and Decomposition" ☆101 Updated last year
- EfficientMORL (ICML'21) ☆22 Updated 3 years ago
- [ECCV'20 Spotlight] Memory-augmented Dense Predictive Coding for Video Representation Learning. Tengda Han, Weidi Xie, Andrew Zisserman. ☆164 Updated 3 years ago
- Official code for NeurIPS 2020 paper "Rel3D: A Minimally Contrastive Benchmark for Grounding Spatial Relations in 3D" ☆26 Updated last year
- Code, data and benchmark from the paper "Unmasking the Inductive Biases of Unsupervised Object Representations for Video Sequences". ☆36 Updated 3 years ago
- Like Moving MNIST, but way more flexible ☆24 Updated 4 years ago
- ActorObserverNet code in PyTorch from "Actor and Observer: Joint Modeling of First and Third-Person Videos", CVPR 2018 ☆76 Updated 5 years ago
- Learning Spatial Common Sense with Geometry-Aware Recurrent Networks ☆55 Updated 4 years ago
- PyTorch re-implementation of Multi-Object Representation Learning with Iterative Variational Inference ☆58 Updated 2 years ago
- Starter kit for working with the EPIC-KITCHENS-55 dataset for action recognition or anticipation ☆43 Updated 4 years ago
- PyTorch code for ICLR 2019 paper: Self-Monitoring Navigation Agent via Auxiliary Progress Estimation ☆118 Updated last year
- Code accompanying EGO-TOPO: Environment Affordances from Egocentric Video (CVPR 2020) ☆29 Updated 2 years ago
- Neural Networks that convert input movies into Physical Scene Graphs (PSGs) ☆62 Updated 3 years ago
- Library for the training and evaluation of object-centric models (ICML 2022) ☆65 Updated last year
- PyTorch code for CVPR 2019 paper: The Regretful Agent: Heuristic-Aided Navigation through Progress Estimation ☆124 Updated last year