rohitgirdhar / CATER
CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning
☆105Updated 4 years ago
Alternatives and similar repositories for CATER
Users who are interested in CATER are comparing it to the repositories listed below
- Learning Long-term Visual Dynamics with Region Proposal Interaction Networks (ICLR 2021)☆113Updated 3 years ago
- ☆91Updated 3 years ago
- An implementation of the MONet model for unsupervised scene decomposition in PyTorch☆58Updated 3 years ago
- Video Noise Contrastive Estimation☆66Updated last year
- Attribute-Object Visual Composition using Attributes as Operators☆65Updated 2 years ago
- Video Representation Learning by Dense Predictive Coding. Tengda Han, Weidi Xie, Andrew Zisserman.☆251Updated 3 years ago
- Burgess et al. "MONet: Unsupervised Scene Decomposition and Representation"☆88Updated 2 years ago
- [ECCV'20 Spotlight] Memory-augmented Dense Predictive Coding for Video Representation Learning. Tengda Han, Weidi Xie, Andrew Zisserman.☆165Updated 4 years ago
- Code, data and benchmark from the paper "Unmasking the Inductive Biases of Unsupervised Object Representations for Video Sequences".☆36Updated 3 years ago
- Code for Learning to Learn Language from Narrated Video☆33Updated last year
- Repository for "Space-Time Correspondence as a Contrastive Random Walk" (NeurIPS 2020)☆271Updated 3 years ago
- ☆41Updated last year
- ActorObserverNet code in PyTorch from "Actor and Observer: Joint Modeling of First and Third-Person Videos", CVPR 2018☆81Updated 6 years ago
- RareAct: A video dataset of unusual interactions☆32Updated 4 years ago
- EfficientMORL (ICML'21)☆21Updated 3 years ago
- A set of neural network modules, which are small fully connected layers operating in semantic concept space. These modules are configured…☆58Updated 3 years ago
- PyTorch re-implementation of Multi-Object Representation Learning with Iterative Variational Inference☆59Updated 2 years ago
- Official code for NeurIPS 2020 paper "Rel3D: A Minimally Contrastive Benchmark for Grounding Spatial Relations in 3D"☆30Updated 6 months ago
- Code for the CVPR 2020 paper 'Action Modifiers: Learning from Adverbs in Instructional Videos'☆22Updated 4 years ago
- Starter kit for working with the EPIC-KITCHENS-55 dataset for action recognition or anticipation☆43Updated 5 years ago
- Neural Networks that convert input movies into Physical Scene Graphs (PSGs)☆63Updated 4 years ago
- Learning Spatial Common Sense with Geometry-Aware Recurrent Networks☆55Updated 5 years ago
- Official PyTorch implementation of "Improving Generative Imagination in Object-Centric World Models"☆37Updated 2 years ago
- Official PyTorch implementation of GENESIS and GENESIS-V2☆110Updated 3 years ago
- Tensorflow implementation for "Local Aggregation for Unsupervised Learning of Visual Embeddings"☆59Updated 4 years ago
- Library for the training and evaluation of object-centric models (ICML 2022)☆68Updated 2 years ago
- Official PyTorch implementation of "SPACE: Unsupervised Object-Oriented Scene Representation via Spatial Attention and Decomposition"☆104Updated last year
- Learning to Decompose and Disentangle Representations for Video Prediction, NIPS 2018☆135Updated 4 years ago
- Scene Graph Prediction with Limited Labels☆55Updated last year
- PyTorch implementation of the paper "Visual Concept-Metaconcept Learner", NeurIPS 2019☆49Updated 5 years ago