rohitgirdhar / CATERLinks
CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning
☆104Updated 4 years ago
Alternatives and similar repositories for CATER
Users that are interested in CATER are comparing it to the libraries listed below
Sorting:
- Learning Long-term Visual Dynamics with Region Proposal Interaction Networks (ICLR 2021)☆113Updated 3 years ago
- ☆91Updated 3 years ago
- An implementation of the MONet model for unsupervised scene decomposition in PyTorch☆58Updated 3 years ago
- Attribute-Object Visual Composition using Attributes as Operators☆65Updated 2 years ago
- Burgess et al. "MONet: Unsupervised Scene Decomposition and Representation"☆88Updated 2 years ago
- Code for Learning to Learn Language from Narrated Video☆33Updated last year
- Video Representation Learning by Dense Predictive Coding. Tengda Han, Weidi Xie, Andrew Zisserman.☆251Updated 3 years ago
- Video Noise Contrastive Estimation☆66Updated last year
- Official code for NeurRIPS 2020 paper "Rel3D: A Minimally Contrastive Benchmark for Grounding Spatial Relations in 3D"☆29Updated 5 months ago
- A set of neural network modules, which are small fully connected layers operating in semantic concept space. These modules are configured…☆58Updated 3 years ago
- Official PyTorch implementation of "Improving Generative Imagination in Object-Centric World Models"☆35Updated 2 years ago
- RareAct: A video dataset of unusual interactions☆32Updated 4 years ago
- [ECCV'20 Spotlight] Memory-augmented Dense Predictive Coding for Video Representation Learning. Tengda Han, Weidi Xie, Andrew Zisserman.☆165Updated 4 years ago
- Learning Spatial Common Sense with Geometry-Aware Recurrent Networks☆55Updated 5 years ago
- EfficientMORL (ICML'21)☆22Updated 3 years ago
- Starter kit for working with the EPIC-KITCHENS-55 dataset for action recognition or anticipation☆43Updated 4 years ago
- Code, data and benchmark from the paper "Unmasking the Inductive Biases of Unsupervised Object Representations for Video Sequences".☆36Updated 3 years ago
- Neural Networks that convert input movies into Physical Scene Graphs (PSGs)☆63Updated 4 years ago
- Code for the CVPR 2020 paper 'Action Modifiers: Learning from Adverbs in Instructional Videos'☆22Updated 4 years ago
- ActorObserverNet code in PyTorch from "Actor and Observer: Joint Modeling of First and Third-Person Videos", CVPR 2018☆79Updated 6 years ago
- ☆41Updated last year
- Repository for "Space-Time Correspondence as a Contrastive Random Walk" (NeurIPS 2020)☆271Updated 3 years ago
- Like Moving MNIST, but way more flexible☆24Updated 4 years ago
- PyTorch re-implementation of Multi-Object Representation Learning with Iterative Variational Inference☆59Updated 2 years ago
- This repo contains the pytorch implementation for Dynamic Concept Learner (accepted by ICLR 2021).☆37Updated 10 months ago
- PyTorch implementation of paper "Visual Concept-Metaconcept Learner", NeruIPS 2019☆49Updated 5 years ago
- Tensorflow implementation for "Local Aggregation for Unsupervised Learning of Visual Embeddings"☆59Updated 4 years ago
- Official implementation of ICCV19 oral paper Zero-Shot grounding of Objects from Natural Language Queries (https://arxiv.org/abs/1908.071…☆71Updated 5 years ago
- Charades Object Detection Dataset (ICCV 2017)☆31Updated 7 years ago
- [ICLR 2019] ]Unsupervised Discovery of Parts, Structure, and Dynamics☆46Updated 2 years ago