CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning
☆108Dec 18, 2020Updated 5 years ago
Alternatives and similar repositories for CATER
Users that are interested in CATER are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Repository of NeurIPS2021 paper: PTR☆32Dec 17, 2021Updated 4 years ago
- DSTC8-AVSD: Sentence generation task for Audio Visual Scene-aware Dialog☆14Jun 10, 2021Updated 4 years ago
- Code for our paper: *Shamsian, *Kleinfeld, Globerson & Chechik, "Learning Object Permanence from Video"☆68Nov 20, 2024Updated last year
- Long-Term Feature Banks for Detailed Video Understanding☆384Aug 30, 2021Updated 4 years ago
- A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning☆645Aug 30, 2021Updated 4 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Learning Spatial Common Sense with Geometry-Aware Recurrent Networks☆56Dec 16, 2019Updated 6 years ago
- Learning Long-term Visual Dynamics with Region Proposal Interaction Networks (ICLR 2021)☆113May 29, 2022Updated 3 years ago
- PyTorch implementation of paper "Visual Concept-Metaconcept Learner", NeruIPS 2019☆47Dec 3, 2019Updated 6 years ago
- An implementation of the MONet model for unsupervised scene decomposition in PyTorch☆59May 16, 2022Updated 3 years ago
- Multi-object image datasets with ground-truth segmentation masks and generative factors.☆284Mar 3, 2026Updated 3 weeks ago
- A repository of common methods, datasets, and tasks for video research☆538Jun 17, 2019Updated 6 years ago
- Code repository for the paper: 'Something-Else: Compositional Action Recognition with Spatial-Temporal Interaction Networks'☆148Aug 25, 2023Updated 2 years ago
- MetaPix: Few-Shot Video Retargeting☆47Dec 22, 2019Updated 6 years ago
- ☆42Jan 22, 2024Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Compositional Video Prediction (ICCV19)☆62May 10, 2020Updated 5 years ago
- PyTorch implementation of ICLR 2020 paper "CLEVRER: CoLlision Events for Video REpresentation and Reasoning"☆128Nov 6, 2020Updated 5 years ago
- Official PyTorch implementation of GENESIS and GENESIS-V2☆109Apr 13, 2022Updated 3 years ago
- Official PyTorch implementation of "Improving Generative Imagination in Object-Centric World Models"☆37Dec 8, 2022Updated 3 years ago
- Joint-task Self-supervised Learning for Temporal Correspondence (NeurIPS 2019)☆177Mar 12, 2023Updated 3 years ago
- Action Graphs: Weakly-supervised Action Localization with Graph Convolution Networks☆17Dec 8, 2022Updated 3 years ago
- Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation☆14Nov 16, 2020Updated 5 years ago
- DynaVol: Unsupervised Learning for Dynamic Scenes through Object-Centric Voxelization (ICLR2024) & DynaVol-S: Dynamic Scene Understanding…☆21Apr 10, 2025Updated 11 months ago
- Dataset and models for paper "Game-Based Video-Context Dialogue (EMNLP 2018)"☆19Oct 25, 2018Updated 7 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Official Release of ICLR 2020 paper "SCALOR: Generative World Models with Scalable Object Representations"☆49Dec 24, 2023Updated 2 years ago
- ☆129Jun 27, 2021Updated 4 years ago
- Learning Correspondence from the Cycle-consistency of Time (CVPR 2019)☆724Jun 26, 2019Updated 6 years ago
- ☆79Apr 17, 2025Updated 11 months ago
- Video Representation Learning by Dense Predictive Coding. Tengda Han, Weidi Xie, Andrew Zisserman.☆254Oct 8, 2021Updated 4 years ago
- ☆14Dec 11, 2018Updated 7 years ago
- RareAct: A video dataset of unusual interactions☆34Aug 4, 2020Updated 5 years ago
- Official PyTorch implementation of "SPACE: Unsupervised Object-Oriented Scene Representation via Spatial Attention and Decomposition"☆104Oct 3, 2023Updated 2 years ago
- Scaling and Benchmarking Self-Supervised Visual Representation Learning☆587Oct 12, 2021Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆35Mar 25, 2020Updated 6 years ago
- "CoPhy: Counterfactual Learning of Physical Dynamics", F. Baradel, N. Neverova, J. Mille, G. Mori, C. Wolf, ICLR'2020☆35Apr 28, 2020Updated 5 years ago
- PyTorch re-implementation of Multi-Object Representation Learning with Iterative Variational Inference☆59Sep 3, 2022Updated 3 years ago
- [CVPR 2020] Temporal Pyramid Network for Action Recognition☆392Jan 12, 2021Updated 5 years ago
- Burgess et al. "MONet: Unsupervised Scene Decomposition and Representation"☆89Dec 3, 2022Updated 3 years ago
- Diagnostic tools and additional visualizations from "What Actions are Needed for Understanding Human Actions in Videos?" ICCV 2017☆88Dec 19, 2017Updated 8 years ago
- Official code for Slot-Transformer for Videos (STEVE)☆51Jan 9, 2023Updated 3 years ago