szzexpoi / AiR
Official Repository for ECCV 2020 paper "AiR: Attention with Reasoning Capability"
☆49 · Updated 3 years ago
Alternatives and similar repositories for AiR:
Users who are interested in AiR are comparing it to the repositories listed below.
- Code for the CVPR 2020 publication "Multi-Modal Domain Adaptation for Fine-Grained Action Recognition" ☆61 · Updated 4 years ago
- Code for the CVPR 2020 paper "Action Modifiers: Learning from Adverbs in Instructional Videos" ☆22 · Updated 3 years ago
- Official code for the paper "Predicting Human Scanpaths in Visual Question Answering" ☆21 · Updated 3 years ago
- Code accompanying "Ego-Exo: Transferring Visual Representations from Third-person to First-person Videos" (CVPR 2021) ☆33 · Updated 3 years ago
- Video Representation Learning by Recognizing Temporal Transformations (ECCV 2020) ☆48 · Updated 3 years ago
- PyTorch code for the NeurIPS 2019 paper "Cross-channel Communication Networks" ☆41 · Updated 5 years ago
- Out-of-Distribution Detection for Generalized Zero-Shot Action Recognition ☆54 · Updated 5 years ago
- [ECCV'20 Spotlight] Memory-augmented Dense Predictive Coding for Video Representation Learning. Tengda Han, Weidi Xie, Andrew Zisserman. ☆164 · Updated 3 years ago
- Code for the CVPR 2019 paper: Self-supervised Spatio-temporal Representation Learning for Videos by Predicting Motion and Appearance Statisti… ☆63 · Updated 3 years ago
- ActorObserverNet code in PyTorch from "Actor and Observer: Joint Modeling of First and Third-Person Videos", CVPR 2018 ☆78 · Updated 5 years ago
- Code for the ECCV 2020 paper: Self-supervised Video Representation Learning by Pace Prediction ☆99 · Updated 3 years ago
- Scene Graph Prediction with Limited Labels ☆54 · Updated last year
- Implementation of the CVPR 2021 paper "Temporal Query Networks for Fine-grained Video Understanding" ☆62 · Updated 2 years ago
- PyTorch Implementation for "Asymmetric Cross-Guided Attention Network for Actor and Action Video Segmentation From Natural Language Quer… ☆20 · Updated 4 years ago
- Rank-aware Attention Network from "The Pros and Cons: Rank-aware Temporal Attention for Skill Determination in Long Videos" ☆28 · Updated 3 years ago
- Attribute-Object Visual Composition using Attributes as Operators ☆65 · Updated 2 years ago
- Code for Weakly Supervised Energy-Based Learning for Action Segmentation (ICCV 2019 Oral) ☆64 · Updated 3 years ago
- Online Meta Adaptation for Fast Video Object Segmentation ☆24 · Updated 6 years ago
- Video Noise Contrastive Estimation ☆66 · Updated last year
- Official implementation of Elaborative Rehearsal for Zero-shot Action Recognition (ICCV 2021) ☆36 · Updated 2 years ago
- Inferring and Executing Programs for Visual Reasoning ☆21 · Updated 6 years ago
- AdaFuse: Adaptive Temporal Fusion Network for Efficient Action Recognition (ICLR 2021) ☆33 · Updated 3 years ago
- Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding ☆33 · Updated 5 years ago
- RareAct: A video dataset of unusual interactions ☆32 · Updated 4 years ago
- Code for the CVPR 2019 paper "Improving Referring Expression Grounding with Cross-modal Attention-guided Erasing" ☆33 · Updated 5 years ago
- Official implementation of the ICCV 2019 oral paper "Zero-Shot Grounding of Objects from Natural Language Queries" (https://arxiv.org/abs/1908.071… ☆69 · Updated 4 years ago