visiontao / evarLinks
Explainable Video Action Reasoning via Prior Knowledge and State Transitions
☆21Updated last year
Alternatives and similar repositories for evar
Users that are interested in evar are comparing it to the libraries listed below
Sorting:
- To keep updates with VRU Grand Challenge, please use https://github.com/NExTplusplus/VidVRD-helper☆100Updated 3 years ago
- Compositional Learning for Human Object Interaction☆13Updated 4 years ago
- A video database bridging human actions and human-object relationships☆146Updated 5 years ago
- Code for the paper "Detecting visual relations using analogies", ICCV19☆21Updated 5 years ago
- praneeth11009 / LIGHTEN-Learning-Interactions-with-Graphs-and-Hierarchical-TEmporal-Networks-for-HOI☆16Updated 4 years ago
- Code repository for the paper: 'Something-Else: Compositional Action Recognition with Spatial-Temporal Interaction Networks'☆147Updated last year
- The code repository for "Cross-Modal and Hierarchical Modeling of Video and Text" in PyTorch☆20Updated 5 years ago
- This repository contains the main baselines introduced in WSSTG (ACL 2019).☆55Updated last year
- Codebase for "Revisiting spatio-temporal layouts for compositional action recognition" (Oral at BMVC 2021).☆25Updated 3 years ago
- [CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606)☆68Updated 5 years ago
- The implementation of CVPR2021 paper Temporal Query Networks for Fine-grained Video Understanding☆62Updated 3 years ago
- [CVPR'19] [PyTorch] Gated Spatio Temporal Energy Graph☆152Updated 5 years ago
- Moments Retrieval Project Webpage (temporal)☆31Updated last year
- Code for the Paper: Antonino Furnari and Giovanni Maria Farinella. What Would You Expect? Anticipating Egocentric Actions with Rolling-Un…☆132Updated last year
- Weakly Supervised Dense Event Captioning in Videos, i.e. generating multiple sentence descriptions for a video in a weakly-supervised man…☆104Updated 5 years ago
- ☆34Updated 4 years ago
- ActorObserverNet code in PyTorch from "Actor and Observer: Joint Modeling of First and Third-Person Videos", CVPR 2018☆82Updated 6 years ago
- Code for the CVPR 2020 paper 'Action Modifiers: Learning from Adverbs in Instructional Videos'☆22Updated 4 years ago
- ☆91Updated 3 years ago
- [ICCV 2021] Target Adaptive Context Aggregation for Video Scene Graph Generation☆58Updated 2 years ago
- Source code for "Weakly-Supervised Video Object Grounding from Text by Loss Weighting and Object Interaction"