visiontao / evar
Explainable Video Action Reasoning via Prior Knowledge and State Transitions
☆21Updated 8 months ago
Alternatives and similar repositories for evar:
Users that are interested in evar are comparing it to the libraries listed below
- Compositional Learning for Human Object Interaction☆13Updated 4 years ago
- To keep updates with VRU Grand Challenge, please use https://github.com/NExTplusplus/VidVRD-helper☆99Updated 3 years ago
- [CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606)☆67Updated 4 years ago
- This repository contains the main baselines introduced in WSSTG (ACL 2019).☆55Updated 7 months ago
- Code for the paper "Detecting visual relations using analogies", ICCV19☆21Updated 5 years ago
- praneeth11009 / LIGHTEN-Learning-Interactions-with-Graphs-and-Hierarchical-TEmporal-Networks-for-HOI☆16Updated 4 years ago
- The code repository for "Cross-Modal and Hierarchical Modeling of Video and Text" in PyTorch☆20Updated 4 years ago
- Weakly-supervised Action Localization☆49Updated 3 years ago
- The implementation of CVPR2021 paper Temporal Query Networks for Fine-grained Video Understanding☆61Updated 2 years ago
- Implementation of paper "Not All Frames Are Equal: Weakly-Supervised Video Grounding with Contextual Similarity and Visual Clustering Los…☆30Updated 4 years ago
- ☆33Updated 4 years ago
- A weakly-supervised scene graph generation codebase. The implementation of our CVPR2021 paper ``Linguistic Structures as Weak Supervision…☆37Updated 3 years ago
- Visual Relation Grounding in Videos (ECCV'20, Spotlight)☆57Updated 2 years ago
- Video Visual Relation Detection (VidVRD) tracklets generation. also for ACM MM Visual Relation Understanding Grand Challenge☆39Updated 2 years ago
- Code for the CVPR 2020 oral paper: Weakly Supervised Visual Semantic Parsing☆35Updated 2 years ago
- Home Action Genome: Cooperative Contrastive Action Understanding☆20Updated 3 years ago
- ☆34Updated 3 years ago
- AAAI2020-The official implementation of "Learning Cross-modal Context Graph for Visual Grounding"☆57Updated 3 years ago
- A video database bridging human actions and human-object relationships☆137Updated 4 years ago
- Code for the CVPR 2020 paper 'Action Modifiers: Learning from Adverbs in Instructional Videos'☆22Updated 3 years ago
- ☆88Updated 3 years ago
- Code for the Scene Graph Generation part of CVPR 2019 oral paper: "Learning to Compose Dynamic Tree Structures for Visual Contexts"☆122Updated 6 months ago
- Moments Retrieval Project Webpage (temporal)☆31Updated last year
- Source code for "Weakly-Supervised Video Object Grounding from Text by Loss Weighting and Object Interaction"☆44Updated 7 months ago
- Space-Time Interaction Graph Parsing Networks for Human-Object Interaction Recognition,ACM MM'21☆13Updated 2 years ago
- Codebase for "Revisiting spatio-temporal layouts for compositional action recognition" (Oral at BMVC 2021).☆26Updated 2 years ago
- EPIC-KITCHENS-55 baselines for Action Recognition☆75Updated 4 years ago
- [CVPR'19] [PyTorch] Gated Spatio Temporal Energy Graph☆152Updated 5 years ago
- Code for CVPR 19 Paper "Improving Referring Expression Grounding with Cross-modal Attention-guided Erasing"☆33Updated 5 years ago
- Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding☆33Updated 5 years ago