visiontao / evar
Explainable Video Action Reasoning via Prior Knowledge and State Transitions
☆21Updated 9 months ago
Alternatives and similar repositories for evar:
Users that are interested in evar are comparing it to the libraries listed below
- Compositional Learning for Human Object Interaction☆13Updated 4 years ago
- Code for the paper "Detecting visual relations using analogies", ICCV19☆21Updated 5 years ago
- To keep updates with VRU Grand Challenge, please use https://github.com/NExTplusplus/VidVRD-helper☆100Updated 3 years ago
- The implementation of CVPR2021 paper Temporal Query Networks for Fine-grained Video Understanding☆61Updated 3 years ago
- Code for the CVPR 2020 oral paper: Weakly Supervised Visual Semantic Parsing☆35Updated 2 years ago
- The code repository for "Cross-Modal and Hierarchical Modeling of Video and Text" in PyTorch☆20Updated 4 years ago
- Codebase for "Revisiting spatio-temporal layouts for compositional action recognition" (Oral at BMVC 2021).☆26Updated 2 years ago
- praneeth11009 / LIGHTEN-Learning-Interactions-with-Graphs-and-Hierarchical-TEmporal-Networks-for-HOI☆16Updated 4 years ago
- [CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606)☆67Updated 4 years ago
- Space-Time Interaction Graph Parsing Networks for Human-Object Interaction Recognition,ACM MM'21☆14Updated 2 years ago
- A weakly-supervised scene graph generation codebase. The implementation of our CVPR2021 paper ``Linguistic Structures as Weak Supervision…☆37Updated 3 years ago
- Code for our paper "Attention-Translation-Relation Network for Scalable Scene Graph Generation", SGRL - ICCV 2019☆15Updated 5 years ago
- [ICCV 2021] Target Adaptive Context Aggregation for Video Scene Graph Generation☆58Updated 2 years ago
- A video database bridging human actions and human-object relationships☆138Updated 4 years ago
- Home Action Genome: Cooperative Contrastive Action Understanding☆20Updated 3 years ago
- Code for the CVPR 2020 paper 'Action Modifiers: Learning from Adverbs in Instructional Videos'☆22Updated 3 years ago
- Visual Relation Grounding in Videos (ECCV'20, Spotlight)☆57Updated 2 years ago
- This repository contains the main baselines introduced in WSSTG (ACL 2019).☆55Updated 8 months ago
- Code repository for the paper: 'Something-Else: Compositional Action Recognition with Spatial-Temporal Interaction Networks'☆146Updated last year
- ☆34Updated 5 years ago
- Implementation of paper "Not All Frames Are Equal: Weakly-Supervised Video Grounding with Contextual Similarity and Visual Clustering Los…☆30Updated 4 years ago
- EPIC-KITCHENS-55 baselines for Action Recognition☆75Updated 4 years ago
- Video Visual Relation Detection (VidVRD) tracklets generation. also for ACM MM Visual Relation Understanding Grand Challenge☆39Updated 2 years ago
- ActorObserverNet code in PyTorch from "Actor and Observer: Joint Modeling of First and Third-Person Videos", CVPR 2018☆78Updated 6 years ago
- Video Visual Relation Detection via Iterative Inference (ACM MM 2021)☆5Updated 3 years ago
- Code for reproducing the results in "Learning to Detect Human-Object Interactions"☆66Updated 9 months ago
- [CVPR'19] [PyTorch] Gated Spatio Temporal Energy Graph☆152Updated 5 years ago
- Code for Weakly Supervised Energy-Based Learning for Action Segmentation (ICCV 2019 Oral)☆64Updated 3 years ago
- This is the official repo for "MAN: Moment Alignment Network for Natural Language Moment Retrieval via Iterative Graph Adjustment"☆17Updated 5 years ago
- Source code for "Weakly-Supervised Video Object Grounding from Text by Loss Weighting and Object Interaction"☆44Updated 9 months ago