visiontao / evar
Explainable Video Action Reasoning via Prior Knowledge and State Transitions
☆21Updated 10 months ago
Alternatives and similar repositories for evar:
Users who are interested in evar are comparing it to the libraries listed below
- Compositional Learning for Human Object Interaction☆13Updated 4 years ago
- The code repository for "Cross-Modal and Hierarchical Modeling of Video and Text" in PyTorch☆20Updated 4 years ago
- To keep up to date with the VRU Grand Challenge, please use https://github.com/NExTplusplus/VidVRD-helper☆100Updated 3 years ago
- This repository contains the main baselines introduced in WSSTG (ACL 2019).☆55Updated 9 months ago
- [CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606)☆67Updated 4 years ago
- The implementation of CVPR2021 paper Temporal Query Networks for Fine-grained Video Understanding☆62Updated 3 years ago
- Code for the paper "Detecting visual relations using analogies", ICCV19☆21Updated 5 years ago
- Moments Retrieval Project Webpage (temporal)☆31Updated last year
- praneeth11009 / LIGHTEN-Learning-Interactions-with-Graphs-and-Hierarchical-TEmporal-Networks-for-HOI☆16Updated 4 years ago
- This is the official repo for "MAN: Moment Alignment Network for Natural Language Moment Retrieval via Iterative Graph Adjustment"☆17Updated 5 years ago
- ☆34Updated 4 years ago
- Weakly-supervised Action Localization☆49Updated 3 years ago
- Source code for "Weakly-Supervised Video Object Grounding from Text by Loss Weighting and Object Interaction"☆45Updated 9 months ago
- Codebase for "Revisiting spatio-temporal layouts for compositional action recognition" (Oral at BMVC 2021).☆26Updated 3 years ago
- Code for the CVPR 2020 paper 'Action Modifiers: Learning from Adverbs in Instructional Videos'☆22Updated 3 years ago
- Implementation of paper "Not All Frames Are Equal: Weakly-Supervised Video Grounding with Contextual Similarity and Visual Clustering Los…☆30Updated 4 years ago
- ☆34Updated 5 years ago
- Visual Relation Grounding in Videos (ECCV'20, Spotlight)☆57Updated 2 years ago
- ☆30Updated 6 years ago
- Adversarial Background-Aware Loss for Weakly-supervised Temporal Activity Localization (ECCV 2020)☆47Updated last year
- Evaluation measures for the EPIC-KITCHENS-100 Action Detection challenge☆16Updated last year
- STPN - Weakly Supervised Action Localization by Sparse Temporal Pooling Network☆82Updated 6 years ago
- A video database bridging human actions and human-object relationships☆142Updated 4 years ago
- Code for our paper "Attention-Translation-Relation Network for Scalable Scene Graph Generation", SGRL - ICCV 2019☆15Updated 5 years ago
- Code for Weakly Supervised Energy-Based Learning for Action Segmentation (ICCV 2019 Oral)☆64Updated 3 years ago
- Weakly Supervised Temporal Action Localization Using Deep Metric Learning☆28Updated 5 years ago
- Code for reproducing the results in "Learning to Detect Human-Object Interactions"☆66Updated 10 months ago
- Implementation for Bottom-Up Temporal Action Localization with Mutual Regularization (ECCV2020)☆47Updated 4 years ago
- Weakly Supervised Dense Event Captioning in Videos, i.e. generating multiple sentence descriptions for a video in a weakly-supervised man…☆104Updated 5 years ago
- Dense Regression Network for Video Grounding (CVPR2020)☆50Updated 4 years ago