visiontao / evar
Explainable Video Action Reasoning via Prior Knowledge and State Transitions
☆21Updated 10 months ago
Alternatives and similar repositories for evar
Users that are interested in evar are comparing it to the libraries listed below
Sorting:
- Compositional Learning for Human Object Interaction☆13Updated 4 years ago
- Code for the paper "Detecting visual relations using analogies", ICCV19☆21Updated 5 years ago
- praneeth11009 / LIGHTEN-Learning-Interactions-with-Graphs-and-Hierarchical-TEmporal-Networks-for-HOI☆16Updated 4 years ago
- [CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606)☆67Updated 4 years ago
- To keep updates with VRU Grand Challenge, please use https://github.com/NExTplusplus/VidVRD-helper☆100Updated 3 years ago
- Weakly-supervised Action Localization☆49Updated 4 years ago
- The implementation of CVPR2021 paper Temporal Query Networks for Fine-grained Video Understanding☆62Updated 3 years ago
- Code for the CVPR 2020 oral paper: Weakly Supervised Visual Semantic Parsing☆35Updated 2 years ago
- Code repository for the paper: 'Something-Else: Compositional Action Recognition with Spatial-Temporal Interaction Networks'☆146Updated last year
- Codebase for "Revisiting spatio-temporal layouts for compositional action recognition" (Oral at BMVC 2021).☆26Updated 3 years ago
- The code repository for "Cross-Modal and Hierarchical Modeling of Video and Text" in PyTorch☆20Updated 5 years ago
- This repository contains the main baselines introduced in WSSTG (ACL 2019).☆55Updated 10 months ago
- Implementation of paper "Not All Frames Are Equal: Weakly-Supervised Video Grounding with Contextual Similarity and Visual Clustering Los…☆30Updated 4 years ago
- Code for the CVPR 2020 paper 'Action Modifiers: Learning from Adverbs in Instructional Videos'☆22Updated 3 years ago
- Code for CVPR 19 Paper "Improving Referring Expression Grounding with Cross-modal Attention-guided Erasing"☆34Updated 5 years ago
- Space-Time Interaction Graph Parsing Networks for Human-Object Interaction Recognition,ACM MM'21☆14Updated 3 years ago
- Code accompanying Ego-Exo: Transferring Visual Representations from Third-person to First-person Videos (CVPR 2021)☆33Updated 3 years ago
- This is the official repo for "MAN: Moment Alignment Network for Natural Language Moment Retrieval via Iterative Graph Adjustment"☆17Updated 5 years ago
- Video Visual Relation Detection (VidVRD) tracklets generation. also for ACM MM Visual Relation Understanding Grand Challenge☆39Updated 2 years ago
- Code for Weakly Supervised Energy-Based Learning for Action Segmentation (ICCV 2019 Oral)☆64Updated 3 years ago
- A weakly-supervised scene graph generation codebase. The implementation of our CVPR2021 paper ``Linguistic Structures as Weak Supervision…☆37Updated 4 years ago
- Visual Relation Grounding in Videos (ECCV'20, Spotlight)☆57Updated 2 years ago
- Weakly-Supervised Action Segmentation with Iterative Soft Boundary Assignment (CVPR 2018)☆41Updated 7 years ago
- Home Action Genome: Cooperative Contrastive Action Understanding☆20Updated 3 years ago
- ☆30Updated 6 years ago
- ☆34Updated 4 years ago
- Dense Regression Network for Video Grounding (CVPR2020)☆50Updated 4 years ago
- ActorObserverNet code in PyTorch from "Actor and Observer: Joint Modeling of First and Third-Person Videos", CVPR 2018☆78Updated 6 years ago
- Moments Retrieval Project Webpage (temporal)☆31Updated last year
- [ICCV 2021] Target Adaptive Context Aggregation for Video Scene Graph Generation☆58Updated 2 years ago