zhouhaocv / RLM-Net
☆14Updated 4 years ago
Alternatives and similar repositories for RLM-Net:
Users that are interested in RLM-Net are comparing it to the libraries listed below
- ☆65Updated 4 years ago
- MFURLN relationship detection method☆21Updated 4 years ago
- To keep updates with VRU Grand Challenge, please use https://github.com/NExTplusplus/VidVRD-helper☆99Updated 3 years ago
- ECCV2020: Visual Compositional Learning for Human-Object Interaction Detection☆32Updated 3 years ago
- Video Visual Relation Detection (VidVRD) tracklets generation. also for ACM MM Visual Relation Understanding Grand Challenge☆39Updated 2 years ago
- Code for CVPR 19 Paper "Improving Referring Expression Grounding with Cross-modal Attention-guided Erasing"☆33Updated 5 years ago
- This repository contains the main baselines introduced in WSSTG (ACL 2019).☆55Updated 7 months ago
- Pytorch Implementation of Videos as Space-Time Region Graphs☆26Updated 6 months ago
- Code for CVPR'21 paper "Weakly Supervised Action Selection Learning in Video"☆22Updated 3 years ago
- The official PyTorch code for "Relation-aware Instance Refinement for Weakly Supervised Visual Grounding" accepted by CVPR2021☆26Updated 3 years ago
- [CVPR 2022 Oral] Towards Open Set Temporal Action Localization☆49Updated last year
- A Pytorch implemention for some state-of-the-art models for" Temporally Language Grounding in Untrimmed Videos"☆96Updated 5 years ago
- ☆15Updated 2 years ago
- Official code for "Detecting Human-Object Interactions with Action Co-occurrence Priors☆33Updated 3 years ago
- ☆40Updated 2 years ago
- Implementation for Bottom-Up Temporal Action Localization with Mutual Regularization (ECCV2020)☆47Updated 4 years ago
- Adaptive Offline Quintuplet Loss for Image-Text Matching (AOQ)☆34Updated 4 years ago
- Source code of our TCSVT 2020 paper "Multi-level Knowledge Injecting for Visual Commonsense Reasoning"☆11Updated 5 months ago
- A weakly-supervised scene graph generation codebase. The implementation of our CVPR2021 paper ``Linguistic Structures as Weak Supervision…☆37Updated 3 years ago
- AAAI2020-The official implementation of "Learning Cross-modal Context Graph for Visual Grounding"☆57Updated 3 years ago
- Code for the paper: Semantic Conditioned Dynamic Modulation for Temporal Sentence Grounding in Videos☆69Updated 3 years ago
- A novel Participation-Contributed Temporal Dynamic Model for Group Activity Recognition☆25Updated 4 years ago
- [ECCV 2020] DRG: Dual Relation Graph for Human-Object Interaction Detection☆66Updated 2 years ago
- ECCV2020 Polysemy Deciphering Network for Human-Object Interaction Detection☆19Updated 3 years ago
- Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding☆33Updated 5 years ago
- ViTAA: Visual-Textual Attributes Alignment in Person Search by Natural Language☆36Updated 4 years ago
- A strong HOI Detection model without Frills!☆59Updated 5 years ago
- STPN - Weakly Supervised Action Localization by Sparse Temporal Pooling Network☆82Updated 6 years ago
- ☆16Updated 4 years ago
- [CVPR2020] Multi-task Collaborative Network for Joint Referring Expression Comprehension and Segmentation, CVPR2020 (oral)☆137Updated 2 years ago