zhengsipeng / VRDFormer_VRD
☆15Updated last year
Alternatives and similar repositories for VRDFormer_VRD:
Users who are interested in VRDFormer_VRD are comparing it to the repositories listed below
- Pytorch implementation of our paper Classification-Then-Grounding: Reformulating Video Scene Graphs as Temporal Bipartite Graphs, which i…☆46Updated last year
- Space-Time Interaction Graph Parsing Networks for Human-Object Interaction Recognition, ACM MM'21☆14Updated 2 years ago
- Weakly Supervised Video Moment Localisation with Contrastive Negative Sample Mining☆26Updated 2 years ago
- CPL: Weakly Supervised Temporal Sentence Grounding with Gaussian-based Contrastive Proposal Learning☆61Updated 11 months ago
- [ICCV 2021] Target Adaptive Context Aggregation for Video Scene Graph Generation☆59Updated 2 years ago
- ECCV2022 Towards Hard-Positive Query Mining for DETR-based Human-Object Interaction Detection☆27Updated last year
- [ECCV 2022] Official Pytorch Implementation of the paper: "Zero-Shot Temporal Action Detection via Vision-Language Prompting"☆100Updated last year
- CVPR2022 Distillation Using Oracle Queries for Transformer-based Human-Object Interaction Detection☆25Updated 2 years ago
- Official pytorch implementation of "What and When to look?: Temporal Span Proposal Network for Video Relation Detection"☆16Updated 3 years ago
- Official pytorch implementation of "Explore-And-Match: Bridging Proposal-Based and Proposal-Free With Transformer for Sentence Grounding …☆42Updated 2 years ago
- [CVPR 2022] Fine-grained Temporal Contrastive Learning for Weakly-supervised Temporal Action Localization☆47Updated last year
- [CVPR 2023] Official Pytorch code for paper "Prototype-based Embedding Network for Scene Graph Generation"☆53Updated last year
- Code for CVPR23 paper: Learning to Generate Language-supervised and Open-vocabulary Scene Graph using Pre-trained Visual-Semantic Space☆35Updated last year
- [AAAI 2022] DCAN: Improving Temporal Action Detection via Dual Context Aggregation☆18Updated 2 years ago
- Code for paper "Stacked Hybrid-Attention and Group Collaborative Learning for Unbiased Scene Graph Generation"☆34Updated 2 years ago
- Code for our CVPR 2022 Paper "GEN-VLKT: Simplify Association and Enhance Interaction Understanding for HOI Detection"☆84Updated 11 months ago
- Official implementation of "ST-HOI: A Spatial-Temporal Baseline for Human-Object Interaction Detection in Videos" (ACM ICMRW 2021)☆50Updated 2 years ago
- Video Visual Relation Detection (VidVRD) tracklet generation; also used for the ACM MM Visual Relation Understanding Grand Challenge☆39Updated 2 years ago
- Code for our paper "Category Query Learning for Human-Object Interaction Classification" (CVPR2023)☆35Updated last year
- Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022☆95Updated 2 years ago
- A simple and effective feature extractor for untrimmed videos☆13Updated 2 years ago
- [ICCV 2023] Official implementation of Memory-and-Anticipation Transformer for Online Action Understanding☆46Updated last year
- Repo for CVPR2021 paper "QPIC: Query-Based Pairwise Human-Object Interaction Detection with Image-Wide Contextual Information"☆137Updated 2 years ago
- [CVPR 2022] An Empirical Study of End-to-end Temporal Action Detection☆83Updated 2 years ago
- [AAAI 2022] Negative Sample Matters: A Renaissance of Metric Learning for Temporal Grounding☆90Updated 2 years ago
- Related work on Temporal Sentence Grounding in Videos / Natural Language Video Localization / Video Moment Retrieval☆28Updated 3 years ago
- [CVPR2022] Bridge-Prompt: Towards Ordinal Action Understanding in Instructional Videos☆95Updated 2 years ago
- "Video Moment Retrieval from Text Queries via Single Frame Annotation" in SIGIR 2022☆68Updated 2 years ago