zhengsipeng / VRDFormer_VRDLinks
☆16Updated 2 years ago
Alternatives and similar repositories for VRDFormer_VRD
Users that are interested in VRDFormer_VRD are comparing it to the libraries listed below
Sorting:
- Space-Time Interaction Graph Parsing Networks for Human-Object Interaction Recognition,ACM MM'21☆14Updated 3 years ago
- CVPR2022 Distillation Using Oracle Queries for Transformer-based Human-Object Interaction Detection☆24Updated 3 years ago
- [ICCV 2021] Target Adaptive Context Aggregation for Video Scene Graph Generation☆58Updated 3 years ago
- [ECCV 2022] Official Pytorch Implementation of the paper : " Zero-Shot Temporal Action Detection via Vision-Language Prompting "☆110Updated 2 years ago
- Code for our CVPR 2022 Paper "GEN-VLKT: Simplify Association and Enhance Interaction Understanding for HOI Detection"☆87Updated last year
- Weakly Supervised Video Moment Localisation with Contrastive Negative Sample Mining☆30Updated 3 years ago
- Code for our paper "Category Query Learning for Human-Object Interaction Classification" (CVPR2023)☆37Updated 2 years ago
- Repo for CVPR2021 paper "QPIC: Query-Based Pairwise Human-Object Interaction Detection with Image-Wide Contextual Information"☆140Updated 3 years ago
- [ICCV'23] Official PyTorch implementation for paper "Exploring Predicate Visual Context in Detecting Human-Object Interactions"☆86Updated last year
- ECCV2022 Towards Hard-Positive Query Mining for DETR-based Human-Object Interaction Detection☆27Updated 2 years ago
- Pytorch implementation of our paper Classification-Then-Grounding: Reformulating Video Scene Graphs as Temporal Bipartite Graphs, which i…☆48Updated 2 years ago
- [CVPR2022] Bridge-Prompt: Towards Ordinal Action Understanding in Instructional Videos☆100Updated 3 years ago
- ☆28Updated last year
- Spatial-Temporal Transformer for Dynamic Scene Graph Generation, ICCV2021☆208Updated 3 years ago
- A simple and effective feature extractor for untrimmed videos☆13Updated 3 years ago
- CPL: Weakly Supervised Temporal Sentence Grounding with Gaussian-based Contrastive Proposal Learning☆64Updated last year
- Code for CVPR22 paper: Exploring Structure-aware Transformer over Interaction Proposals for Human-Object Interaction Detection.☆49Updated 6 months ago
- The official implementation of Error Detection in Egocentric Procedural Task Videos☆19Updated 2 months ago
- [AAAI 2022] Negative Sample Matters: A Renaissance of Metric Learning for Temporal Grounding☆91Updated 3 years ago
- A curated publication list on weakly-supervised temporal action localization☆155Updated 2 years ago
- Code for our CVPR 2022 Paper "Hybrid Relation Guided Set Matching for Few-shot Action Recognition".☆26Updated 2 years ago
- Code for "Mining the Benefits of Two-stage and One-stage HOI Detection"☆89Updated last year
- [CVPR 2022] Fine-grained Temporal Contrastive Learning for Weakly-supervised Temporal Action Localization☆48Updated 2 years ago
- [ESWA 2025] Official pytorch implementation of "What and When to look?: Temporal Span Proposal Network for Video Relation Detection"☆16Updated 4 years ago
- Progress-Aware Online Action Segmentation for Egocentric Procedural Task Videos☆27Updated last year
- Span-based Localizing Network for Natural Language Video Localization (ACL 2020)☆110Updated 4 years ago
- Repository for the CVPR-20 paper "Local-Global Video-Text Interactions for Temporal Grounding"☆131Updated 4 years ago
- Code for our IJCV 2023 paper "CLIP-guided Prototype Modulating for Few-shot Action Recognition".☆75Updated last year
- [ICCV 2023] Official implementation of Memory-and-Anticipation Transformer for Online Action Understanding☆50Updated 2 years ago
- Official implementation of "ST-HOI: A Spatial-Temporal Baseline for Human-Object Interaction Detection in Videos" (ACM ICMRW 2021)☆57Updated 3 years ago