yrcong / STTran
Spatial-Temporal Transformer for Dynamic Scene Graph Generation, ICCV2021
☆186 Updated 2 years ago
Related projects:
- A video database bridging human actions and human-object relationships ☆127 Updated 4 years ago
- ☆182 Updated last year
- [CVPR 2022 Oral] TubeDETR: Spatio-Temporal Video Grounding with Transformers ☆166 Updated 11 months ago
- ☆75 Updated 2 years ago
- [ICCV 2021] Target Adaptive Context Aggregation for Video Scene Graph Generation ☆57 Updated 2 years ago
- Repository for the CVPR-20 paper "Local-Global Video-Text Interactions for Temporal Grounding" ☆129 Updated 3 years ago
- Code for "Mining the Benefits of Two-stage and One-stage HOI Detection" ☆89 Updated 5 months ago
- Code for our CVPR 2022 Paper "GEN-VLKT: Simplify Association and Enhance Interaction Understanding for HOI Detection" ☆82 Updated 5 months ago
- PyTorch implementation of our paper Classification-Then-Grounding: Reformulating Video Scene Graphs as Temporal Bipartite Graphs, which i… ☆45 Updated last year
- The toolkit for scene graph generation ☆71 Updated 2 years ago
- [CVPR 2023] Official PyTorch code for paper "Prototype-based Embedding Network for Scene Graph Generation" ☆43 Updated last year
- Code for the ECCV'22 paper "Geometric Features Informed Multi-person Human-object Interaction Recognition in Videos". ☆24 Updated 7 months ago
- image scene graph generation benchmark ☆385 Updated 2 years ago
- [ICCV 2021] Official code for "Learning to Generate Scene Graph from Natural Language Supervision" ☆98 Updated last year
- Space-Time Interaction Graph Parsing Networks for Human-Object Interaction Recognition, ACM MM'21 ☆13 Updated 2 years ago
- ICCV 2021: A brand new hub for Scene Graph Generation methods based on MMdetection (2021). The pipeline of from detection, scene graph ge… ☆60 Updated 2 years ago
- Repo for CVPR2021 paper "QPIC: Query-Based Pairwise Human-Object Interaction Detection with Image-Wide Contextual Information" ☆133 Updated 2 years ago
- This is the code of ECCV 2022 (Oral) paper "Fine-Grained Scene Graph Generation with Data Transfer". ☆91 Updated last year
- Span-based Localizing Network for Natural Language Video Localization (ACL 2020) ☆100 Updated 2 years ago
- Research code for CVPR 2022 paper "SwinBERT: End-to-End Transformers with Sparse Attention for Video Captioning" ☆239 Updated 2 years ago
- [ECCV 2022] Official PyTorch Implementation of the paper: "Zero-Shot Temporal Action Detection via Vision-Language Prompting" ☆98 Updated last year
- Official implementation of "ST-HOI: A Spatial-Temporal Baseline for Human-Object Interaction Detection in Videos" (ACM ICMRW 2021) ☆50 Updated 2 years ago
- [CVPR 2022] Structured Sparse R-CNN for Direct Scene Graph Generation ☆57 Updated 2 years ago
- Code for CVPR23 paper: Learning to Generate Language-supervised and Open-vocabulary Scene Graph using Pre-trained Visual-Semantic Space ☆35 Updated 11 months ago
- ☆13 Updated last year
- Code for CVPR22 paper: Exploring Structure-aware Transformer over Interaction Proposals for Human-Object Interaction Detection. ☆44 Updated last month
- [CVPR'22] Official PyTorch implementation for paper "Efficient Two-Stage Detection of Human–Object Interactions with a Novel Unary–Pairwi… ☆145 Updated last year
- Train Scene Graph Generation for Visual Genome and GQA in PyTorch >= 1.2 with improved zero and few-shot generalization. ☆128 Updated last year
- [CVPR2023] All in One: Exploring Unified Video-Language Pre-training ☆278 Updated last year
- [CVPR2022] Bridge-Prompt: Towards Ordinal Action Understanding in Instructional Videos ☆92 Updated last year