zhengsipeng / VRDFormer_VRD
☆13Updated last year
Related projects:
- CPL: Weakly Supervised Temporal Sentence Grounding with Gaussian-based Contrastive Proposal Learning☆56Updated 5 months ago
- Official pytorch implementation of "Explore-And-Match: Bridging Proposal-Based and Proposal-Free With Transformer for Sentence Grounding …☆42Updated 2 years ago
- source code of our MGPN in SIGIR 2022☆18Updated 2 years ago
- Preliminary code for reviewers☆12Updated 3 years ago
- [ICCV 2021] Target Adaptive Context Aggregation for Video Scene Graph Generation☆57Updated 2 years ago
- Space-Time Interaction Graph Parsing Networks for Human-Object Interaction Recognition, ACM MM'21☆13Updated 2 years ago
- [AAAI 2022] Negative Sample Matters: A Renaissance of Metric Learning for Temporal Grounding☆89Updated last year
- Code for our paper "Category Query Learning for Human-Object Interaction Classification" (CVPR2023)☆34Updated last year
- Weakly Supervised Video Moment Localisation with Contrastive Negative Sample Mining☆22Updated 2 years ago
- Code for CVPR23 paper: Learning to Generate Language-supervised and Open-vocabulary Scene Graph using Pre-trained Visual-Semantic Space☆35Updated 11 months ago
- Span-based Localizing Network for Natural Language Video Localization (ACL 2020)☆100Updated 2 years ago
- Pytorch implementation of our paper Classification-Then-Grounding: Reformulating Video Scene Graphs as Temporal Bipartite Graphs, which i…☆45Updated last year
- CVPR2022 Distillation Using Oracle Queries for Transformer-based Human-Object Interaction Detection☆23Updated 2 years ago
- Code for our CVPR 2022 Paper "GEN-VLKT: Simplify Association and Enhance Interaction Understanding for HOI Detection"☆82Updated 5 months ago
- ECCV2022 Towards Hard-Positive Query Mining for DETR-based Human-Object Interaction Detection☆27Updated last year
- [CVPR 2023] Official Pytorch code for paper "Prototype-based Embedding Network for Scene Graph Generation"☆43Updated last year
- Repository for the CVPR-20 paper "Local-Global Video-Text Interactions for Temporal Grounding"☆129Updated 3 years ago
- Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022☆88Updated last year
- The first unofficial implementation of CLIP4Caption: CLIP for Video Caption (ACMMM 2021)☆13Updated last year
- Video Corpus Moment Retrieval with Contrastive Learning (SIGIR 2021)☆51Updated 3 years ago
- Code for "Mining the Benefits of Two-stage and One-stage HOI Detection"☆89Updated 5 months ago
- Can I Trust Your Answer? Visually Grounded Video Question Answering (CVPR'24, Highlight)☆49Updated 2 months ago
- Related work on Temporal Sentence Grounding in Videos / Natural Language Video Localization / Video Moment Retrieval☆31Updated 2 years ago
- Temporal Moment(Action) Localization via Language / Temporal Language Grounding / Video Moment Retrieval☆93Updated 2 years ago
- "Video Moment Retrieval from Text Queries via Single Frame Annotation" in SIGIR 2022.☆63Updated 2 years ago