kellyiss / SituFormer
Official implementation of the paper Rethinking the Two-Stage Framework for Grounded Situation Recognition, AAAI 2022.
☆13 · Updated 2 years ago
Related projects
Alternatives and complementary repositories for SituFormer
- Code release for Learning to Assemble Neural Module Tree Networks for Visual Grounding (ICCV 2019) ☆37 · Updated 4 years ago
- Codes for ECCV paper: "Sketching Image Gist: Human-Mimetic Hierarchical Scene Graph Generation" ☆16 · Updated 4 years ago
- Improving One-stage Visual Grounding by Recursive Sub-query Construction, ECCV 2020 ☆82 · Updated 3 years ago
- Code for CVPR 2021 paper: Context-aware Biaffine Localizing Network for Temporal Sentence Grounding ☆20 · Updated 3 years ago
- Code for paper "Stacked Hybrid-Attention and Group Collaborative Learning for Unbiased Scene Graph Generation" ☆32 · Updated 2 years ago
- Video Visual Relation Detection via Iterative Inference (ACM MM 2021) ☆4 · Updated 2 years ago
- A weakly-supervised scene graph generation codebase. The implementation of our CVPR2021 paper "Linguistic Structures as Weak Supervision… ☆37 · Updated 3 years ago
- This repository provides the dataset introduced by the paper "Where Does It Exist: Spatio-Temporal Video Grounding for Multi-Form Sentenc… ☆56 · Updated 4 years ago
- Official codebase for "Ref-NMS: Breaking Proposal Bottlenecks in Two-Stage Referring Expression Grounding" ☆21 · Updated 3 years ago
- [AAAI 2022] Negative Sample Matters: A Renaissance of Metric Learning for Temporal Grounding ☆89 · Updated 2 years ago
- Adaptive Offline Quintuplet Loss for Image-Text Matching (AOQ) ☆34 · Updated 4 years ago
- Research code for CVPR 2022 paper: "EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching" ☆24 · Updated 2 years ago
- This repo contains code for Invariant Grounding for Video Question Answering ☆26 · Updated last year
- ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration ☆56 · Updated last year
- Code for Greedy Gradient Ensemble for Visual Question Answering (ICCV 2021, Oral) ☆26 · Updated 2 years ago
- The implementation of "A Simple Baseline for Weakly-Supervised Scene Graph Generation" for ICCV2021 ☆15 · Updated 3 years ago
- [ICCV2021] Generic Event Boundary Detection: A Benchmark for Event Segmentation ☆68 · Updated 2 years ago
- [CVPR'2022 Oral] The Devil is in the Labels: Noisy Label Correction for Robust Scene Graph Generation ☆29 · Updated last year
- Source code of our TCSVT 2020 paper "Multi-level Knowledge Injecting for Visual Commonsense Reasoning" ☆11 · Updated 2 months ago
- Pytorch implementation of our paper Classification-Then-Grounding: Reformulating Video Scene Graphs as Temporal Bipartite Graphs, which i… ☆45 · Updated last year
- The code repository for "Cross-Modal and Hierarchical Modeling of Video and Text" in PyTorch ☆16 · Updated 5 years ago
- Implementation for MAF: Multimodal Alignment Framework ☆43 · Updated 3 years ago
- Code for the ICCV'21 paper "Context-aware Scene Graph Generation with Seq2Seq Transformers" ☆44 · Updated 2 years ago
- The codes and features of the re-implementation of SIGIR 2021 work "Deconfounded Video Moment Retrieval with Causal Intervention" ☆35 · Updated 3 years ago
- [NeurIPS 2021] Introspective Distillation for Robust Question Answering ☆12 · Updated 2 years ago