Kien085 / SG2Caps
☆23 Updated 4 years ago
Alternatives and similar repositories for SG2Caps
Users who are interested in SG2Caps are comparing it to the repositories listed below
- A weakly-supervised scene graph generation codebase. The implementation of our CVPR2021 paper ``Linguistic Structures as Weak Supervision… ☆37 Updated 4 years ago
- Pytorch implementation of our paper Classification-Then-Grounding: Reformulating Video Scene Graphs as Temporal Bipartite Graphs, which i… ☆48 Updated 2 years ago
- ☆15 Updated last year
- Weakly Supervised Video Moment Retrieval from Text Queries ☆43 Updated 5 years ago
- The Pytorch implementation for "Video-Text Pre-training with Learned Regions" ☆42 Updated 3 years ago
- Video Visual Relation Detection (VidVRD) tracklet generation. Also for the ACM MM Visual Relation Understanding Grand Challenge ☆39 Updated 3 years ago
- [ICCV 2021] Official code for "Learning to Generate Scene Graph from Natural Language Supervision" ☆101 Updated 2 years ago
- Official Tensorflow Implementation of the AAAI-2020 paper "Temporally Grounding Language Queries in Videos by Contextual Boundary-aware P… ☆59 Updated 2 years ago
- ☆25 Updated 3 years ago
- ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration ☆56 Updated 2 years ago
- The HC-STVG Dataset ☆62 Updated 2 years ago
- Improving One-stage Visual Grounding by Recursive Sub-query Construction, ECCV 2020 ☆89 Updated 4 years ago
- AAAI2020-The official implementation of "Learning Cross-modal Context Graph for Visual Grounding" ☆58 Updated 4 years ago
- Bottom-up Top-down image captioning model with PyTorch. ☆14 Updated 5 years ago
- [ECCV 2020] PyTorch code of MMT (a multimodal transformer captioning model) on TVCaption dataset ☆90 Updated 2 years ago
- Codes for paper "Towards Diverse Paragraph Captioning for Untrimmed Videos". CVPR 2021 ☆66 Updated 4 years ago
- Official implementation of "Recovering the Unbiased Scene Graphs from the Biased Ones" (ACMMM 2021) ☆78 Updated 3 years ago
- [ICCV 2021] Target Adaptive Context Aggregation for Video Scene Graph Generation ☆59 Updated 3 years ago
- Visual Relation Grounding in Videos (ECCV'20, Spotlight) ☆57 Updated 3 years ago
- Unpaired Image Captioning ☆36 Updated 4 years ago
- Learning phrase grounding from captioned images through InfoNCE bound on mutual information ☆74 Updated 5 years ago
- Official implementation of BGNN (CVPR 2021) ☆20 Updated 4 years ago
- Code for paper "Stacked Hybrid-Attention and Group Collaborative Learning for Unbiased Scene Graph Generation"