IIGROUP / PUMLinks

[CVPR 2021] Pytorch implementation for Probabilistic Modeling of Semantic Ambiguity for Scene Graph Generation

☆19

Alternatives and similar repositories for PUM

Users that are interested in PUM are comparing it to the libraries listed below

Sorting:

Kien085 / SG2Caps
☆22Updated 3 years ago
huoxingmeishi / Awesome-Scene-Graphs
☆54Updated 5 years ago
YiwuZhong / SGG_from_NLS
[ICCV 2021] Official code for "Learning to Generate Scene Graph from Natural Language Supervision"
☆101Updated 2 years ago
yekeren / WSSGG
A weakly-supervised scene graph generation codebase. The implementation of our CVPR2021 paper ``Linguistic Structures as Weak Supervision…
☆37Updated 4 years ago
xdshang / VidVRD-II
Video Visual Relation Detection via Iterative Inference (ACM MM 2021)
☆5Updated 3 years ago
wenz116 / DRFT
End-to-end Multi-modal Video Temporal Grounding, NeurIPS 2021
☆18Updated 3 years ago
doc-doc / vRGV
Visual Relation Grounding in Videos (ECCV'20, Spotlight)
☆57Updated 2 years ago
jshi31 / WS-SGG
The implementation of "A Simple Baseline for Weakly-Supervised Scene Graph Generation" for ICCV2021
☆15Updated 3 years ago
yytzsy / grounding_changing_distribution
☆34Updated 4 years ago
SijieSong / CVPR21-Cogrounding_semantic_attention
☆14Updated 4 years ago
showlab / Region_Learner
The Pytorch implementation for "Video-Text Pre-training with Learned Regions"
☆42Updated 3 years ago
alirezazareian / vspnet
Code for the CVPR 2020 oral paper: Weakly Supervised Visual Semantic Parsing
☆35Updated 2 years ago
iacercalixto / butd-image-captioning
Bottom-up Top-down image captioning model with PyTorch.
☆13Updated 4 years ago
alirezazareian / gbnet
Bridging Knowledge Graphs to Generate Scene Graphs, ECCV 2020
☆69Updated last year
Dawn-LX / VidSGG-BIG
Pytorch implementation of our paper Classification-Then-Grounding: Reformulating Video Scene Graphs as Temporal Bipartite Graphs, which i…
☆48Updated 2 years ago
praneeth11009 / LIGHTEN-Learning-Interactions-with-Graphs-and-Hierarchical-TEmporal-Networks-for-HOI
☆16Updated 4 years ago
mods333 / energy-based-scene-graph
Code release for Energy-Based Learning for Scene Graph Genertaion
☆94Updated 3 years ago
Vision-CAIR / RelTransformer
☆29Updated last year
layer6ai-labs / SGG-Seq2Seq
Code for the ICCV'21 paper "Context-aware Scene Graph Generation with Seq2Seq Transformers"
☆43Updated 3 years ago
forwchen / HVTG
Code for ECCV 2020 paper "Hierarchical Visual-Textual Graph for Temporal Activity Localization via Language"
☆17Updated 4 years ago
youngfly11 / LCMCG-PyTorch
AAAI2020-The official implementation of "Learning Cross-modal Context Graph for Visual Grounding"
☆58Updated 3 years ago
zaynmi / seada-vqa
A pytorch implemetation of data augmentation method for visual question answering
☆21Updated 2 years ago
TheShadow29 / vognet-pytorch
[CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606)
☆67Updated 5 years ago
Dawn-LX / VidVRD-tracklets
Video Visual Relation Detection (VidVRD) tracklets generation. also for ACM MM Visual Relation Understanding Grand Challenge
☆39Updated 2 years ago
frostinassiky / bsp
Placeholder for code of BSP.
☆11Updated 3 years ago
Dawn-LX / OpenVoc-VidVRD
Official code for the ICLR2023 paper Compositional Prompt Tuning with Motion Cues for Open-vocabulary Video Relation Detection
☆43Updated last year
MCG-NJU / TRACE
[ICCV 2021] Target Adaptive Context Aggregation for Video Scene Graph Generation
☆58Updated 2 years ago
zfchenUnique / VID-Sentence
This repository provides the dataset introduced by our WSSTG paper
☆12Updated 6 years ago
gujiuxiang / unpaired_image_captioning
Unpaired Image Captioning
☆36Updated 4 years ago
thunlp / VisualDS
☆26Updated 3 years ago