HCPLab-SYSU/STKET

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/HCPLab-SYSU/STKET)

HCPLab-SYSU / STKET

Spatial-Temporal Knowledge-Embedded Transformer for Video Scene Graph Generation (TIP 2024, ACM MM 2023)

☆19

Alternatives and similar repositories for STKET

Users that are interested in STKET are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zjucsq / PLA
View on GitHub
[ICLR2023] Video Scene Graph Generation from Single-Frame Weak Supervision
☆12Sep 17, 2023Updated 2 years ago
MCG-NJU / TRACE
View on GitHub
[ICCV 2021] Target Adaptive Context Aggregation for Video Scene Graph Generation
☆60Aug 27, 2022Updated 3 years ago
GeWu-Lab / TSPM
View on GitHub
Official repository for "Boosting Audio Visual Question Answering via Key Semantic-Aware Cues" in ACM MM 2024.
☆17Oct 25, 2024Updated last year
GeWu-Lab / APPO
View on GitHub
The official repository for CVPR'26 Paper "APPO: Attention-guided Perception Policy Optimization for Video Reasoning"
☆16Mar 19, 2026Updated 4 months ago
Vision-CAIR / RelTransformer
View on GitHub
☆29Oct 4, 2023Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
jaleedkhan / neusire
View on GitHub
NeuSyRE: A Neuro-Symbolic Visual Understanding and Reasoning Framework based on Scene Graph Enrichment
☆24Mar 10, 2024Updated 2 years ago
HKUST-LongGroup / CFA
View on GitHub
[ICCV 2023] Compositional Feature Augmentation for Unbiased Scene Graph Generation
☆15Dec 5, 2023Updated 2 years ago
ranjaykrishna / GraphViz
View on GitHub
Simple library to automatically visualize graph structures, especially scene graphs.
☆46Jul 25, 2019Updated 7 years ago
doc-doc / CoVGT
View on GitHub
Contrastive Video Question Answering via Video Graph Transformer (IEEE T-PAMI'23)
☆20Mar 9, 2024Updated 2 years ago
xingaoli / DP-HOI
View on GitHub
Disentangled Pre-training for Human-Object Interaction Detection
☆28Sep 17, 2025Updated 10 months ago
CAMMA-public / SurgLatentGraph
View on GitHub
This repository contains the code associated with our 2023 TMI paper "Latent Graph Representations for Critical View of Safety Assessment…
☆38Sep 17, 2025Updated 10 months ago
wdrink / OmniVid
View on GitHub
☆58Jun 4, 2024Updated 2 years ago
rajatkoner08 / rtn
View on GitHub
This is a code repository for Relation Transformer Network
☆13Nov 30, 2021Updated 4 years ago
comic-xr / CoMIC
View on GitHub
☆42May 4, 2026Updated 2 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
Evm7 / Tutorials-Computer-Vision
View on GitHub
During my research I usually like to visuallize and understand clearly how some papers/models work. In this repository I will create some…
☆12Apr 7, 2022Updated 4 years ago
sayaknag / unbiasedSGG
View on GitHub
Official Pytorch Implementation of the framework TEMPURA proposed in our paper Unbiased Scene Graph Generation in Videos accepted by CVPR…
☆25Sep 9, 2025Updated 10 months ago
Unakar / Bike-REID
View on GitHub
在监控画质下实现对校园自行车的重识别，包含REID模型识别，向量数据库检索，UI展示
☆11Feb 13, 2024Updated 2 years ago
sail-sg / VGT
View on GitHub
Video Graph Transformer for Video Question Answering (ECCV'22)
☆49Jun 8, 2023Updated 3 years ago
Marinto-Richee / YOLOv8-and-GroundingDINO-for-Real-Time-License-Plate-Detection
View on GitHub
A project using YoloV8 to detect License Plates
☆13Sep 29, 2023Updated 2 years ago
siml3 / HL-Net
View on GitHub
☆13Sep 23, 2023Updated 2 years ago
gpt4vision / OvSGTR
View on GitHub
[ECCV 2024 Best Paper Candidate] Implementation of "Expanding Scene Graph Boundaries: Fully Open-vocabulary Scene Graph Generation via Vi…
☆104Jul 27, 2025Updated 11 months ago
EPFL-VILAB / adversarial-prompts
View on GitHub
☆14Mar 28, 2024Updated 2 years ago
fyyCS / LSLD
View on GitHub
☆14Nov 13, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
rlqja1107 / torch-LLM4SGG
View on GitHub
Official PyTorch implementation Source code for LLM4SGG: Large Language Models for Weakly Supervised Scene Graph Generation, accepted at …
☆116Jul 18, 2024Updated 2 years ago
schowdhury671 / meerkat
View on GitHub
☆35Jul 9, 2025Updated last year
GeWu-Lab / MS-Bot
View on GitHub
The offical repo for "Play to the Score: Stage-Guided Dynamic Multi-Sensory Fusion for Robotic Manipulation", CoRL 2024 (ORAL)
☆22Jun 25, 2025Updated last year
MCG-NJU / VideoMAE-Action-Detection
View on GitHub
[NeurIPS 2022 Spotlight] VideoMAE for Action Detection
☆70Feb 3, 2023Updated 3 years ago
franciszchen / SCA-Net
View on GitHub
☆10Oct 7, 2023Updated 2 years ago
Dawn-LX / VidSGG-BIG
View on GitHub
Pytorch implementation of our paper Classification-Then-Grounding: Reformulating Video Scene Graphs as Temporal Bipartite Graphs, which i…
☆47Jul 11, 2023Updated 3 years ago
aa200647963 / SGG-DHL
View on GitHub
This repository contains code for the paper 'Dual-branch Hybrid Learning Network for Unbiased Scene Graph Generation'.
☆17Aug 6, 2022Updated 3 years ago
zyj20 / MPReID
View on GitHub
☆10Dec 16, 2023Updated 2 years ago
ttgeng233 / UnAV
View on GitHub
Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline (CVPR 2023)
☆73Jan 4, 2026Updated 6 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
birlrobotics / vs-gats
View on GitHub
Code for paper: Visual-Semantic Graph Attention Networks for Human-Object Interaction Detection. Project page: http://www.juanrojas.net/v…
☆19Mar 3, 2021Updated 5 years ago
zonghai-yao / pagerank
View on GitHub
简单的pagerank基础上加上稀疏化矩阵化并行化等处理
☆12Oct 8, 2019Updated 6 years ago
NingWang2049 / STIGPN
View on GitHub
Space-Time Interaction Graph Parsing Networks for Human-Object Interaction Recognition，ACM MM'21
☆14May 12, 2022Updated 4 years ago
MCG-NJU / Structured-Sparse-RCNN
View on GitHub
[CVPR 2022] Structured Sparse R-CNN for Direct Scene Graph Generation
☆64Jun 7, 2022Updated 4 years ago
KeNiu042 / Diffusion-ReID
View on GitHub
Synthesizing Efficient Data with Diffusion Models for Person Re-Identification Pre-Training
☆11Jan 23, 2024Updated 2 years ago
guanxiongsun / vfe.pytorch
View on GitHub
Video Feature Enhancement with PyTorch
☆32Nov 28, 2024Updated last year
GeWu-Lab / BML_TPAMI2024
View on GitHub
The repo for "On-the-fly Modulation for Balanced Multimodal Learning", T-PAMI 2024
☆19Sep 29, 2024Updated last year