HCPLab-SYSU / STKETView external linksLinks
Spatial-Temporal Knowledge-Embedded Transformer for Video Scene Graph Generation (TIP 2024, ACM MM 2023)
☆19Mar 13, 2024Updated last year
Alternatives and similar repositories for STKET
Users that are interested in STKET are comparing it to the libraries listed below
Sorting:
- [ICLR2023] Video Scene Graph Generation from Single-Frame Weak Supervision☆12Sep 17, 2023Updated 2 years ago
- Official repository for "Boosting Audio Visual Question Answering via Key Semantic-Aware Cues" in ACM MM 2024.☆16Oct 25, 2024Updated last year
- [ICCV'2023] Compositional Feature Augmentation for Unbiased Scene Graph Generation☆15Dec 5, 2023Updated 2 years ago
- Contrastive Video Question Answering via Video Graph Transformer (IEEE T-PAMI'23)☆19Mar 9, 2024Updated last year
- Benchmarking Panoptic Video Scene Graph Generation (PVSG), CVPR'23☆102Apr 30, 2024Updated last year
- Official Pytorch Implementation of the framework TEMPURA proposed in our paper Unbiased Scene Graph Generation in Videos accepted by CVPR…☆24Sep 9, 2025Updated 5 months ago
- [SIGGRAPH2025] Generative Video Matting☆57Aug 12, 2025Updated 6 months ago
- ☆31Mar 5, 2025Updated 11 months ago
- Actor-agnostic Multi-label Action Recognition with Multi-modal Query [ICCVW '23]☆23Oct 20, 2023Updated 2 years ago
- Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline (CVPR 2023)☆70Jan 4, 2026Updated last month
- Video Feature Enhancement with PyTorch☆32Nov 28, 2024Updated last year
- ☆29Oct 4, 2023Updated 2 years ago
- This repository contains the code associated with our 2023 TMI paper "Latent Graph Representations for Critical View of Safety Assessment…☆34Sep 17, 2025Updated 4 months ago
- The official code of Towards Balanced Alignment: Modal-Enhanced Semantic Modeling for Video Moment Retrieval (AAAI2024)☆32Mar 29, 2024Updated last year
- The repository of VG-Refiner paper☆17Dec 9, 2025Updated 2 months ago
- DisTime: Distribution-based Time Representation for Video Large Language Models.☆18Jul 10, 2025Updated 7 months ago
- Finetuning & extending DiffusionDet to video & pedestrian multi-object-tracking☆13Apr 12, 2023Updated 2 years ago
- ☆13Jul 17, 2021Updated 4 years ago
- Official pytorch repository for CG-DETR "Correlation-guided Query-Dependency Calibration in Video Representation Learning for Temporal Gr…☆150Aug 21, 2024Updated last year
- [CVPR 2024] LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation☆13Jun 17, 2024Updated last year
- ☆11Jan 18, 2025Updated last year
- Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"☆19Nov 3, 2025Updated 3 months ago
- GraphSleepNet: Adaptive Spatial-Temporal Graph Convolutional Networks for Sleep Stage Classification☆12Jul 24, 2020Updated 5 years ago
- A collection for basic machine learning and data mining model implementations, in Python, mainly referencing the books: *Machine Learning…☆13Jul 15, 2021Updated 4 years ago
- ☆10Dec 16, 2023Updated 2 years ago
- 在监控画质下实现对校园自行车的重识别,包含REID模型识别,向量数据库检索,UI展示☆10Feb 13, 2024Updated 2 years ago
- [CVPR 2024] "Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition"☆12Feb 27, 2024Updated last year
- A drag-and-drop-enabled, responsive, envelope graph that allows to shape a wave with attack, decay, sustain and release☆11Jan 5, 2023Updated 3 years ago
- [ECCV 2024] OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models☆49Jan 8, 2025Updated last year
- LSTC: Boosting Atomic Action Detection with Long-Short-Term Context☆10Sep 1, 2022Updated 3 years ago
- SDN application for implementing the shortest path routing using RYU SDN controller☆10Jul 23, 2019Updated 6 years ago
- Character Grounding and Re-Identification in Story of Videos and Text Descriptions☆10Jan 17, 2021Updated 5 years ago
- Synthesizing Efficient Data with Diffusion Models for Person Re-Identification Pre-Training☆11Jan 23, 2024Updated 2 years ago
- Pytorch implementation of A Simple yet Effective Pipeline for Radial Distortion Correction☆10Feb 4, 2020Updated 6 years ago
- Github mirror of MediaWiki extension TextExtracts - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Dev…☆15Feb 5, 2026Updated last week
- [ICLR 2026] Official repo for "FrameThinker: Learning to Think with Long Videos via Multi-Turn Frame Spotlighting"☆37Oct 9, 2025Updated 4 months ago
- Web scraping with Python Scrapy Projects☆11Jan 20, 2021Updated 5 years ago
- CholecInstanceSeg: A Tool Instance Segmentation Dataset for Laparoscopic Surgery☆14Dec 18, 2025Updated last month
- Rust tool to get info from your lycamobile.es account☆10Apr 29, 2021Updated 4 years ago