Spatial-Temporal Knowledge-Embedded Transformer for Video Scene Graph Generation (TIP 2024, ACM MM 2023)
☆19Mar 13, 2024Updated last year
Alternatives and similar repositories for STKET
Users that are interested in STKET are comparing it to the libraries listed below
Sorting:
- [ICLR2023] Video Scene Graph Generation from Single-Frame Weak Supervision☆12Sep 17, 2023Updated 2 years ago
- Official repository for "Boosting Audio Visual Question Answering via Key Semantic-Aware Cues" in ACM MM 2024.☆16Oct 25, 2024Updated last year
- [ICCV 2021] Target Adaptive Context Aggregation for Video Scene Graph Generation☆59Aug 27, 2022Updated 3 years ago
- [ICCV'2023] Compositional Feature Augmentation for Unbiased Scene Graph Generation☆15Dec 5, 2023Updated 2 years ago
- EraseAnything, ICML 2025☆39Sep 28, 2025Updated 5 months ago
- Contrastive Video Question Answering via Video Graph Transformer (IEEE T-PAMI'23)☆19Mar 9, 2024Updated last year
- Disentangled Pre-training for Human-Object Interaction Detection☆27Sep 17, 2025Updated 5 months ago
- Benchmarking Panoptic Video Scene Graph Generation (PVSG), CVPR'23☆102Apr 30, 2024Updated last year
- Official Pytorch Implementation of the framework TEMPURA proposed in our paper Unbiased Scene Graph Generation in Videos accepted by CVPR…☆24Sep 9, 2025Updated 5 months ago
- ☆54Jun 4, 2024Updated last year
- [SIGGRAPH2025] Generative Video Matting☆58Aug 12, 2025Updated 6 months ago
- Code of the paper Relation-enhanced Negative Sampling for Multimodal Knowledge Graph Completion (ACM MM22))☆26May 22, 2024Updated last year
- ☆31Mar 5, 2025Updated last year
- [ICCV'25] HERMES: temporal-coHERent long-forM understanding with Episodes and Semantics☆38Sep 10, 2025Updated 5 months ago
- Actor-agnostic Multi-label Action Recognition with Multi-modal Query [ICCVW '23]☆24Oct 20, 2023Updated 2 years ago
- Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline (CVPR 2023)☆70Jan 4, 2026Updated 2 months ago
- The official implementation of our work Hawkeye: Discovering and Grounding Implicit Anomalous Sentiment in Recon-videos via Scene-enhanc…☆12Oct 14, 2024Updated last year
- Video Feature Enhancement with PyTorch☆32Nov 28, 2024Updated last year
- ☆29Oct 4, 2023Updated 2 years ago
- This repository contains the code associated with our 2023 TMI paper "Latent Graph Representations for Critical View of Safety Assessment…☆35Sep 17, 2025Updated 5 months ago
- The official code of Towards Balanced Alignment: Modal-Enhanced Semantic Modeling for Video Moment Retrieval (AAAI2024)☆32Mar 29, 2024Updated last year
- ☆13Jul 17, 2021Updated 4 years ago
- The repository of VG-Refiner paper☆17Dec 9, 2025Updated 2 months ago
- Finetuning & extending DiffusionDet to video & pedestrian multi-object-tracking☆13Apr 12, 2023Updated 2 years ago
- [ECCV 2024 Best Paper Candidate] Implementation of "Expanding Scene Graph Boundaries: Fully Open-vocabulary Scene Graph Generation via Vi…☆95Jul 27, 2025Updated 7 months ago
- Official pytorch repository for CG-DETR "Correlation-guided Query-Dependency Calibration in Video Representation Learning for Temporal Gr…☆151Aug 21, 2024Updated last year
- ☆36Jul 9, 2025Updated 7 months ago
- 【CVPR'24】OST: Refining Text Knowledge with Optimal Spatio-Temporal Descriptor for General Video Recognition☆38Apr 27, 2024Updated last year
- GraphSleepNet: Adaptive Spatial-Temporal Graph Convolutional Networks for Sleep Stage Classification☆12Jul 24, 2020Updated 5 years ago
- DisTime: Distribution-based Time Representation for Video Large Language Models.☆19Jul 10, 2025Updated 7 months ago
- ☆10Dec 16, 2023Updated 2 years ago
- Retrieval Augmented Generation, but no servers involved. Backed by S3☆12Nov 3, 2023Updated 2 years ago
- Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"☆19Nov 3, 2025Updated 4 months ago
- ☆11Jan 18, 2025Updated last year
- ☆11Sep 25, 2022Updated 3 years ago
- [CVPR 2024] LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation☆13Jun 17, 2024Updated last year
- A collection for basic machine learning and data mining model implementations, in Python, mainly referencing the books: *Machine Learning…☆13Jul 15, 2021Updated 4 years ago
- Course asset for the VR Developer Nanodegree > VR Scenes & Objects > Game Objects lesson☆11Jun 28, 2022Updated 3 years ago
- A drag-and-drop-enabled, responsive, envelope graph that allows to shape a wave with attack, decay, sustain and release☆11Jan 5, 2023Updated 3 years ago