Disentangled Pre-training for Human-Object Interaction Detection
☆27 · Sep 17, 2025 · Updated 5 months ago
Alternatives and similar repositories for DP-HOI
Users who are interested in DP-HOI are comparing it to the repositories listed below.
- [NeurIPS2023] Neural-Logic Human-Object Interaction Detection · ☆14 · Aug 24, 2024 · Updated last year
- ☆22 · Oct 21, 2024 · Updated last year
- CVPR 2023 Accepted Paper HOICLIP: Efficient Knowledge Transfer for HOI Detection with Vision-Language Models · ☆69 · Mar 14, 2024 · Updated last year
- Official implementation of the paper "Boosting Human-Object Interaction Detection with Text-to-Image Diffusion Model" · ☆66 · Aug 9, 2023 · Updated 2 years ago
- Code for the paper "Detecting Any Human-Object Interaction Relationship: Universal HOI Detector with Spatial Prompt Learning on Foundatio… · ☆28 · Nov 8, 2023 · Updated 2 years ago
- ☆29 · Jul 2, 2024 · Updated last year
- The official code for Relational Context Learning for Human-Object Interaction Detection, CVPR2023. · ☆54 · Apr 23, 2023 · Updated 2 years ago
- ☆20 · Jun 6, 2024 · Updated last year
- Official code of ACM MM2024 paper- Unseen No More: Unlocking the Potential of CLIP for Generative Zero-shot HOI Detection · ☆24 · Aug 15, 2024 · Updated last year
- [NeurIPS 2024] Official code for paper "EZ-HOI: VLM Adaptation via Guided Prompt Learning for Zero-Shot HOI Detection" · ☆42 · Jul 7, 2025 · Updated 7 months ago
- Official Implementation for ACM MM2024 paper "VrdONE: One-stage Video Visual Relation Detection". · ☆11 · Nov 13, 2024 · Updated last year
- [ICCV 2023] RLIPv2: Fast Scaling of Relational Language-Image Pre-training · ☆135 · May 28, 2024 · Updated last year
- [ICCV'23] Official PyTorch implementation for paper "Exploring Predicate Visual Context in Detecting Human-Object Interactions" · ☆89 · Jul 4, 2024 · Updated last year
- [ICLR2023] Video Scene Graph Generation from Single-Frame Weak Supervision · ☆12 · Sep 17, 2023 · Updated 2 years ago
- Code of the Grounded MUIE model, REAMO · ☆11 · Dec 3, 2024 · Updated last year
- Official PyTorch implementation of the paper Transformer-Based Image Generation from Scene Graphs https://arxiv.org/abs/2303.04634 · ☆19 · Jan 30, 2024 · Updated 2 years ago
- Code for our CVPR 2022 Paper "GEN-VLKT: Simplify Association and Enhance Interaction Understanding for HOI Detection" · ☆88 · Mar 31, 2024 · Updated last year
- Spatial-Temporal Knowledge-Embedded Transformer for Video Scene Graph Generation (TIP 2024, ACM MM 2023) · ☆19 · Mar 13, 2024 · Updated last year
- ☆17 · Dec 13, 2023 · Updated 2 years ago
- Code for our paper "Category Query Learning for Human-Object Interaction Classification" (CVPR2023) · ☆37 · Jul 9, 2023 · Updated 2 years ago
- [ICCV'2023] Compositional Feature Augmentation for Unbiased Scene Graph Generation · ☆15 · Dec 5, 2023 · Updated 2 years ago
- Official PyTorch implementation Source code for Weakly Supervised Video Scene Graph Generation via Natural Language Supervision, accepted… · ☆23 · Jun 13, 2025 · Updated 8 months ago
- ☆34 · May 12, 2025 · Updated 9 months ago
- [ECCV 2024] OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models · ☆49 · Jan 8, 2025 · Updated last year
- Vision Relation Transformer for Unbiased Scene Graph Generation (ICCV 2023) · ☆22 · Sep 27, 2023 · Updated 2 years ago
- Code for "Mining the Benefits of Two-stage and One-stage HOI Detection" · ☆90 · Mar 31, 2024 · Updated last year
- ☆19 · Oct 22, 2023 · Updated 2 years ago
- mask2former psg · ☆22 · Dec 12, 2022 · Updated 3 years ago
- HalluciDoctor: Mitigating Hallucinatory Toxicity in Visual Instruction Data (Accepted by CVPR 2024) · ☆52 · Jul 16, 2024 · Updated last year
- Official Pytorch Implementation of the framework TEMPURA proposed in our paper Unbiased Scene Graph Generation in Videos accepted by CVPR… · ☆24 · Sep 9, 2025 · Updated 5 months ago
- EventEgo3D: 3D Human Motion Capture from Egocentric Event Streams [CVPR'24] · ☆32 · Jul 23, 2025 · Updated 7 months ago
- [BMVC 2024] Official implementation of Align-DETR · ☆61 · Jul 24, 2024 · Updated last year
- Code implementation of paper "MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval (AAAI2025)" · ☆25 · Feb 2, 2025 · Updated last year
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w… · ☆12 · Jun 28, 2025 · Updated 8 months ago
- Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning · ☆41 · Aug 4, 2025 · Updated 7 months ago
- Utilities for the human-object interaction detection dataset HICO-DET · ☆63 · Dec 14, 2023 · Updated 2 years ago
- ECCV2022 Towards Hard-Positive Query Mining for DETR-based Human-Object Interaction Detection · ☆27 · May 26, 2023 · Updated 2 years ago
- Codes for ICLR 2025 Paper: Towards Semantic Equivalence of Tokenization in Multimodal LLM · ☆79 · Apr 19, 2025 · Updated 10 months ago
- Code release for the paper "Egocentric Video Task Translation" (CVPR 2023 Highlight) · ☆34 · Jun 12, 2023 · Updated 2 years ago