Vision-CAIR / RelTransformer
☆29 · Updated last year
Alternatives and similar repositories for RelTransformer:
Users who are interested in RelTransformer are comparing it to the repositories listed below:
- Code for ICCV2021: Discovering Human Interactions with Large-Vocabulary Objects via Query and Multi-Scale Detection ☆24 · Updated 3 years ago
- This is the official implementation of Elaborative Rehearsal for Zero-shot Action Recognition (ICCV2021) ☆36 · Updated 2 years ago
- ConsNet: Learning Consistency Graph for Zero-Shot Human-Object Interaction Detection (MM 2020) ☆32 · Updated last year
- The 1st place solution of 2022 Ego4d Natural Language Queries. ☆32 · Updated 2 years ago
- Video Visual Relation Detection (VidVRD) tracklets generation. Also for the ACM MM Visual Relation Understanding Grand Challenge ☆39 · Updated 2 years ago
- Official code for "Disentangling Visual Embeddings for Attributes and Objects", published at CVPR 2022 ☆35 · Updated last year
- ☆22 · Updated 3 years ago
- Official implementation of BGNN (CVPR 2021) ☆20 · Updated 3 years ago
- ☆35 · Updated last year
- ☆47 · Updated 2 years ago
- A weakly-supervised scene graph generation codebase. The implementation of our CVPR2021 paper ``Linguistic Structures as Weak Supervision… ☆37 · Updated 3 years ago
- Code for ECCV2022 Paper "Mining Cross-Person Cues for Body-Part Interactiveness Learning in HOI Detection" ☆36 · Updated last year
- CVPR2021: Detecting Human-Object Interaction via Fabricated Compositional Learning ☆16 · Updated 3 years ago
- Series of work (ECCV2020, CVPR2021, CVPR2021, ECCV2022) about Compositional Learning for Human-Object Interaction Exploration ☆81 · Updated last year
- ☆22 · Updated 2 years ago
- [CVPR 2022 (oral)] Bongard-HOI for benchmarking few-shot visual reasoning ☆65 · Updated 2 years ago
- [BMVC 2021] Official PyTorch implementation of "Few Shot Temporal Action Localization using Query Adaptive Transformers" ☆20 · Updated 2 years ago
- The implementation of CVPR2021 paper Temporal Query Networks for Fine-grained Video Understanding ☆62 · Updated 2 years ago
- The PyTorch implementation for "Video-Text Pre-training with Learned Regions" ☆42 · Updated 2 years ago
- [CVPR 2022 Oral] Towards Open Set Temporal Action Localization ☆50 · Updated last year
- Code for CVPR22 paper: Exploring Structure-aware Transformer over Interaction Proposals for Human-Object Interaction Detection. ☆45 · Updated 6 months ago
- Discovering human interaction with novel objects via zero-shot learning, CVPR, 2020 ☆41 · Updated 4 years ago
- [ICCV 2021] Official PyTorch implementation for Discriminative Region-based Multi-Label Zero-Shot Learning SOTA results on NUS-WIDE and … ☆60 · Updated 3 years ago
- [AAAI 2022] Detecting Human-Object Interactions with Object-Guided Cross-Modal Calibrated Semantics. ☆17 · Updated last year
- Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval with Partial Query (ICCV2021) ☆20 · Updated 3 years ago
- [NeurIPS 2022 Spotlight] RLIP: Relational Language-Image Pre-training and a series of other methods to solve HOI detection and Scene Grap… ☆73 · Updated 8 months ago
- ☆12 · Updated 3 years ago
- PyTorch implementation of our paper Classification-Then-Grounding: Reformulating Video Scene Graphs as Temporal Bipartite Graphs, which i… ☆46 · Updated last year
- [ECCV 2022] Official PyTorch implementation of paper: "Proposal-Free Temporal Action Detection with Global Segmentation Mask Learning"… ☆19 · Updated 2 years ago
- Code for paper "Stacked Hybrid-Attention and Group Collaborative Learning for Unbiased Scene Graph Generation" ☆34 · Updated 2 years ago