Vision-CAIR / RelTransformer
β29Updated last year
Alternatives and similar repositories for RelTransformer:
Users that are interested in RelTransformer are comparing it to the libraries listed below
- Code for ICCV2021: Discovering Human Interactions with Large-Vocabulary Objects via Query and Multi-Scale Detectionβ24Updated 3 years ago
- π΄ββοΈ ConsNet: Learning Consistency Graph for Zero-Shot Human-Object Interaction Detection (MM 2020)β32Updated last year
- This is the official implementation of Elaborative Rehearsal for Zero-shot Action Recognition (ICCV2021)β36Updated 3 years ago
- β22Updated 2 years ago
- Official implementation of BGNN(CVPR 2021)β20Updated 3 years ago
- β48Updated 3 years ago
- CVPR2021: Detecting Human-Object Interaction via Fabricated Compositional Learningβ15Updated 3 years ago
- Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval with Partial Query (ICCV2021)β20Updated 3 years ago
- Code for paper "Point and Ask: Incorporating Pointing into Visual Question Answering"β18Updated 2 years ago
- The Pytorch implementation for "Video-Text Pre-training with Learned Regions"β42Updated 2 years ago
- Series of work (ECCV2020, CVPR2021, CVPR2021, ECCV2022) about Compositional Learning for Human-Object Interaction Explorationβ80Updated 2 years ago
- Official code for "Disentangling Visual Embeddings for Attributes and Objects" Published at CVPR 2022β35Updated last year
- Discovering human interaction with novel objects via zero-shot learning, CVPR, 2020β42Updated 4 years ago
- The 1st place solution of 2022 Ego4d Natural Language Queries.β32Updated 2 years ago
- β22Updated 3 years ago
- Video Visual Relation Detection (VidVRD) tracklets generation. also for ACM MM Visual Relation Understanding Grand Challengeβ39Updated 2 years ago
- β35Updated last year
- praneeth11009 / LIGHTEN-Learning-Interactions-with-Graphs-and-Hierarchical-TEmporal-Networks-for-HOIβ16Updated 4 years ago
- [AAAI 2022] Detecting Human-Object Interactions with Object-Guided Cross-Modal Calibrated Semantics.β17Updated last year
- Learning Representational Invariances for Data-Efficient Action Recognitionβ33Updated 3 years ago
- [ICCV 2021] Official Pytorch implementation for Discriminative Region-based Multi-Label Zero-Shot Learning SOTA results on NUS-WIDE and β¦β62Updated 3 years ago
- Code for ECCV2022 Paper "Mining Cross-Person Cues for Body-Part Interactiveness Learning in HOI Detection"β36Updated 2 years ago
- Placeholder for code of BSP.β11Updated 3 years ago
- The implementation of CVPR2021 paper Temporal Query Networks for Fine-grained Video Understandingβ62Updated 3 years ago
- Code for CVPR22 paper: Exploring Structure-aware Transformer over Interaction Proposals for Human-Object Interaction Detection.β46Updated last month
- [AAAI2023] Repo for the paper ''End-to-End Zero-Shot HOI Detection via Vision and Language Knowledge Distillation''.β23Updated 2 years ago
- [NeurIPS 2022 Spotlight] RLIP: Relational Language-Image Pre-training and a series of other methods to solve HOI detection and Scene Grapβ¦β73Updated 11 months ago
- β16Updated 4 months ago
- [ Arxiv 2023 ] This repository contains the code for "MUPPET: Multi-Modal Few-Shot Temporal Action Detection"β15Updated last year
- Code accompanying Ego-Exo: Transferring Visual Representations from Third-person to First-person Videos (CVPR 2021)β33Updated 3 years ago