Vision-CAIR / RelTransformerLinks
β29Updated last year
Alternatives and similar repositories for RelTransformer
Users that are interested in RelTransformer are comparing it to the libraries listed below
Sorting:
- Code for ICCV2021: Discovering Human Interactions with Large-Vocabulary Objects via Query and Multi-Scale Detectionβ24Updated 3 years ago
- π΄ββοΈ ConsNet: Learning Consistency Graph for Zero-Shot Human-Object Interaction Detection (MM 2020)β33Updated last year
- The 1st place solution of 2022 Ego4d Natural Language Queries.β32Updated 2 years ago
- CVPR2022:Learning from Untrimmed Videos: Self-Supervised Video Representation Learning with Hierarchical Consistencyβ17Updated 2 years ago
- praneeth11009 / LIGHTEN-Learning-Interactions-with-Graphs-and-Hierarchical-TEmporal-Networks-for-HOIβ16Updated 4 years ago
- β22Updated 3 years ago
- This is the official implementation of Elaborative Rehearsal for Zero-shot Action Recognition (ICCV2021)β36Updated 3 years ago
- β22Updated 3 years ago
- β35Updated last year
- The Pytorch implementation for "Video-Text Pre-training with Learned Regions"β42Updated 2 years ago
- Video Visual Relation Detection (VidVRD) tracklets generation. also for ACM MM Visual Relation Understanding Grand Challengeβ39Updated 2 years ago
- [ECCV 2022] Official Pytorch Implementation of paper : " Proposal-Free Temporal Action Detection with Global Segmentation Mask Learning "β¦β17Updated 2 years ago
- β48Updated 3 years ago
- CVPR2021: Detecting Human-Object Interaction via Fabricated Compositional Learningβ15Updated 3 years ago
- Official code for "Disentangling Visual Embeddings for Attributes and Objects" Published at CVPR 2022β35Updated last year
- [CVPR-2023] The official dataset of Advancing Visual Grounding with Scene Knowledge: Benchmark and Method.β31Updated last year
- [NeurIPS 2022 Spotlight] RLIP: Relational Language-Image Pre-training and a series of other methods to solve HOI detection and Scene Grapβ¦β73Updated last year
- Official implementation of BGNN(CVPR 2021)β20Updated 3 years ago
- Discovering human interaction with novel objects via zero-shot learning, CVPR, 2020β42Updated 4 years ago
- [AAAI 2022] Detecting Human-Object Interactions with Object-Guided Cross-Modal Calibrated Semantics.β17Updated last year
- Pytorch implementation of our paper Classification-Then-Grounding: Reformulating Video Scene Graphs as Temporal Bipartite Graphs, which iβ¦β47Updated last year
- Official Code for VideoLT: Large-scale Long-tailed Video Recognition (ICCV 2021)β34Updated 3 years ago
- Code for paper "Point and Ask: Incorporating Pointing into Visual Question Answering"β19Updated 2 years ago
- A weakly-supervised scene graph generation codebase. The implementation of our CVPR2021 paper ``Linguistic Structures as Weak Supervisionβ¦β37Updated 4 years ago
- Code for CVPR22 paper: Exploring Structure-aware Transformer over Interaction Proposals for Human-Object Interaction Detection.β47Updated 2 weeks ago
- Code for ECCV2022 Paper "Mining Cross-Person Cues for Body-Part Interactiveness Learning in HOI Detection"β37Updated 2 years ago
- Learning Representational Invariances for Data-Efficient Action Recognitionβ33Updated 3 years ago
- [ Arxiv 2023 ] This repository contains the code for "MUPPET: Multi-Modal Few-Shot Temporal Action Detection"β15Updated last year
- This repository provides data for the VAW dataset as described in the CVPR 2021 paper titled "Learning to Predict Visual Attributes in thβ¦β66Updated 2 years ago
- The implementation of CVPR2021 paper Temporal Query Networks for Fine-grained Video Understandingβ62Updated 3 years ago