tanqiu98 / 2G-GCNView external linksLinks
Code for the ECCV'22 paper "Geometric Features Informed Multi-person Human-object Interaction Recognition in Videos".
☆27Feb 5, 2024Updated 2 years ago
Alternatives and similar repositories for 2G-GCN
Users that are interested in 2G-GCN are comparing it to the libraries listed below
Sorting:
- Space-Time Interaction Graph Parsing Networks for Human-Object Interaction Recognition,ACM MM'21☆14May 12, 2022Updated 3 years ago
- Code for CVPR'21 paper "Learning Asynchronous and Sparse Human-Object Interaction in Videos".☆24Aug 6, 2021Updated 4 years ago
- Code for CVPR22 paper: Exploring Structure-aware Transformer over Interaction Proposals for Human-Object Interaction Detection.☆49Jun 5, 2025Updated 8 months ago
- praneeth11009 / LIGHTEN-Learning-Interactions-with-Graphs-and-Hierarchical-TEmporal-Networks-for-HOI☆16Sep 19, 2020Updated 5 years ago
- ☆26Oct 8, 2021Updated 4 years ago
- Open Set Video HOI detection from Action-centric Chain-of-Look Prompting, ICCV2023☆12Oct 3, 2023Updated 2 years ago
- ☆27Jun 11, 2022Updated 3 years ago
- CVPR2022 Distillation Using Oracle Queries for Transformer-based Human-Object Interaction Detection☆24Sep 17, 2022Updated 3 years ago
- ☆49Mar 8, 2022Updated 3 years ago
- ☆33Feb 11, 2023Updated 3 years ago
- [IROS 2023] Interactive Spatiotemporal Token Attention Network for Skeleton-based General Interactive Action Recognition☆21Jul 12, 2025Updated 7 months ago
- Repo for CVPR2021 paper "QPIC: Query-Based Pairwise Human-Object Interaction Detection with Image-Wide Contextual Information"☆142Jul 20, 2022Updated 3 years ago
- Codebase for "Revisiting spatio-temporal layouts for compositional action recognition" (Oral at BMVC 2021).☆26Apr 3, 2022Updated 3 years ago
- [CVPR'22] Official PyTorch implementation for paper "Efficient Two-Stage Detection of Human–Object Interactions with a Novel Unary–Pairwi…☆165Jun 22, 2023Updated 2 years ago
- FG2021: Cross Attentional AV Fusion for Dimensional Emotion Recognition☆33Nov 29, 2024Updated last year
- A tool built on top of OpenFace to detect eye contact with babies.☆13Nov 27, 2018Updated 7 years ago
- Video Visual Relation Detection (VidVRD) tracklets generation. also for ACM MM Visual Relation Understanding Grand Challenge☆39Dec 5, 2022Updated 3 years ago
- Code for our CVPR 2022 Paper "GEN-VLKT: Simplify Association and Enhance Interaction Understanding for HOI Detection"☆88Mar 31, 2024Updated last year
- Normalization Matters in Weakly Supervised Object Localization (ICCV 2021)☆11Oct 24, 2021Updated 4 years ago
- ☆10Jan 9, 2025Updated last year
- A codebase for data crawling and preprocessing for TTS and ASR systems training.☆22Feb 5, 2026Updated last week
- Code for "DIRV: Dense Interaction Region Voting for End-to-End Human-Object Interaction Detection" (AAAI 2021)☆38Mar 14, 2021Updated 4 years ago
- Code repository for the paper: 'Something-Else: Compositional Action Recognition with Spatial-Temporal Interaction Networks'☆148Aug 25, 2023Updated 2 years ago
- [CVPR 2024] "Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition"☆12Feb 27, 2024Updated last year
- Official code release for "TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion", accepted ICIST 2023☆12Mar 17, 2024Updated last year
- ☆13May 21, 2024Updated last year
- Large Language-and-Vision Assistant for BioMedicine, built towards multimodal GPT-4 level capabilities.☆10Nov 29, 2023Updated 2 years ago
- Applying Deep Reinforcement Learning for dialogue generation. aka chatbot☆13Apr 30, 2017Updated 8 years ago
- ☆12Apr 26, 2022Updated 3 years ago
- [CVPR 2023] ViPLO - Official Pytorch Implementation☆42Jun 22, 2023Updated 2 years ago
- ☆10Nov 9, 2022Updated 3 years ago
- Cascade Pose Regression by global tunning☆12May 22, 2015Updated 10 years ago
- [CVPR 2024] Code and datasets for 'Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos'☆13Jun 16, 2024Updated last year
- Code for paper Audio Visual Speaker Localization from EgoCentric Views☆11Jul 3, 2024Updated last year
- Dur360BEV: (ICRA 2025) A Real-world 360-degree Single Camera Dataset and Benchmark for Bird-Eye View Mapping in Autonomous Driving☆23Feb 2, 2026Updated last week
- Official Implementation for ACM MM2024 paper "VrdONE: One-stage Video Visual Relation Detection".☆11Nov 13, 2024Updated last year
- UVA-Human-Skeleton-Preprocessing☆10May 4, 2023Updated 2 years ago
- [TPAMI 2023] Local-Global Context Aware Transformer for Language-Guided Video Segmentation☆48Jan 20, 2024Updated 2 years ago
- Tools for the Parse-27k Dataset - evaluation routines and some simple scripts to get started...☆10Jul 16, 2016Updated 9 years ago