Official Implementation of our WACV2023 paper: “Holistic Interaction Transformer Network for Action Detection”
☆70Jan 9, 2025Updated last year
Alternatives and similar repositories for HIT
Users that are interested in HIT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICCVW 2023] Interaction-Aware Prompting for Zero-Shot Spatio-Temporal Action Detection☆21Feb 22, 2024Updated 2 years ago
- ☆10Jan 3, 2023Updated 3 years ago
- This repository is a fork of https://github.com/joslefaure/HIT customized for the AVA dataset☆17Jun 17, 2023Updated 2 years ago
- [ICCV 2023] Efficient Video Action Detection with Token Dropout and Context Refinement☆38Sep 27, 2023Updated 2 years ago
- This is an official implementation of TubeR: Tubelet Transformer for Video Action Detection☆92Apr 14, 2023Updated 2 years ago
- [ICCV'21] Implementation of "Watch Only Once: An End-to-End Video Action Detection Framework"☆45Jan 17, 2022Updated 4 years ago
- Spatio-Temporal Action Localization System☆425May 21, 2022Updated 3 years ago
- You Only Watch One Frame for Online Spatio-Temporal Action Detection☆36Jun 7, 2023Updated 2 years ago
- [CVPR 2023] STMixer: A One-Stage Sparse Action Detector☆63May 18, 2023Updated 2 years ago
- The second generation of YOWO action detector.☆280May 9, 2024Updated last year
- You Only Watch Once: A Unified CNN Architecture for Real-Time Spatiotemporal Action Localization☆902Oct 28, 2024Updated last year
- Custom ava dataset, Multi-Person Video Dataset Annotation Method of Spatio-Temporally Actions☆134Jun 7, 2022Updated 3 years ago
- [ICCV 2021] MultiSports: A Multi-Person Video Dataset of Spatio-Temporally Localized Sports Actions☆134Aug 4, 2023Updated 2 years ago
- [CVPR 2021] Actor-Context-Actor Relation Network for Spatio-temporal Action Localization☆214Oct 8, 2021Updated 4 years ago
- update code for pytorch1.4☆11Aug 12, 2021Updated 4 years ago
- ☆20Jan 29, 2023Updated 3 years ago
- [ICCV'25] HERMES: temporal-coHERent long-forM understanding with Episodes and Semantics☆38Sep 10, 2025Updated 6 months ago
- ☆118Mar 11, 2026Updated last week
- Code repository for the paper "On the Benefits of 3D Pose and Tracking for Human Action Recognition", (CVPR 2023)☆286Jan 19, 2024Updated 2 years ago
- [NeurIPS 2022 Spotlight] VideoMAE for Action Detection☆69Feb 3, 2023Updated 3 years ago
- Code and pruned models for our paper: K. Gkrispanis, N. Gkalelis, V. Mezaris, "Filter-Pruning of Lightweight Face Detectors Using a Geome…☆14May 8, 2024Updated last year
- Yolov5+SlowFast: Realtime Action Detection Based on PytorchVideo☆556Apr 14, 2023Updated 2 years ago
- Code used at paper "Interaction Relational Network for Mutual Action Recognition" TMM 2021.☆16Apr 5, 2021Updated 4 years ago
- STEP: Spatio-Temporal Progressive Learning for Video Action Detection. CVPR'19 (Oral)☆253Oct 19, 2019Updated 6 years ago
- [CVPR 2022] End-to-End Semi-Supervised Learning for Video Action Detection☆35May 3, 2023Updated 2 years ago
- ☆64Sep 8, 2022Updated 3 years ago
- ☆25Oct 11, 2024Updated last year
- [CVPR2023] Code for the paper, TriDet: Temporal Action Detection with Relative Boundary Modeling☆210Dec 27, 2023Updated 2 years ago
- OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark☆4,951Updated this week
- This is the official implement of paper "ActionCLIP: A New Paradigm for Action Recognition"☆603Dec 6, 2023Updated 2 years ago
- PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.☆10Apr 19, 2024Updated last year
- "Object-Region Video Transformers”, Herzig et al., CVPR 2022☆50Jul 6, 2022Updated 3 years ago
- Code base for zero-shot action localization through spatial-aware object embeddings☆25Nov 3, 2017Updated 8 years ago
- Implementation of <Symbolic Graphics Programming with Large Language Models>☆38Sep 14, 2025Updated 6 months ago
- [NeurIPS 2022] PointTAD: Multi-Label Temporal Action Detection with Learnable Query Points☆47Nov 24, 2023Updated 2 years ago
- [CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking☆763Oct 8, 2024Updated last year
- An annotation and instance segmentation-based multi-object tracking and behavior analysis package.☆56Updated this week
- Repo for our Paper: Cross Quality LFW: A database for Analyzing Cross-Resolution Image Face Recognition in Unconstrained Environments☆19Nov 25, 2022Updated 3 years ago
- facenet-pytorch + DeepSORT☆15Aug 21, 2023Updated 2 years ago