[ICCV'21] Implementation of "Watch Only Once: An End-to-End Video Action Detection Framework"
☆45 · Jan 17, 2022 · Updated 4 years ago
Alternatives and similar repositories for WOO
Users that are interested in WOO are comparing it to the libraries listed below
- Official Implementation of our WACV2023 paper: “Holistic Interaction Transformer Network for Action Detection” ☆70 · Jan 9, 2025 · Updated last year
- This repository is a fork of https://github.com/joslefaure/HIT customized for the AVA dataset ☆17 · Jun 17, 2023 · Updated 2 years ago
- [CVPR 2021] Actor-Context-Actor Relation Network for Spatio-temporal Action Localization ☆214 · Oct 8, 2021 · Updated 4 years ago
- [ICCV 2021] MultiSports: A Multi-Person Video Dataset of Spatio-Temporally Localized Sports Actions ☆133 · Aug 4, 2023 · Updated 2 years ago
- LV-BERT: Exploiting Layer Variety for BERT (Findings of ACL 2021) ☆18 · May 10, 2023 · Updated 2 years ago
- You Only Watch Once: A Unified CNN Architecture for Real-Time Spatiotemporal Action Localization ☆899 · Oct 28, 2024 · Updated last year
- A PyTorch implementation of SlowFast based on ICCV 2019 paper "SlowFast Networks for Video Recognition" ☆14 · Sep 26, 2021 · Updated 4 years ago
- You Only Watch One Frame for Online Spatio-Temporal Action Detection ☆36 · Jun 7, 2023 · Updated 2 years ago
- [AAAI 2022] DCAN: Improving Temporal Action Detection via Dual Context Aggregation ☆17 · Nov 13, 2022 · Updated 3 years ago
- [ICCVW 2023] Interaction-Aware Prompting for Zero-Shot Spatio-Temporal Action Detection ☆21 · Feb 22, 2024 · Updated 2 years ago
- STEP: Spatio-Temporal Progressive Learning for Video Action Detection. CVPR'19 (Oral) ☆252 · Oct 19, 2019 · Updated 6 years ago
- Code for one-stage adaptive set-based HOI detector AS-Net. ☆52 · May 8, 2021 · Updated 4 years ago
- Spatio-Temporal Action Localization System ☆424 · May 21, 2022 · Updated 3 years ago
- [ECCV 2020] Actions as Moving Points ☆270 · Dec 19, 2020 · Updated 5 years ago
- ☆53 · Apr 17, 2022 · Updated 3 years ago
- super image for action recognition ☆56 · Mar 8, 2022 · Updated 3 years ago
- [CVPR 2022 Oral] AdaMixer: A Fast-Converging Query-Based Object Detector ☆237 · Aug 17, 2022 · Updated 3 years ago
- The second generation of YOWO action detector. ☆276 · May 9, 2024 · Updated last year
- Code and model for the AI City Challenge (CVPR 2022) Track 3 Action Detection (Naturalistic Driving Action Recognition) ☆28 · Jul 22, 2023 · Updated 2 years ago
- ☆35 · Oct 21, 2023 · Updated 2 years ago
- [CVPR 2022] End-to-End Semi-Supervised Learning for Video Action Detection ☆35 · May 3, 2023 · Updated 2 years ago
- Unofficial implementation of "Pix2seq: A Language Modeling Framework for Object Detection" on mmdetection ☆33 · Apr 18, 2022 · Updated 3 years ago
- VideoNSA: Native Sparse Attention Scales Video Understanding ☆81 · Nov 16, 2025 · Updated 3 months ago
- Code for the CVPR 2020 oral paper: Weakly Supervised Visual Semantic Parsing ☆33 · Dec 8, 2022 · Updated 3 years ago
- Official implementation for paper "Relational Surrogate Loss Learning", ICLR 2022 ☆37 · Nov 25, 2022 · Updated 3 years ago
- [ICCV 2023] Efficient Video Action Detection with Token Dropout and Context Refinement ☆38 · Sep 27, 2023 · Updated 2 years ago
- This is an official implementation of TubeR: Tubelet Transformer for Video Action Detection ☆89 · Apr 14, 2023 · Updated 2 years ago
- Implementation for the CVPR2019 paper "Graphical Contrastive Losses for Scene Graph Parsing" ☆12 · Nov 11, 2019 · Updated 6 years ago
- Code for the CVPR 2019 paper 'Dance with Flow: Two-in-One Stream for Action Detection' ☆32 · Nov 21, 2022 · Updated 3 years ago
- Official repository for 'Risk of Bias in Chest Radiography Deep Learning Foundation Models' ☆12 · Sep 27, 2023 · Updated 2 years ago
- ☆14 · Aug 1, 2025 · Updated 7 months ago
- A simplified PyTorch version of densecap ☆42 · Dec 11, 2024 · Updated last year
- ☆41 · Sep 21, 2023 · Updated 2 years ago
- [CVPR 2023] RILS: Masked Visual Reconstruction in Language Semantic Space (https://arxiv.org/abs/2301.06958) ☆44 · Sep 5, 2023 · Updated 2 years ago
- Official Code Release for Container: Context Aggregation Network ☆46 · Oct 17, 2021 · Updated 4 years ago
- ☆51 · Jul 7, 2025 · Updated 7 months ago
- ☆40 · Feb 14, 2023 · Updated 3 years ago
- Exploiting unlabeled data with vision and language models for object detection, ECCV 2022 ☆94 · Jan 16, 2024 · Updated 2 years ago
- DepthHuman: A tool for depth image synthesis for human pose estimation ☆12 · Jul 28, 2021 · Updated 4 years ago