MCG-NJU / STMixer
[CVPR 2023] STMixer: A One-Stage Sparse Action Detector
☆58Updated last year
Alternatives and similar repositories for STMixer:
Users that are interested in STMixer are comparing it to the libraries listed below
- This is an official implementation of TubeR: Tubelet Transformer for Video Action Detection☆79Updated 2 years ago
- BasicTAD: an Astounding RGB-Only Baselinefor Temporal Action Detection☆50Updated last year
- [NeurIPS 2022 Spotlight] VideoMAE for Action Detection☆64Updated 2 years ago
- [ICCVW 2023] Interaction-Aware Prompting for Zero-Shot Spatio-Temporal Action Detection☆20Updated last year
- [ICCV 2021] MultiSports: A Multi-Person Video Dataset of Spatio-Temporally Localized Sports Actions☆123Updated last year
- A Unified Toolbox for Object Perception & Application☆161Updated last year
- Awesome Online Action Detection☆59Updated 3 months ago
- Official Implementation of our WACV2023 paper: “Holistic Interaction Transformer Network for Action Detection”☆67Updated 3 months ago
- A simple but efficient transformer model for video action recognition☆58Updated 2 years ago
- [CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv…☆126Updated last year
- [ICCV'21] Implementation of "Watch Only Once: An End-to-End Video Action Detection Framework"☆44Updated 3 years ago
- [CVPR 2022] An Empirical Study of End-to-end Temporal Action Detection☆84Updated 2 years ago
- [ICCV 2023] Efficient Video Action Detection with Token Dropout and Context Refinement☆33Updated last year
- [TIP 2022] End-to-end Temporal Action Detection with Transformer☆152Updated 2 years ago
- Actor-agnostic Multi-label Action Recognition with Multi-modal Query [ICCVW '23]☆24Updated last year
- download AVA dataset☆21Updated last year
- ☆113Updated last year
- Code for our ICCV 2021 Paper "OadTR: Online Action Detection with Transformers".☆90Updated last year
- Official PyTorch implementation of UniHCP☆157Updated last year
- [CVPR2023] Code for the paper, TriDet: Temporal Action Detection with Relative Boundary Modeling☆183Updated last year
- [NeurIPS 2022] PointTAD: Multi-Label Temporal Action Detection with Learnable Query Points☆44Updated last year
- OVTrack: Open-Vocabulary Multiple Object Tracking [CVPR 2023]☆102Updated 6 months ago
- BEAR: a new BEnchmark on video Action Recognition☆43Updated last year
- ☆58Updated last year
- [CVPR 2022 Oral] TubeDETR: Spatio-Temporal Video Grounding with Transformers☆179Updated last year
- An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning"☆88Updated 7 months ago
- Utilities for the human-object interaction detection dataset HICO-DET☆58Updated last year
- Code for ECCV2022 "Real-time Online Video Detection with Temporal Smoothing Transformers"☆107Updated last year
- [ICCV 2023] Official implementation of Memory-and-Anticipation Transformer for Online Action Understanding☆45Updated last year
- Code for "Contextual Instance Decoupling for Robust Multi-Person Pose Estimation", CVPR 2022 Oral☆50Updated 2 years ago