MCG-NJU / STMixer
[CVPR 2023] STMixer: A One-Stage Sparse Action Detector
☆49 · Updated last year
Related projects:
- This is an official implementation of TubeR: Tubelet Transformer for Video Action Detection ☆67 · Updated last year
- BasicTAD: an Astounding RGB-Only Baseline for Temporal Action Detection ☆49 · Updated last year
- [NeurIPS 2022 Spotlight] VideoMAE for Action Detection ☆48 · Updated last year
- [ICCVW 2023] Interaction-Aware Prompting for Zero-Shot Spatio-Temporal Action Detection ☆17 · Updated 6 months ago
- Awesome Online Action Detection ☆43 · Updated last month
- [CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv… ☆101 · Updated last year
- [CVPR 2022] An Empirical Study of End-to-end Temporal Action Detection ☆81 · Updated last year
- UniMD: Towards Unifying Moment retrieval and temporal action Detection ☆32 · Updated 2 months ago
- Actor-agnostic Multi-label Action Recognition with Multi-modal Query [ICCVW '23] ☆20 · Updated 10 months ago
- Utilities for the human-object interaction detection dataset HICO-DET ☆50 · Updated 9 months ago
- [ICCV'23] Official PyTorch implementation for paper "Exploring Predicate Visual Context in Detecting Human-Object Interactions" ☆62 · Updated 2 months ago
- A simple but efficient transformer model for video action recognition ☆52 · Updated last year
- CVPR 2023 Accepted Paper HOICLIP: Efficient Knowledge Transfer for HOI Detection with Vision-Language Models ☆52 · Updated 6 months ago
- TF-CLIP: Learning Text-Free CLIP for Video-Based Person Re-identification (AAAI2024) ☆39 · Updated 5 months ago
- OVTrack: Open-Vocabulary Multiple Object Tracking [CVPR 2023] ☆79 · Updated 7 months ago
- Code for our ICCV 2021 Paper "OadTR: Online Action Detection with Transformers". ☆88 · Updated last year
- BEAR: a new BEnchmark on video Action Recognition ☆40 · Updated 4 months ago
- [ICCV 2023] Official implementation of Memory-and-Anticipation Transformer for Online Action Understanding ☆43 · Updated 11 months ago
- [ICCV 2021] MultiSports: A Multi-Person Video Dataset of Spatio-Temporally Localized Sports Actions ☆106 · Updated last year
- [CVPR2023] Code for the paper, TriDet: Temporal Action Detection with Relative Boundary Modeling ☆160 · Updated 8 months ago
- [ICCV'21] Implementation of "Watch Only Once: An End-to-End Video Action Detection Framework" ☆44 · Updated 2 years ago
- Official implementation of "ST-HOI: A Spatial-Temporal Baseline for Human-Object Interaction Detection in Videos" (ACM ICMRW 2021) ☆50 · Updated 2 years ago
- [NeurIPS 2022] PointTAD: Multi-Label Temporal Action Detection with Learnable Query Points ☆38 · Updated 9 months ago
- [ICCV 2023] Efficient Video Action Detection with Token Dropout and Context Refinement ☆24 · Updated 11 months ago
- A Unified Toolbox for Object Perception & Application ☆148 · Updated 10 months ago
- An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning" ☆83 · Updated last week
- (TPAMI2024) Official implementation of Paper "A Versatile Framework for Multi-scene Person Re-identification" ☆31 · Updated 5 months ago