MCG-NJU / STMixerView external linksLinks
[CVPR 2023] STMixer: A One-Stage Sparse Action Detector
☆63May 18, 2023Updated 2 years ago
Alternatives and similar repositories for STMixer
Users that are interested in STMixer are comparing it to the libraries listed below
Sorting:
- [ICCV 2023] Efficient Video Action Detection with Token Dropout and Context Refinement☆38Sep 27, 2023Updated 2 years ago
- We have implemented Track # 1 for ICME 2024: Spatial Action Localization on Chaotic World dataset. Our mAP on the validation set reaches …☆12Nov 11, 2024Updated last year
- [ICCV 2021] MultiSports: A Multi-Person Video Dataset of Spatio-Temporally Localized Sports Actions☆133Aug 4, 2023Updated 2 years ago
- [ICCV 2023] Deep Equilibrium Object Detection☆27Jun 18, 2025Updated 7 months ago
- [TIP] APP-Net: Auxiliary-point-based Push and Pull Operations for Efficient Point Cloud Recognition☆13May 15, 2023Updated 2 years ago
- [CVPR 2023 Hightlight] PDPP: Projected Diffusion for Procedure Planning in Instructional Videos☆33Aug 30, 2023Updated 2 years ago
- [NeurIPS 2022 Spotlight] VideoMAE for Action Detection☆69Feb 3, 2023Updated 3 years ago
- [ICCVW 2023] Interaction-Aware Prompting for Zero-Shot Spatio-Temporal Action Detection☆21Feb 22, 2024Updated last year
- [ECCV 2020] Actions as Moving Points☆270Dec 19, 2020Updated 5 years ago
- [NeurIPS 2023] MixFormerV2: Efficient Fully Transformer Tracking☆201Apr 20, 2024Updated last year
- ☆15Jan 25, 2025Updated last year
- LSTC: Boosting Atomic Action Detection with Long-Short-Term Context☆10Sep 1, 2022Updated 3 years ago
- Hopenet: deep head pose estimator on ncnn☆10Jun 18, 2020Updated 5 years ago
- [ECCV 2022] Joint-Modal Label Denoising for Weakly-Supervised Audio-Visual Video Parsing☆27Jul 15, 2022Updated 3 years ago
- [ICCV 2023] SportsMOT: A Large Multi-Object Tracking Dataset in Multiple Sports Scenes☆195Jul 24, 2023Updated 2 years ago
- [CVPR 2022] Task-specific Inconsistency Alignment for Domain Adaptive Object Detection☆40Jul 20, 2022Updated 3 years ago
- BasicTAD: an Astounding RGB-Only Baselinefor Temporal Action Detection☆52Jun 10, 2023Updated 2 years ago
- [AAAI 2023 Oral] CoMAE: Single Model Hybrid Pre-training on Small-Scale RGB-D Datasets☆38Aug 20, 2024Updated last year
- Python3 API Darknet☆14Dec 7, 2023Updated 2 years ago
- Base on retinaface and centerface modefied. frame work depend on pytorch.☆31Jul 23, 2020Updated 5 years ago
- ☆20Jan 29, 2023Updated 3 years ago
- The second generation of YOWO action detector.☆276May 9, 2024Updated last year
- This repository is a fork of https://github.com/joslefaure/HIT customized for the AVA dataset☆17Jun 17, 2023Updated 2 years ago
- [NeurIPS 2022] PointTAD: Multi-Label Temporal Action Detection with Learnable Query Points☆46Nov 24, 2023Updated 2 years ago
- [CVPR 2024] Sparse Global Matching for Video Frame Interpolation with Large Motion☆78Jul 4, 2024Updated last year
- SCRFD face detection based on MNN inference framework☆17Sep 22, 2021Updated 4 years ago
- STEP: Spatio-Temporal Progressive Learning for Video Action Detection. CVPR'19 (Oral)☆252Oct 19, 2019Updated 6 years ago
- "Object-Region Video Transformers”, Herzig et al., CVPR 2022☆50Jul 6, 2022Updated 3 years ago
- This is a PyTorch implementation of "VirFace: Enhancing Face Recognition via Unlabeled Shallow Data" (CVPR 2021).☆22Sep 30, 2022Updated 3 years ago
- [CVPR 2023] LinK: Linear Kernel for LiDAR-based 3D Perception☆98Jul 27, 2024Updated last year
- A Fine-grained Benchmark for Video Captioning and Retrieval☆26Jul 16, 2025Updated 6 months ago
- [CVIU] Fully Convolutional Online Tracking☆92Nov 29, 2020Updated 5 years ago
- A sample code for Lightweight Face Recognition competition ICCV2019☆24Nov 14, 2019Updated 6 years ago
- Implementation of "Spatio-Temporal Deformable Attention Network for Video Deblurring". (Zhang et al., ECCV 2022)☆60Jan 2, 2023Updated 3 years ago
- [ICCV 2021] Self Supervision to Distillation for Long-Tailed Visual Recognition☆21Feb 9, 2022Updated 4 years ago
- ☆24Mar 9, 2021Updated 4 years ago
- [NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training☆1,675Dec 8, 2023Updated 2 years ago
- [AAAI2024] Code Release of CLIM: Contrastive Language-Image Mosaic for Region Representation☆30Feb 4, 2024Updated 2 years ago
- DropIT: Dropping Intermediate Tensors for Memory-Efficient DNN Training (ICLR 2023)☆32Apr 8, 2023Updated 2 years ago