jolin830 / SlowFast-Meet-ViT
We have implemented Track #1 for ICME 2024: Spatial Action Localization on the Chaotic World dataset. Our mAP on the validation set reaches 26.62%; when the officially provided chaos_test_1fps.csv is used directly as the object detection results, the mAP reaches 42.28%.
☆12Updated last year
Alternatives and similar repositories for SlowFast-Meet-ViT
Users who are interested in SlowFast-Meet-ViT are comparing it to the libraries listed below.
- [ICCV 2023] Efficient Video Action Detection with Token Dropout and Context Refinement☆38Updated 2 years ago
- You Only Watch One Frame for Online Spatio-Temporal Action Detection☆36Updated 2 years ago
- Official Implementation of our WACV2023 paper: “Holistic Interaction Transformer Network for Action Detection”☆69Updated last year
- [CVPR 2023] STMixer: A One-Stage Sparse Action Detector☆63Updated 2 years ago
- Actor-agnostic Multi-label Action Recognition with Multi-modal Query [ICCVW '23]☆23Updated 2 years ago
- ☆112Updated 3 years ago
- Code and model for the AI City Challenge (CVPR 2022) Track 3 Action Detection (Naturalistic Driving Action Recognition)☆28Updated 2 years ago
- ☆109Updated 3 months ago
- [NeurIPS 2022 Spotlight] VideoMAE for Action Detection☆69Updated 2 years ago
- [ACMMM 2024] Implementation of the paper “Multi-Modality Co-Learning for Efficient Skeleton-based Action Recognition”.☆45Updated 9 months ago
- download AVA dataset☆22Updated 2 years ago
- PyTorch implementations of common action recognition models☆34Updated 5 years ago
- The second generation of YOWO action detector.☆273Updated last year
- Tiny Kinetics-400 for test☆95Updated last year
- Awesome Online Action Detection☆71Updated 11 months ago
- YOLO-POSE was used for key point detection, Bytetrack for tracking, and STGCN for fall and other behavior recognition☆50Updated 2 years ago
- A code support using OpenCV, Yolo, SimCC, SMPLX, Open3D, FBX-SDK, Blender and Maya.☆42Updated last year
- Official implementation of "Holmes-VAD: Towards Unbiased and Explainable Video Anomaly Detection via Multi-modal LLM"☆149Updated 9 months ago
- This is an official implementation of TubeR: Tubelet Transformer for Video Action Detection☆90Updated 2 years ago
- ☆64Updated 3 years ago
- A PyTorch implementation of SlowFast based on ICCV 2019 paper "SlowFast Networks for Video Recognition"☆14Updated 4 years ago
- Code for "TSGCNeXt: Dynamic-Static Multi-Graph Convolution for Efficient Skeleton-Based Action Recognition with Long-term Learning Potent…☆38Updated 4 months ago
- Some scripts on generating homemade AVA format datasets☆24Updated 3 years ago
- [ICLR 2023] Official implementation of the paper "Explicit Box Detection Unifies End-to-End Multi-Person Pose Estimation"☆187Updated 2 years ago
- Code of paper "A Video Dataset for Falling Object Detection around Buildings" https://arxiv.org/abs/2408.05750☆16Updated 6 months ago
- Custom ava dataset, Multi-Person Video Dataset Annotation Method of Spatio-Temporally Actions☆132Updated 3 years ago
- [OpenPAR] An open-source framework for Pedestrian Attribute Recognition, based on PyTorch☆170Updated last month
- Efficient dual attention SlowFast networks for video action recognition☆24Updated 3 years ago
- ☆20Updated 2 years ago
- [ICCVW 2023] Interaction-Aware Prompting for Zero-Shot Spatio-Temporal Action Detection☆20Updated last year