jolin830 / SlowFast-Meet-ViTLinks
We have implemented Track # 1 for ICME 2024: Spatial Action Localization on Chaotic World dataset. Our mAP on the validation set reaches 26.62%, and if we directly use officially provided chaos_test_1fps.csv as the results of object detection, the mAP reaches 42.28%.
☆13Updated 6 months ago
Alternatives and similar repositories for SlowFast-Meet-ViT
Users that are interested in SlowFast-Meet-ViT are comparing it to the libraries listed below
Sorting:
- You Only Watch One Frame for Online Spatio-Temporal Action Detection☆33Updated 2 years ago
- Code and model for the AI City Challenge (CVPR 2022) Track 3 Action Detection (Naturalistic Driving Action Recognition)☆28Updated last year
- Official Implementation of our WACV2023 paper: “Holistic Interaction Transformer Network for Action Detection”☆70Updated 4 months ago
- [ICCV 2023] Efficient Video Action Detection with Token Dropout and Context Refinement☆32Updated last year
- Actor-agnostic Multi-label Action Recognition with Multi-modal Query [ICCVW '23]☆24Updated last year
- A simple but efficient transformer model for video action recognition☆58Updated 2 years ago
- A PyTorch implementation of SlowFast based on ICCV 2019 paper "SlowFast Networks for Video Recognition"☆13Updated 3 years ago
- [CVPR 2023] STMixer: A One-Stage Sparse Action Detector☆59Updated 2 years ago
- [NeurIPS 2022 Spotlight] VideoMAE for Action Detection☆65Updated 2 years ago
- [ICCVW 2023] Interaction-Aware Prompting for Zero-Shot Spatio-Temporal Action Detection☆20Updated last year
- Code for "TSGCNeXt: Dynamic-Static Multi-Graph Convolution for Efficient Skeleton-Based Action Recognition with Long-term Learning Potent…☆34Updated last year
- download AVA dataset☆22Updated last year
- [NeurIPS 2022] PointTAD: Multi-Label Temporal Action Detection with Learnable Query Points☆45Updated last year
- Awesome Online Action Detection☆62Updated 4 months ago
- ☆20Updated 2 years ago
- [AAAI 2024 Oral] M2CLIP: A Multimodal, Multi-Task Adapting Framework for Video Action Recognition☆61Updated 5 months ago
- BasicTAD: an Astounding RGB-Only Baselinefor Temporal Action Detection☆51Updated last year
- Video Feature Enhancement with PyTorch☆29Updated 6 months ago
- Code for "LocLLM: Exploiting Generalizable Human Keypoint Localization via Large Language Model", CVPR 2024 Highlight☆44Updated 11 months ago
- [TCSVT 2024] Implementation of the paper "SiT-MLP: A Simple MLP with Point-wise Topology Feature Learning for Skeleton-based Action Recog…☆21Updated last year
- [CVPR 2024] Adapting Short-Term Transformers for Action Detection in Untrimmed Videos☆11Updated 11 months ago
- A Large RGB-D Video Dataset for Long-Distance Continuous Gesture Recognition☆26Updated 2 years ago
- UniMD: Towards Unifying Moment retrieval and temporal action Detection☆48Updated 11 months ago
- [ACMMM 2024] Implementation of the paper “Multi-Modality Co-Learning for Efficient Skeleton-based Action Recognition“.☆42Updated 2 months ago
- CV701 Assignment on Pose Estimation☆17Updated 6 months ago
- [ACMMM 2023] Skeleton-MixFormer: Multivariate Topology Representation for Skeleton-based Action Recognition☆22Updated last year
- TF-CLIP: Learning Text-Free CLIP for Video-Based Person Re-identification (AAAI2024)☆50Updated last year
- [CVPR 2024] Code and models for pi-ViT, a video transformer for understanding activities of daily living☆22Updated 3 months ago
- This is an official implementation of TubeR: Tubelet Transformer for Video Action Detection☆81Updated 2 years ago
- [ECCV 2022] Official Pytorch Implementation of the paper : " Semi-Supervised Temporal Action Detection with Proposal-Free Masking "☆21Updated last year