jolin830 / SlowFast-Meet-ViT
We have implemented Track # 1 for ICME 2024: Spatial Action Localization on Chaotic World dataset. Our mAP on the validation set reaches 26.62%, and if we directly use officially provided chaos_test_1fps.csv as the results of object detection, the mAP reaches 42.28%.
☆13Updated 5 months ago
Alternatives and similar repositories for SlowFast-Meet-ViT:
Users that are interested in SlowFast-Meet-ViT are comparing it to the libraries listed below
- Actor-agnostic Multi-label Action Recognition with Multi-modal Query [ICCVW '23]☆24Updated last year
- Code and model for the AI City Challenge (CVPR 2022) Track 3 Action Detection (Naturalistic Driving Action Recognition)☆28Updated last year
- You Only Watch One Frame for Online Spatio-Temporal Action Detection☆33Updated last year
- A simple but efficient transformer model for video action recognition☆58Updated 2 years ago
- [AAAI 2024 Oral] M2CLIP: A Multimodal, Multi-Task Adapting Framework for Video Action Recognition☆59Updated 4 months ago
- Code for "TSGCNeXt: Dynamic-Static Multi-Graph Convolution for Efficient Skeleton-Based Action Recognition with Long-term Learning Potent…☆34Updated last year
- [CVPR 2023] STMixer: A One-Stage Sparse Action Detector☆56Updated last year
- Official Implementation of our WACV2023 paper: “Holistic Interaction Transformer Network for Action Detection”☆66Updated 3 months ago
- [TCSVT 2024] Implementation of the paper "SiT-MLP: A Simple MLP with Point-wise Topology Feature Learning for Skeleton-based Action Recog…☆21Updated last year
- [NeurIPS 2022 Spotlight] VideoMAE for Action Detection☆61Updated 2 years ago
- Awesome Online Action Detection☆59Updated 3 months ago
- [ICCV 2023] Efficient Video Action Detection with Token Dropout and Context Refinement☆33Updated last year
- ☆20Updated 2 years ago
- [CVPR 2024] Code and models for pi-ViT, a video transformer for understanding activities of daily living☆20Updated 2 months ago
- CV701 Assignment on Pose Estimation☆17Updated 5 months ago
- A Large RGB-D Video Dataset for Long-Distance Continuous Gesture Recognition☆25Updated 2 years ago
- A PyTorch implementation of SlowFast based on ICCV 2019 paper "SlowFast Networks for Video Recognition"☆13Updated 3 years ago
- Codebase for "Every Shot Counts: Using Exemplars for Repetition Counting in Videos"☆25Updated 4 months ago
- ☆28Updated 2 years ago
- [CVPR 2024] Adapting Short-Term Transformers for Action Detection in Untrimmed Videos☆11Updated 10 months ago
- BasicTAD: an Astounding RGB-Only Baselinefor Temporal Action Detection☆50Updated last year
- ☆9Updated 2 years ago
- ☆17Updated 6 months ago
- [ICCVW 2023] Interaction-Aware Prompting for Zero-Shot Spatio-Temporal Action Detection☆20Updated last year
- [ACM MM 2023] Lightweight Super-Resolution Head for Human Pose Estimation☆38Updated last year
- [ACMMM 2023] Skeleton-MixFormer: Multivariate Topology Representation for Skeleton-based Action Recognition☆21Updated last year
- [TPAMI2024 / ICME2023] Codes for my paper "Body-Part Joint Detection and Association via Extended Object Representation"☆38Updated last year
- Video Feature Enhancement with PyTorch☆28Updated 4 months ago
- [NeurIPS 2023] Official implementation of the paper "CAST: Cross-Attention in Space and Time for Video Action Recognition"☆50Updated last year
- PoseRAC: Pose Saliency Transformer for Repetitive Action Counting☆15Updated 2 years ago