jolin830 / SlowFast-Meet-ViTLinks
We have implemented Track # 1 for ICME 2024: Spatial Action Localization on Chaotic World dataset. Our mAP on the validation set reaches 26.62%, and if we directly use officially provided chaos_test_1fps.csv as the results of object detection, the mAP reaches 42.28%.
☆13Updated 7 months ago
Alternatives and similar repositories for SlowFast-Meet-ViT
Users that are interested in SlowFast-Meet-ViT are comparing it to the libraries listed below
Sorting:
- Code and model for the AI City Challenge (CVPR 2022) Track 3 Action Detection (Naturalistic Driving Action Recognition)☆28Updated last year
- You Only Watch One Frame for Online Spatio-Temporal Action Detection☆34Updated 2 years ago
- A PyTorch implementation of SlowFast based on ICCV 2019 paper "SlowFast Networks for Video Recognition"☆13Updated 3 years ago
- Actor-agnostic Multi-label Action Recognition with Multi-modal Query [ICCVW '23]☆24Updated last year
- Official Implementation of our WACV2023 paper: “Holistic Interaction Transformer Network for Action Detection”☆69Updated 5 months ago
- [CVPR 2023] STMixer: A One-Stage Sparse Action Detector☆59Updated 2 years ago
- A simple but efficient transformer model for video action recognition☆59Updated 2 years ago
- [NeurIPS 2022 Spotlight] VideoMAE for Action Detection☆65Updated 2 years ago
- [ICCV 2023] Efficient Video Action Detection with Token Dropout and Context Refinement☆34Updated last year
- Code for "TSGCNeXt: Dynamic-Static Multi-Graph Convolution for Efficient Skeleton-Based Action Recognition with Long-term Learning Potent…☆35Updated last year
- [ICCVW 2023] Interaction-Aware Prompting for Zero-Shot Spatio-Temporal Action Detection☆20Updated last year
- [NeurIPS 2022] PointTAD: Multi-Label Temporal Action Detection with Learnable Query Points☆45Updated last year
- ☆20Updated 2 years ago
- ☆9Updated 2 years ago
- [AAAI 2024 Oral] M2CLIP: A Multimodal, Multi-Task Adapting Framework for Video Action Recognition☆63Updated 6 months ago
- CV701 Assignment on Pose Estimation☆18Updated 7 months ago
- Awesome Online Action Detection☆62Updated 5 months ago
- [CVPR 2024] Code and models for pi-ViT, a video transformer for understanding activities of daily living☆24Updated 4 months ago
- Frame Flexible Network (CVPR2023)☆56Updated 2 years ago
- This is an official implementation of TubeR: Tubelet Transformer for Video Action Detection☆81Updated 2 years ago
- [ACMMM 2023] Skeleton-MixFormer: Multivariate Topology Representation for Skeleton-based Action Recognition☆22Updated last year
- BasicTAD: an Astounding RGB-Only Baselinefor Temporal Action Detection☆51Updated 2 years ago
- Improving Mamaba performance on Video Understanding task☆40Updated 8 months ago
- [TCSVT 2024] Implementation of the paper "SiT-MLP: A Simple MLP with Point-wise Topology Feature Learning for Skeleton-based Action Recog…☆20Updated last year
- Includes the VideoCount dataset and CountVid code for the paper Open-World Object Counting in Videos.☆21Updated last week
- download AVA dataset☆22Updated last year
- This repository is a fork of https://github.com/joslefaure/HIT customized for the AVA dataset☆16Updated 2 years ago
- Official Repository of 'Multi-Scale Temporal Mamba for Efficient Temporal Action Detection'☆20Updated last month
- update code for pytorch1.4☆11Updated 3 years ago
- A Large RGB-D Video Dataset for Long-Distance Continuous Gesture Recognition☆26Updated 2 years ago