jolin830 / SlowFast-Meet-ViTLinks

We have implemented Track # 1 for ICME 2024: Spatial Action Localization on Chaotic World dataset. Our mAP on the validation set reaches 26.62%, and if we directly use officially provided chaos_test_1fps.csv as the results of object detection, the mAP reaches 42.28%.

☆13

Alternatives and similar repositories for SlowFast-Meet-ViT

Users that are interested in SlowFast-Meet-ViT are comparing it to the libraries listed below

Sorting:

yjh0410 / YOWOF
You Only Watch One Frame for Online Spatio-Temporal Action Detection
☆34Updated 2 years ago
MCG-NJU / EVAD
[ICCV 2023] Efficient Video Action Detection with Token Dropout and Context Refinement
☆36Updated last year
mondalanindya / MSQNet
Actor-agnostic Multi-label Action Recognition with Multi-modal Query [ICCVW '23]
☆24Updated last year
MCG-NJU / STMixer
[CVPR 2023] STMixer: A One-Stage Sparse Action Detector
☆60Updated 2 years ago
leftthomas / SlowFast
A PyTorch implementation of SlowFast based on ICCV 2019 paper "SlowFast Networks for Video Recognition"
☆13Updated 3 years ago
MartinXM / TPS
A simple but efficient transformer model for video action recognition
☆59Updated 2 years ago
joslefaure / HIT
Official Implementation of our WACV2023 paper: “Holistic Interaction Transformer Network for Action Detection”
☆69Updated 6 months ago
JunweiLiang / aicity_action
Code and model for the AI City Challenge (CVPR 2022) Track 3 Action Detection (Naturalistic Driving Action Recognition)
☆28Updated last year
zyayoung / ByteTrackInference
YOLOX Inference code for MOTRv2
☆18Updated 2 years ago
MCG-NJU / VideoMAE-Action-Detection
[NeurIPS 2022 Spotlight] VideoMAE for Action Detection
☆66Updated 2 years ago
webber2933 / iCLIP
[ICCVW 2023] Interaction-Aware Prompting for Zero-Shot Spatio-Temporal Action Detection
☆20Updated last year
yjh0410 / PyTorch_YOWO
☆107Updated 2 years ago
sallymmx / m2clip
[AAAI 2024 Oral] M2CLIP: A Multimodal, Multi-Task Adapting Framework for Video Action Recognition
☆63Updated 6 months ago
MCG-NJU / PointTAD
[NeurIPS 2022] PointTAD: Multi-Label Temporal Action Detection with Learnable Query Points
☆45Updated last year
guanxiongsun / vfe.pytorch
Video Feature Enhancement with PyTorch
☆31Updated 7 months ago
wangxiang1230 / Awesome-Online-Action-Detection
Awesome Online Action Detection
☆63Updated 5 months ago
vvhj / TSGCNeXt
Code for "TSGCNeXt: Dynamic-Static Multi-Graph Convolution for Efficient Skeleton-Based Action Recognition with Long-term Learning Potent…
☆37Updated 2 years ago
Event-AHU / VTF_PAR
[CVPR-2023 Workshop@NFVLR] Official PyTorch implementation of Learning CLIP Guided Visual-Text Fusion Transformer for Video-based Pedestr…
☆28Updated 3 months ago
hotfinda / VideoMambaPro
Improving Mamaba performance on Video Understanding task
☆39Updated 8 months ago
zhshj0110 / SiT-MLP
[TCSVT 2024] Implementation of the paper "SiT-MLP: A Simple MLP with Point-wise Topology Feature Learning for Skeleton-based Action Recog…
☆20Updated last year
amazon-science / tubelet-transformer
This is an official implementation of TubeR: Tubelet Transformer for Video Action Detection
☆81Updated 2 years ago
yjh0410 / AVA_Dataset
download AVA dataset
☆22Updated last year
thearkaprava / MS-Temba
Official Repository of 'Multi-Scale Temporal Mamba for Efficient Temporal Action Detection'
☆22Updated this week
George-Zhuang / NetTrack
Official code for NetTrack [CVPR 2024]
☆93Updated last year
alibaba-mmai-research / TAdaConv
[ICLR 2022] TAda! Temporally-Adaptive Convolutions for Video Understanding. This codebase provides solutions for video classification, vi…
☆239Updated last year
KHU-VLL / CAST
[NeurIPS 2023] Official implementation of the paper "CAST: Cross-Attention in Space and Time for Video Action Recognition"
☆52Updated last year
MCG-NJU / BasicTAD
BasicTAD: an Astounding RGB-Only Baselinefor Temporal Action Detection
☆51Updated 2 years ago
shengyuhao / DIVOTrack
A Novel Dataset and Baseline Method for Cross-View Multi-Object Tracking in DIVerse Open Scenes (IJCV 2024)
☆92Updated 11 months ago
realgump / MvMHAT
MvMHAT: Self-supervised Multi-view Multi-Human Association and Tracking (ACM MM 2021, Oral Paper)
☆48Updated 3 years ago
Nathan-Li123 / SMOTer
[ECCV 2024] Beyond MOT: Semantic Multi-Object Tracking
☆49Updated 7 months ago