Implementation of the paper Video Action Transformer Network
☆138Apr 5, 2021Updated 4 years ago
Alternatives and similar repositories for Video-Action-Transformer-Network-Pytorch-
Users that are interested in Video-Action-Transformer-Network-Pytorch- are comparing it to the libraries listed below
Sorting:
- An implementation of Video Transformer Network (VTN) approach for Action Recognition in TensorFlow.☆55Sep 29, 2020Updated 5 years ago
- Code repository for the paper: 'Something-Else: Compositional Action Recognition with Spatial-Temporal Interaction Networks'☆148Aug 25, 2023Updated 2 years ago
- Code of the STAGE module for video action detection☆48May 25, 2021Updated 4 years ago
- [CVPR 2020] Temporal Pyramid Network for Action Recognition☆392Jan 12, 2021Updated 5 years ago
- ☆16Jan 6, 2025Updated last year
- STEP: Spatio-Temporal Progressive Learning for Video Action Detection. CVPR'19 (Oral)☆253Oct 19, 2019Updated 6 years ago
- Video Transformer Network☆41Jun 8, 2021Updated 4 years ago
- Learning Spatiotemporal Features via Video and Text Pair Discrimination☆60Jan 20, 2021Updated 5 years ago
- "Object-Region Video Transformers”, Herzig et al., CVPR 2022☆50Jul 6, 2022Updated 3 years ago
- A video database bridging human actions and human-object relationships☆157Jun 30, 2020Updated 5 years ago
- Codebase for "Revisiting spatio-temporal layouts for compositional action recognition" (Oral at BMVC 2021).☆27Apr 3, 2022Updated 3 years ago
- Spatio-Temporal Action Localization System☆425May 21, 2022Updated 3 years ago
- Action-Localization, Atomic Visual Actions (AVA) Dataset☆25Sep 18, 2019Updated 6 years ago
- Official PyTorch implementation of ACTION-Net: Multipath Excitation for Action Recognition (CVPR'21)☆209Apr 19, 2021Updated 4 years ago
- Code for Enhancing Self-supervised Video Representation Learning via Multi-level Feature Optimization.☆10Sep 28, 2021Updated 4 years ago
- PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.☆7,314Updated this week
- Implementation of "Encoraging LSTMs to Anticipate Actions Very Early", ICCV 2017☆19Mar 25, 2018Updated 7 years ago
- PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.☆87Sep 13, 2021Updated 4 years ago
- Testing code for few-shot action recognition☆11Jan 12, 2021Updated 5 years ago
- Pytorch Implementation of Videos as Space-Time Region Graphs☆27May 30, 2025Updated 9 months ago
- Long-Term Feature Banks for Detailed Video Understanding☆384Aug 30, 2021Updated 4 years ago
- ☆69Apr 26, 2021Updated 4 years ago
- [ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding☆2,190Jul 11, 2024Updated last year
- Zero-shot video classification by end-to-end training of 3D convolutional neural networks☆150Jun 14, 2020Updated 5 years ago
- Context-aware RCNN: a Baseline for Action Detection in Videos☆51Oct 13, 2020Updated 5 years ago
- Extension of hLSTMat☆19Apr 15, 2021Updated 4 years ago
- An open-source toolbox for action understanding based on PyTorch☆1,875Apr 8, 2022Updated 3 years ago
- Character Grounding and Re-Identification in Story of Videos and Text Descriptions☆10Jan 17, 2021Updated 5 years ago
- I3D Nonlocal ResNets in Pytorch☆257Mar 26, 2022Updated 3 years ago
- This repository host the code for real-time action detection paper☆320Feb 23, 2021Updated 5 years ago
- ☆16Jul 13, 2016Updated 9 years ago
- Implementation for "Multilevel Language and Vision Integration for Text-to-Clip Retrieval"☆49Jan 21, 2019Updated 7 years ago
- ☆1,041Jun 28, 2020Updated 5 years ago
- Implementation of ViViT: A Video Vision Transformer☆557Jun 21, 2021Updated 4 years ago
- Implementation of QoMEX 2021 "Image Super-Resolution Quality Assessment: Structural Fidelity Versus Statistical Naturalness"☆17Sep 28, 2022Updated 3 years ago
- [CVPR 2022] Official repository of AdaFocusV2.☆91Dec 15, 2024Updated last year
- Code for : [Pattern Recognit. Lett. 2021] "Learn to cycle: Time-consistent feature discovery for action recognition" and [IJCNN 2021] "Mu…☆68Aug 31, 2022Updated 3 years ago
- Implementation of the paper Unsupervised Learning of Video Representations using LSTMs☆10Nov 24, 2017Updated 8 years ago
- Pytorch implementation of "Appearance-Preserving 3D Convolution for Video-based Person Re-identification"☆101Jul 17, 2020Updated 5 years ago