innat / VideoSwinLinks
Keras 3 Implementation of Video Swin Transformers for 3D Video Modeling
☆33Updated 5 months ago
Alternatives and similar repositories for VideoSwin
Users that are interested in VideoSwin are comparing it to the libraries listed below
Sorting:
- [NeurIPS'22] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training☆21Updated last year
- Easiest way of fine-tuning HuggingFace video classification models☆141Updated 2 years ago
- Official repository for "Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition" [ICCV 2023]☆100Updated last year
- PyTorch implementation of a collections of scalable Video Transformer Benchmarks.☆298Updated 3 years ago
- An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning"☆89Updated 8 months ago
- Video Swin Transformer - PyTorch☆254Updated 3 years ago
- Video classification exercise using UCF101 data for training an early-fusion and SlowFast architecture model, both using the PyTorch Ligh…☆15Updated 3 years ago
- Exploring the applicability of Grad-CAM for explanation in video based dataset☆32Updated last year
- Action recognition tutorial using UCF-101 dataset.☆26Updated 3 years ago
- An implementation of the X3D video recognition architecture in TensorFlow/Keras☆15Updated 4 years ago
- [CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv…☆126Updated 2 years ago
- ☆67Updated 4 years ago
- Keras (TensorFlow v2) reimplementation of Swin Transformer V1 and V2 models☆22Updated 9 months ago
- [ICCV2023] UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer☆316Updated last year
- menovideo: pytorch library for video action recognition and video understanding☆29Updated 3 years ago
- Code Release for MViTv2 on Image Recognition.☆427Updated 6 months ago
- The repository collects many various multi-modal transformer architectures, including image transformer, video transformer, image-languag…☆229Updated 2 years ago
- Implementation of brand new video augmentation strategy for video action recognition with 3D CNN☆27Updated 3 years ago
- Implementation of ViViT: A Video Vision Transformer☆537Updated 3 years ago
- [CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking☆642Updated 8 months ago
- The official implementation of our paper "Sports Video Analysis on Large-scale Data" (https://arxiv.org/abs/2208.04897)☆71Updated 2 years ago
- This is the pytorch implementation of some representative action recognition approaches including I3D, S3D, TSN and TAM.☆249Updated 3 years ago
- Efficient violence detection in surveillance videos using Human Skeletons and Motion Estimation☆49Updated last year
- [CVPR 2023] Official repository of paper titled "Fine-tuned CLIP models are efficient video learners".☆276Updated last year
- I3D features extractor with resnet50 backbone☆73Updated 2 years ago
- A Keras implementation of hybrid efficientnet swin transformer model.☆34Updated last year
- This is an official implementation for "Attention-based Residual Autoencoder for Video Anomaly Detection".☆121Updated 4 months ago
- Official Implementation of our WACV2023 paper: “Holistic Interaction Transformer Network for Action Detection”☆70Updated 4 months ago
- This notebook is designed to plot the attention maps of a vision transformer trained on MNIST digits.☆36Updated last week
- Implementation of Deep Orthogonal Fusion of Local and Global Features in TensorFlow 2☆26Updated last year