innat / VideoSwin
Keras 3 Implementation of Video Swin Transformers for 3D Video Modeling
☆32 Updated 2 months ago
Alternatives and similar repositories for VideoSwin
Users who are interested in VideoSwin are comparing it to the repositories listed below.
- [NeurIPS'22] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training ☆22 Updated last year
- Official repository for "Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition" [ICCV 2023] ☆101 Updated last year
- Easiest way of fine-tuning HuggingFace video classification models ☆147 Updated 2 years ago
- Action recognition tutorial using UCF-101 dataset. ☆29 Updated 4 years ago
- Official implementation of "Delving into CLIP latent space for Video Anomaly Recognition", CVIU 2024 ☆92 Updated 3 months ago
- Video Swin Transformer - PyTorch ☆266 Updated 3 years ago
- PyTorch implementation of a collection of scalable Video Transformer Benchmarks. ☆305 Updated 3 years ago
- Video classification exercise using UCF101 data for training an early-fusion and SlowFast architecture model, both using the PyTorch Ligh… ☆15 Updated 3 years ago
- An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning" ☆93 Updated last year
- Awesome Video Anomaly Detection ☆89 Updated 4 months ago
- Library to perform image and video self-supervised learning. ☆53 Updated last year
- A Keras implementation of hybrid efficientnet swin transformer model. ☆34 Updated 2 years ago
- Code Release for MViTv2 on Image Recognition. ☆451 Updated last year
- Awesome Fine-Grained Image Classification ☆100 Updated 2 months ago
- menovideo: pytorch library for video action recognition and video understanding ☆29 Updated 4 years ago
- This repository demonstrates how to use the TensorFlow-based SegFormer model in the 🤗 transformers package. ☆30 Updated 3 years ago
- The repository collects various multi-modal transformer architectures, including image transformer, video transformer, image-languag… ☆233 Updated 3 years ago
- Self-Supervised Learning in PyTorch ☆143 Updated last year
- This is an official implementation for "Attention-based Residual Autoencoder for Video Anomaly Detection". ☆126 Updated 11 months ago
- Normalizing Flows for Human Pose Anomaly Detection [ICCV 2023] ☆95 Updated 2 years ago
- Visualizing the learned space-time attention using Attention Rollout ☆40 Updated 3 years ago
- PyTorch and TensorFlow/Keras image models with automatic weight conversions and equal API/implementations - Vision Transformer (ViT), Res… ☆42 Updated 2 years ago
- Implementation of a brand new video augmentation strategy for video action recognition with 3D CNN ☆27 Updated 4 years ago
- Implementation of Deep Orthogonal Fusion of Local and Global Features in TensorFlow 2 ☆26 Updated 2 years ago
- ☆31 Updated 8 months ago
- Keras (TensorFlow v2) reimplementation of Swin Transformer V1 and V2 models ☆24 Updated last year
- Easy to use class balanced cross entropy and focal loss implementation for Pytorch ☆99 Updated last year
- [ICCV2023] UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer ☆336 Updated last year
- A modular PyTorch library for vision transformer models ☆164 Updated 2 years ago
- jakubmicorek / MULDE-Multiscale-Log-Density-Estimation-via-Denoising-Score-Matching-for-Video-Anomaly-Detection ☆51 Updated last year