innat / VideoSwinLinks
Keras 3 Implementation of Video Swin Transformers for 3D Video Modeling
☆32Updated 3 months ago
Alternatives and similar repositories for VideoSwin
Users that are interested in VideoSwin are comparing it to the libraries listed below
Sorting:
- Easiest way of fine-tuning HuggingFace video classification models☆147Updated 2 years ago
- Official repository for "Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition" [ICCV 2023]☆101Updated last year
- [NeurIPS'22] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training☆22Updated 2 years ago
- "Tail-Aware Sperm Analysis for Transparent Tracking of Spermatozoa" Official Implementation☆10Updated 2 weeks ago
- PyTorch implementation of a collections of scalable Video Transformer Benchmarks.☆305Updated 3 years ago
- Video Swin Transformer - PyTorch☆265Updated 4 years ago
- Action recognition tutorial using UCF-101 dataset.☆29Updated 4 years ago
- ☆48Updated 7 months ago
- This repository demonstrates how to use TensorFlow based SegFormer model in 🤗 transformers package.☆30Updated 3 years ago
- Code Release for MViTv2 on Image Recognition.☆450Updated last year
- An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning"☆94Updated last year
- Awesome Fine-Grained Image Classification☆101Updated 4 months ago
- Implementation of brand new video augmentation strategy for video action recognition with 3D CNN☆27Updated 4 years ago
- Library to perform image and video self-supervised learning.☆54Updated 2 weeks ago
- Implementation of Deep Orthogonal Fusion of Local and Global Features in TensorFlow 2☆26Updated 2 years ago
- Implementation of ViViT: A Video Vision Transformer☆556Updated 4 years ago
- 2nd Place Google - Isolated Sign Language Recognition☆47Updated 2 years ago
- Official implementation of "Delving into CLIP latent space for Video Anomaly Recognition", CVIU 2024☆93Updated 4 months ago
- LRCN approach for video regression that uses CNNs for visual input and LSTMs to process sequences of frame embeddings☆21Updated 5 years ago
- Vision Transformer Cookbook with Tensorflow☆343Updated 3 years ago
- [ICCV2023] UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer☆339Updated last year
- Self-Supervised Learning in PyTorch☆143Updated last year
- ☆14Updated 4 years ago
- Implementation of Swin Transformers in TensorFlow along with converted pre-trained models, code for off-the-shelf classification and fine…☆59Updated 3 years ago
- Awesome Video Anomaly Detection☆104Updated 5 months ago
- This is the official repo of paper accepted in AAAI 2023 Oral.☆91Updated 2 years ago
- A PyTorch implementation of "MetaFormer: A Unified Meta Framework for Fine-Grained Recognition". A reference PyTorch implementation of “C…☆243Updated 3 years ago
- [CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking☆748Updated last year
- Vision Transformers for image classification, image segmentation, and object detection.☆63Updated 3 months ago
- [CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv…☆134Updated 2 years ago