innat / VideoSwin
Keras 3 Implementation of Video Swin Transformers for 3D Video Modeling
☆29 · Updated last month
Alternatives and similar repositories for VideoSwin:
Users who are interested in VideoSwin are comparing it to the repositories listed below.
- Official repository for "Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition" [ICCV 2023] ☆93 · Updated 8 months ago
- Easiest way of fine-tuning HuggingFace video classification models ☆136 · Updated last year
- [NeurIPS'22] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training ☆16 · Updated last year
- Self-Supervised Learning in PyTorch ☆133 · Updated 10 months ago
- An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning" ☆89 · Updated 4 months ago
- A Keras implementation of a hybrid EfficientNet Swin Transformer model. ☆32 · Updated last year
- [BMVC 2022] Official repository for "How to Train Vision Transformer on Small-scale Datasets?" ☆144 · Updated last year
- PyTorch implementation of a collection of scalable Video Transformer Benchmarks. ☆288 · Updated 2 years ago
- Code Release for MViTv2 on Image Recognition. ☆416 · Updated last month
- Includes PyTorch -> Keras model porting code for ConvNeXt family of models with fine-tuning and inference notebooks. ☆100 · Updated 2 years ago
- A summarization of Transformer-based architectures for CV tasks, including image classification, object detection, segmentation, and Few-… ☆107 · Updated 2 years ago
- ☆62 · Updated 3 months ago
- [ICCV2023] UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer ☆299 · Updated 9 months ago
- Video classification exercise using UCF101 data for training an early-fusion and SlowFast architecture model, both using the PyTorch Ligh… ☆14 · Updated 3 years ago
- Exploring the applicability of Grad-CAM for explanation in video-based datasets ☆28 · Updated last year
- This repository demonstrates how to use the TensorFlow-based SegFormer model in the 🤗 transformers package. ☆30 · Updated 2 years ago
- PyTorch and TensorFlow/Keras image models with automatic weight conversions and equal API/implementations - Vision Transformer (ViT), Res… ☆36 · Updated last year
- Code Release for MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition, CVPR 2022 ☆148 · Updated 2 years ago
- ☆14 · Updated 3 years ago
- "Tail-Aware Sperm Analysis for Transparent Tracking of Spermatozoa" Official Implementation ☆10 · Updated 6 months ago
- ☆66 · Updated 3 years ago
- An implementation of the X3D video recognition architecture in TensorFlow/Keras ☆15 · Updated 3 years ago
- ☆30 · Updated last year
- Video Swin Transformer - PyTorch ☆237 · Updated 3 years ago
- Keras (TensorFlow v2) reimplementation of Swin Transformer V1 and V2 models ☆21 · Updated 5 months ago
- Official code repository for paper: "ExPLoRA: Parameter-Efficient Extended Pre-training to Adapt Vision Transformers under Domain Shifts" ☆28 · Updated 3 months ago
- Fine-tuning OpenAI CLIP Model for Image Search on medical images ☆75 · Updated 2 years ago
- [ICCV'23] Official repository of paper SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applicatio… ☆264 · Updated last year
- Repository accompanying the "Sign Pose-based Transformer for Word-level Sign Language Recognition" paper ☆82 · Updated last year
- 2D discrete Wavelet Transform for Image Classification and Segmentation ☆79 · Updated 2 weeks ago