innat / VideoSwinLinks
Keras 3 Implementation of Video Swin Transformers for 3D Video Modeling
☆33Updated 8 months ago
Alternatives and similar repositories for VideoSwin
Users that are interested in VideoSwin are comparing it to the libraries listed below
Sorting:
- Official repository for "Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition" [ICCV 2023]☆101Updated last year
- Easiest way of fine-tuning HuggingFace video classification models☆142Updated 2 years ago
- Vision Transformers for image classification, image segmentation, and object detection.☆57Updated 10 months ago
- [NeurIPS'22] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training☆22Updated last year
- Action recognition tutorial using UCF-101 dataset.☆28Updated 3 years ago
- Video classification exercise using UCF101 data for training an early-fusion and SlowFast architecture model, both using the PyTorch Ligh…☆15Updated 3 years ago
- PyTorch implementation of a collections of scalable Video Transformer Benchmarks.☆303Updated 3 years ago
- An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning"☆92Updated 11 months ago
- Code Release for MViTv2 on Image Recognition.☆438Updated 9 months ago
- A Detection Toolbox for Tensorflow2☆56Updated 2 years ago
- Awesome Fine-Grained Image Classification☆86Updated last year
- The repository collects many various multi-modal transformer architectures, including image transformer, video transformer, image-languag…☆230Updated 3 years ago
- ☆76Updated 2 months ago
- Video Swin Transformer - PyTorch☆263Updated 3 years ago
- my codes for learning attention mechanism☆50Updated 5 years ago
- non-official NoisyNN Implemnentation☆50Updated last year
- [BMVC 2022] Official repository for "How to Train Vision Transformer on Small-scale Datasets?"☆161Updated last year
- Official code repository for ICML 2025 paper: "ExPLoRA: Parameter-Efficient Extended Pre-training to Adapt Vision Transformers under Doma…☆43Updated last week
- Visualizing the learned space-time attention using Attention Rollout☆36Updated 3 years ago
- Basic implementation of ResNet 50, 101, 152 in PyTorch☆111Updated 3 years ago
- Fine-tune Facebook's DETR (DEtection TRansformer) on Colaboratory.☆151Updated 2 years ago
- Self-Supervised Learning in PyTorch☆138Updated last year
- ModelSoups for Tensorflow2 and Torch☆49Updated 3 years ago
- This folder of code contains code and notebooks to supplement the "Vision Transformers Explained" series published on Towards Data Scienc…☆88Updated last year
- [ICCV25] Official Implementation of LeGrad☆78Updated 10 months ago
- xLSTM as Generic Vision Backbone☆485Updated 10 months ago
- A Keras implementation of hybrid efficientnet swin transformer model.☆34Updated last year
- [EMNLP'23] ClimateGPT: a specialized LLM for conversations related to Climate Change and Sustainability topics in both English and Arabi…☆79Updated 11 months ago
- Implementation of ViViT: A Video Vision Transformer☆547Updated 4 years ago
- Exploring the applicability of Grad-CAM for explanation in video based dataset☆32Updated 2 years ago