innat / VideoSwin
Keras 3 Implementation of Video Swin Transformers for 3D Video Modeling
☆26 Updated 7 months ago
Related projects
Alternatives and complementary repositories for VideoSwin
- Easiest way of fine-tuning HuggingFace video classification models ☆133 Updated last year
- Implementation of Deep Orthogonal Fusion of Local and Global Features in TensorFlow 2 ☆25 Updated last year
- A Keras implementation of a hybrid EfficientNet-Swin Transformer model. ☆33 Updated last year
- Implementation of Swin Transformers in TensorFlow along with converted pre-trained models, code for off-the-shelf classification and fine… ☆56 Updated 2 years ago
- This repository demonstrates how to use the TensorFlow-based SegFormer model in the 🤗 transformers package. ☆30 Updated 2 years ago
- Keras (TensorFlow v2) reimplementation of Swin Transformer V1 and V2 models ☆20 Updated 2 months ago
- PyTorch implementation of a collection of scalable Video Transformer Benchmarks. ☆283 Updated 2 years ago
- ☆24 Updated 2 years ago
- Official repository for "Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition" [ICCV 2023] ☆88 Updated 6 months ago
- Vision transformers with PyTorch and PyTorch Lightning ☆0 Updated 3 weeks ago
- PyTorch and TensorFlow/Keras image models with automatic weight conversions and equal API/implementations - Vision Transformer (ViT), Res… ☆34 Updated last year
- Implementation of Multi-Attention, consisting of CBAM and DeepMoji ☆8 Updated 3 years ago
- A library that includes Keras3 layers, blocks and models with pretrained weights, providing support for transfer learning, feature extrac… ☆40 Updated 3 weeks ago
- TensorFlow 2.0 Implementation of GCViT: Global Context Vision Transformer ☆26 Updated 10 months ago
- Includes PyTorch -> Keras model porting code for ConvNeXt family of models with fine-tuning and inference notebooks. ☆100 Updated 2 years ago
- [NeurIPS'22] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training ☆15 Updated 9 months ago
- Unofficial implementation of "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" (https://arxiv.org/abs/2103.14030) ☆64 Updated 3 years ago
- ☆14 Updated 3 years ago
- Implementation of MobileViT in TensorFlow and Keras ☆11 Updated last year
- ModelSoups for TensorFlow2 and Torch ☆47 Updated 2 years ago
- A Detection Toolbox for TensorFlow2 ☆56 Updated last year
- A summarization of Transformer-based architectures for CV tasks, including image classification, object detection, segmentation, and Few-… ☆106 Updated 2 years ago
- An implementation of the X3D video recognition architecture in TensorFlow/Keras ☆15 Updated 3 years ago
- 1st place solution of RSNA Screening Mammography Breast Cancer Detection competition on Kaggle: https://www.kaggle.com/competitions/rsna-… ☆78 Updated last year
- A PyTorch implementation of EfficientNet ☆167 Updated last year
- PyTorch Lightning wrapper to make training ResNet classifiers easier. ☆27 Updated last year
- menovideo: PyTorch library for video action recognition and video understanding ☆28 Updated 3 years ago
- 2nd Place Solution for the RSNA 2023 Abdominal Trauma Detection Kaggle Competition ☆36 Updated this week
- [ICCV2023] UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer ☆291 Updated 7 months ago
- Visualizing the learned space-time attention using Attention Rollout ☆32 Updated 2 years ago