innat / VideoSwinLinks
Keras 3 Implementation of Video Swin Transformers for 3D Video Modeling
☆33Updated last week
Alternatives and similar repositories for VideoSwin
Users that are interested in VideoSwin are comparing it to the libraries listed below
Sorting:
- Easiest way of fine-tuning HuggingFace video classification models☆145Updated 2 years ago
- Official repository for "Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition" [ICCV 2023]☆100Updated last year
- [NeurIPS'22] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training☆22Updated last year
- Code Release for MViTv2 on Image Recognition.☆443Updated 11 months ago
- PyTorch implementation of a collections of scalable Video Transformer Benchmarks.☆303Updated 3 years ago
- An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning"☆93Updated last year
- Official implementation of "Delving into CLIP latent space for Video Anomaly Recognition", CVIU 2024☆88Updated last month
- The repository collects many various multi-modal transformer architectures, including image transformer, video transformer, image-languag…☆230Updated 3 years ago
- [EMNLP'23] ClimateGPT: a specialized LLM for conversations related to Climate Change and Sustainability topics in both English and Arabi…☆79Updated last year
- Library to perform image and video self-supervised learning.☆53Updated last year
- Self-Supervised Learning in PyTorch☆142Updated last year
- Action recognition tutorial using UCF-101 dataset.☆28Updated 3 years ago
- Implementation code of the work "Exploiting Multiple Sequence Lengths in Fast End to End Training for Image Captioning"☆93Updated 10 months ago
- An implementation of the X3D video recognition architecture in TensorFlow/Keras☆15Updated 4 years ago
- Video Swin Transformer - PyTorch☆266Updated 3 years ago
- ☆44Updated 4 months ago
- This is an official implementation for "Attention-based Residual Autoencoder for Video Anomaly Detection".☆124Updated 9 months ago
- Implementation of Deep Orthogonal Fusion of Local and Global Features in TensorFlow 2☆26Updated 2 years ago
- This repository demonstrates how to use TensorFlow based SegFormer model in 🤗 transformers package.☆30Updated 3 years ago
- Video Summarization With Spatiotemporal Vision Transformer☆22Updated 2 years ago
- Video classification exercise using UCF101 data for training an early-fusion and SlowFast architecture model, both using the PyTorch Ligh…☆15Updated 3 years ago
- ☆29Updated 6 months ago
- Visualizing the learned space-time attention using Attention Rollout☆37Updated 3 years ago
- Implementation of brand new video augmentation strategy for video action recognition with 3D CNN☆27Updated 4 years ago
- [CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv…☆133Updated 2 years ago
- "Tail-Aware Sperm Analysis for Transparent Tracking of Spermatozoa" Official Implementation☆10Updated 7 months ago
- GroundedSAM Base Model plugin for Autodistill☆52Updated last year
- [TMM 2023] VideoXum: Cross-modal Visual and Textural Summarization of Videos☆51Updated last year
- Fine-tune Facebook's DETR (DEtection TRansformer) on Colaboratory.☆150Updated 2 years ago
- Keras (TensorFlow v2) reimplementation of Swin Transformer V1 and V2 models☆22Updated last year