innat / VideoSwinLinks
Keras 3 Implementation of Video Swin Transformers for 3D Video Modeling
☆32Updated 3 months ago
Alternatives and similar repositories for VideoSwin
Users that are interested in VideoSwin are comparing it to the libraries listed below
Sorting:
- Official repository for "Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition" [ICCV 2023]☆101Updated last year
- Easiest way of fine-tuning HuggingFace video classification models☆147Updated 2 years ago
- An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning"☆94Updated last year
- Action recognition tutorial using UCF-101 dataset.☆29Updated 4 years ago
- Self-Supervised Learning in PyTorch☆143Updated last year
- Video classification exercise using UCF101 data for training an early-fusion and SlowFast architecture model, both using the PyTorch Ligh…☆15Updated 4 years ago
- Official implementation of "Delving into CLIP latent space for Video Anomaly Recognition", CVIU 2024☆93Updated 4 months ago
- Code Release for MViTv2 on Image Recognition.☆450Updated last year
- Awesome Fine-Grained Image Classification☆101Updated 4 months ago
- [NeurIPS'22] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training☆22Updated 2 years ago
- [CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv…☆134Updated 2 years ago
- Normalizing Flows for Human Pose Anomaly Detection [ICCV 2023]☆97Updated 2 years ago
- menovideo: pytorch library for video action recognition and video understanding☆29Updated 4 years ago
- Official code repository for ICML 2025 paper: "ExPLoRA: Parameter-Efficient Extended Pre-training to Adapt Vision Transformers under Doma…☆51Updated last month
- [EMNLP'23] ClimateGPT: a specialized LLM for conversations related to Climate Change and Sustainability topics in both English and Arabi…☆79Updated last year
- Video Swin Transformer - PyTorch☆265Updated 4 years ago
- Exploring the applicability of Grad-CAM for explanation in video based dataset☆33Updated 2 years ago
- Visualizing the learned space-time attention using Attention Rollout☆40Updated 3 years ago
- This folder of code contains code and notebooks to supplement the "Vision Transformers Explained" series published on Towards Data Scienc…☆96Updated last year
- PyTorch implementation of a collections of scalable Video Transformer Benchmarks.☆305Updated 3 years ago
- This is an official implementation for "Attention-based Residual Autoencoder for Video Anomaly Detection".☆127Updated last year
- Video Summarization With Spatiotemporal Vision Transformer☆23Updated 2 years ago
- Implementation of brand new video augmentation strategy for video action recognition with 3D CNN☆27Updated 4 years ago
- Awesome Video Anomaly Detection☆104Updated 5 months ago
- non-official NoisyNN Implemnentation☆50Updated 2 years ago
- [CVPR2024] Learning CNN on ViT: A Hybrid Model to Explicitly Class-specific Boundaries for Domain Adaptation☆38Updated last year
- An implementation of the paper "End-to-End Human-Gaze-Target Detection with Transformers"☆19Updated last year
- Library to perform image and video self-supervised learning.☆54Updated 2 weeks ago
- Data-aware Fine-Tuning (DAFT) Code related to the CPVR24 Competition for Medical Image Segmentation on a Laptop.☆36Updated 10 months ago
- Easy to use class balanced cross entropy and focal loss implementation for Pytorch☆98Updated last year