innat / VideoSwinLinks
Keras 3 Implementation of Video Swin Transformers for 3D Video Modeling
☆34Updated 8 months ago
Alternatives and similar repositories for VideoSwin
Users that are interested in VideoSwin are comparing it to the libraries listed below
Sorting:
- Easiest way of fine-tuning HuggingFace video classification models☆142Updated 2 years ago
- Official repository for "Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition" [ICCV 2023]☆102Updated last year
- [NeurIPS'22] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training☆22Updated last year
- Awesome Fine-Grained Image Classification☆86Updated 11 months ago
- Self-Supervised Learning in PyTorch☆138Updated last year
- Action recognition tutorial using UCF-101 dataset.☆28Updated 3 years ago
- This repository demonstrates how to use TensorFlow based SegFormer model in 🤗 transformers package.☆30Updated 3 years ago
- Fine-tune Facebook's DETR (DEtection TRansformer) on Colaboratory.☆151Updated 2 years ago
- ☆76Updated last month
- Vision Transformers for image classification, image segmentation, and object detection.☆56Updated 9 months ago
- Easy to use class balanced cross entropy and focal loss implementation for Pytorch☆96Updated 7 months ago
- non-official NoisyNN Implemnentation☆50Updated last year
- [EMNLP'23] ClimateGPT: a specialized LLM for conversations related to Climate Change and Sustainability topics in both English and Arabi…☆79Updated 10 months ago
- An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning"☆92Updated 11 months ago
- Official implementation of "Delving into CLIP latent space for Video Anomaly Recognition", CVIU 2024☆71Updated 8 months ago
- ModelSoups for Tensorflow2 and Torch☆49Updated 3 years ago
- Code Release for MViTv2 on Image Recognition.☆433Updated 8 months ago
- PyTorch implementation of a collections of scalable Video Transformer Benchmarks.☆300Updated 3 years ago
- ☆27Updated 3 months ago
- [ICCV25] Official Implementation of LeGrad☆78Updated 9 months ago
- End-to-End Object Detection with Transformers☆51Updated last month
- Pytorch implementation of "Fine-grained Visual Classification with High-temperature Refinement and Background Suppression"☆108Updated last year
- A Keras implementation of hybrid efficientnet swin transformer model.☆34Updated last year
- GroundedSAM Base Model plugin for Autodistill☆51Updated last year
- Based on our paper on skin lesion segmentation: "MFSNet: A Multi Focus Segmentation Network for Skin Lesion Segmentation"☆16Updated 3 years ago
- menovideo: pytorch library for video action recognition and video understanding☆29Updated 3 years ago
- Easy-to-read implementation of self-supervised learning using vision transformer and knowledge distillation with no labels - DINO☆29Updated 2 years ago
- [CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv…☆133Updated 2 years ago
- This folder of code contains code and notebooks to supplement the "Vision Transformers Explained" series published on Towards Data Scienc…☆86Updated last year
- Normalizing Flows for Human Pose Anomaly Detection [ICCV 2023]☆88Updated last year