GuyKabiri / Video-Classification
Video classification exercise using UCF101 data for training an early-fusion and SlowFast architecture model, both using the PyTorch Lightning framework.
☆13Updated 2 years ago
Related projects: ⓘ
- An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning"☆83Updated last week
- Easiest way of fine-tuning HuggingFace video classification models☆131Updated last year
- Official source code for "Continual 3D Convolutional Neural Networks for Real-time Processing of Videos" [ECCV2022]☆42Updated last year
- Implementation of Uniformer, a simple attention and 3d convolutional net that achieved SOTA in a number of video classification tasks, de…☆96Updated 2 years ago
- Official code repo for TCLR: Temporal Contrastive Learning for Video Representation [CVIU-2022]☆32Updated 6 months ago
- Implementation of STAM (Space Time Attention Model), a pure and simple attention model that reaches SOTA for video classification☆126Updated 3 years ago
- ☆68Updated 11 months ago
- menovideo: pytorch library for video action recognition and video understanding☆28Updated 2 years ago
- Visualizing the learned space-time attention using Attention Rollout☆27Updated 2 years ago
- ☆51Updated 2 years ago
- "Object-Region Video Transformers”, Herzig et al., CVPR 2022☆42Updated 2 years ago
- ☆66Updated last year
- [AAAI 2023 (Oral)] CrissCross: Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity☆23Updated last year
- [WACV'22] Code repository for the paper "Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting", https…☆37Updated 2 years ago
- ☆8Updated 2 years ago
- Official repository for "Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition" [ICCV 2023]☆84Updated 4 months ago
- Action recognition tutorial using UCF-101 dataset.☆22Updated 2 years ago
- Code for the paper: Anticipative Feature Fusion Transformer for Multi-Modal Action Anticipation.☆28Updated last year
- Official code repository for SPAct: Self-supervised Privacy Preservation for Action Recognition [CVPR-2022]☆21Updated 2 years ago
- Implementations of Transformers for Video☆24Updated 3 years ago
- The repository collects many various multi-modal transformer architectures, including image transformer, video transformer, image-languag…☆215Updated 2 years ago
- Semi-Supervised Action Recognition with Temporal Contrastive Learning☆56Updated 5 months ago
- Official repository for "Self-Supervised Video Transformer" (CVPR'22)☆100Updated 2 months ago
- Tools for loading video dataset and transforms on video in pytorch. You can directly load video files without preprocessing.☆70Updated 2 years ago
- Exploring the applicability of Grad-CAM for explanation in video based dataset☆23Updated last year
- Attribution (or visual explanation) methods for understanding video classification networks. Demo codes for WACV2021 paper: Towards Visua…☆19Updated 11 months ago
- BEAR: a new BEnchmark on video Action Recognition☆40Updated 4 months ago
- Transformer for Action Recognition in PyTorch☆37Updated 4 years ago
- Code release for ICCV 2021 paper "Anticipative Video Transformer"☆152Updated 2 years ago
- The notebook explains the various steps to obtain the results of publication: "Is Space-Time Attention All You Need for Video Understandi…☆40Updated 3 years ago