KSonPham / ViVit-a-Pytorch-implementationLinks

☆22

Alternatives and similar repositories for ViVit-a-Pytorch-implementation

Users that are interested in ViVit-a-Pytorch-implementation are comparing it to the libraries listed below

Sorting:

rishikksh20 / ViViT-pytorch
Implementation of ViViT: A Video Vision Transformer
☆537Updated 4 years ago
mx-mark / VideoTransformer-pytorch
PyTorch implementation of a collections of scalable Video Transformer Benchmarks.
☆298Updated 3 years ago
drv-agwl / ViViT-pytorch
☆67Updated 4 years ago
happyharrycn / actionformer_release
Code release for ActionFormer (ECCV 2022)
☆496Updated last year
haofanwang / video-swin-transformer-pytorch
Video Swin Transformer - PyTorch
☆256Updated 3 years ago
cvdfoundation / kinetics-dataset
☆861Updated last year
v-iashin / video_features
Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and T…
☆599Updated 4 months ago
OpenGVLab / UniFormerV2
[ICCV2023] UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer
☆317Updated last year
sallymmx / ActionCLIP
This is the official implement of paper "ActionCLIP: A New Paradigm for Action Recognition"
☆559Updated last year
dingfengshi / TriDet
[CVPR2023] Code for the paper, TriDet: Temporal Action Detection with Relative Boundary Modeling
☆185Updated last year
GowthamGottimukkala / I3D_Feature_Extraction_resnet
I3D features extractor with resnet50 backbone
☆73Updated 2 years ago
zhenyingfang / Awesome-Temporal-Action-Detection-Temporal-Action-Proposal-Generation
Temporal Action Detection & Weakly Supervised Temporal Action Detection & Temporal Action Proposal Generation
☆507Updated this week
SwinTransformer / Video-Swin-Transformer
This is an official implementation for "Video Swin Transformers".
☆1,557Updated 2 years ago
Atze00 / MoViNet-pytorch
MoViNets PyTorch implementation: Mobile Video Networks for Efficient Video Recognition;
☆274Updated 3 years ago
nus-cvml / awesome-temporal-action-segmentation
A curated list of awesome temporal action segmentation resources.
☆203Updated last year
bomri / SlowFast
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
☆86Updated 3 years ago
davide-coccomini / TimeSformer-Video-Classification
The notebook explains the various steps to obtain the results of publication: "Is Space-Time Attention All You Need for Video Understandi…
☆42Updated 4 years ago
OpenGVLab / VideoMAEv2
[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
☆648Updated 8 months ago
UtopAIBuilder / Grad-CAM-for-video-and-regression-task
Exploring the applicability of Grad-CAM for explanation in video based dataset
☆32Updated last year
lucidrains / TimeSformer-pytorch
Implementation of TimeSformer from Facebook AI, a pure attention-based solution for video classification
☆719Updated 3 years ago
RaivoKoot / Video-Dataset-Loading-Pytorch
Generic PyTorch dataset implementation to load and augment VIDEOS for deep learning training loops.
☆462Updated 2 years ago
avoskou / Stochastic-Transformer-Networks-with-Linear-Competing-Units-Application-to-end-to-end-SL-Translatio
☆19Updated 3 years ago
Dotori-HJ / TE-TAD
[CVPR 2024] Official implementation of the paper "TE-TAD: Towards Full End-to-End Temporal Action Detection via Time-Aligned Coordinate E…
☆25Updated 11 months ago
stnoah1 / infogcn
Official implementation for "InfoGCN: Representation Learning for Human Skeleton-Based Action Recognition"
☆125Updated 2 years ago
facebookresearch / mvit
Code Release for MViTv2 on Image Recognition.
☆427Updated 6 months ago
hulianyuyy / CorrNet
Continuous Sign Language Recognition with Correlation Network (CVPR 2023)
☆124Updated 4 months ago
kylemin / S3D
Release of the pretrained S3D Network in PyTorch (ECCV 2018)
☆133Updated last year
WoominM / DeGCN_pytorch
Official PyTorch implementation of "DeGCN : Deformable Graph Convolutional Networks for Skeleton-Based Action Recognition"
☆45Updated last year
Jho-Yonsei / HD-GCN
[ICCV 2023] Hierarchically Decomposed Graph Convolutional Networks for Skeleton-Based Action Recognition
☆147Updated last year
MCG-NJU / VideoMAE
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
☆1,520Updated last year