KSonPham / ViVit-a-Pytorch-implementationLinks
☆22Updated 2 years ago
Alternatives and similar repositories for ViVit-a-Pytorch-implementation
Users that are interested in ViVit-a-Pytorch-implementation are comparing it to the libraries listed below
Sorting:
- Implementation of ViViT: A Video Vision Transformer☆539Updated 4 years ago
- PyTorch implementation of a collections of scalable Video Transformer Benchmarks.☆299Updated 3 years ago
- Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and T…☆603Updated 5 months ago
- ☆871Updated last year
- ☆330Updated 4 months ago
- This is an official implementation for "Video Swin Transformers".☆1,564Updated 2 years ago
- Code release for ActionFormer (ECCV 2022)☆507Updated last year
- Continuous Sign Language Recognition with Correlation Network (CVPR 2023)☆127Updated 5 months ago
- Video Swin Transformer - PyTorch☆260Updated 3 years ago
- Generic PyTorch dataset implementation to load and augment VIDEOS for deep learning training loops.☆463Updated 2 years ago
- ☆68Updated 4 years ago
- [ICASSP 2024] Official code for Slowfast Network for Continuous Sign Language Recognition☆50Updated 2 weeks ago
- [CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking☆656Updated 9 months ago
- The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"☆1,720Updated last year
- Sign Language Transformers (CVPR'20)☆269Updated 11 months ago
- avoskou / Stochastic-Transformer-Networks-with-Linear-Competing-Units-Application-to-end-to-end-SL-Translatio☆19Updated 3 years ago
- Exploring the applicability of Grad-CAM for explanation in video based dataset☆32Updated last year
- CorrNet+: Sign Language Recognition and Translation via Spatial-Temporal Correlation☆27Updated 5 months ago
- [NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training☆1,541Updated last year
- ☆23Updated 3 months ago
- Repository accompanying the "Sign Pose-based Transformer for Word-level Sign Language Recognition" paper☆90Updated 2 years ago
- Paper list of sign language, including sign language recognition(SLR), sign language translation(SLT) and other interesting work. Quick s…☆132Updated 2 months ago
- ☆62Updated last year
- [CVPR 2024] Official implementation of the paper "TE-TAD: Towards Full End-to-End Temporal Action Detection via Time-Aligned Coordinate E…☆26Updated last year
- Temporal Action Detection & Weakly Supervised Temporal Action Detection & Temporal Action Proposal Generation☆514Updated last month
- Code Release for MViTv2 on Image Recognition.☆432Updated 7 months ago
- Online and real-time violence recognition☆15Updated 3 years ago
- This is the official implement of paper "ActionCLIP: A New Paradigm for Action Recognition"☆563Updated last year
- Visual Alignment Constraint for Continuous Sign Language Recognition. ( ICCV 2021)☆133Updated 2 years ago
- Explainability for Vision Transformers☆982Updated 3 years ago