KSonPham / ViVit-a-Pytorch-implementation
☆18Updated 2 years ago
Alternatives and similar repositories for ViVit-a-Pytorch-implementation:
Users who are interested in ViVit-a-Pytorch-implementation are comparing it to the repositories listed below.
- PyTorch implementation of a collection of scalable Video Transformer Benchmarks.☆288Updated 2 years ago
- Implementation of ViViT: A Video Vision Transformer☆522Updated 3 years ago
- Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and T…☆567Updated 3 weeks ago
- ☆67Updated 3 years ago
- Video Swin Transformer - PyTorch☆243Updated 3 years ago
- [CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking☆582Updated 4 months ago
- ☆299Updated this week
- avoskou / Stochastic-Transformer-Networks-with-Linear-Competing-Units-Application-to-end-to-end-SL-Translatio☆19Updated 2 years ago
- ☆13Updated 9 months ago
- Explainability for Vision Transformers☆907Updated 2 years ago
- ☆813Updated 9 months ago
- Code Release for MViTv2 on Image Recognition.☆416Updated 2 months ago
- Continuous Sign Language Recognition with Correlation Network (CVPR 2023)☆110Updated 3 weeks ago
- [CVPR 2024] Official implementation of the paper "TE-TAD: Towards Full End-to-End Temporal Action Detection via Time-Aligned Coordinate E…☆17Updated 7 months ago
- [ICCV2023] UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer☆304Updated 10 months ago
- An Attention Based Approach to Sign Language Recognition | SOTA on WLASL Joints | https://arxiv.org/abs/2212.10746☆15Updated last year
- Sign Language Transformers (CVPR'20)☆250Updated 6 months ago
- An unofficial implementation of ViTPose [Y. Xu et al., 2022]☆113Updated last year
- Paper list of sign language, including sign language recognition (SLR), sign language translation (SLT) and other interesting work. Quick s…☆107Updated 3 weeks ago
- MoViNets PyTorch implementation: Mobile Video Networks for Efficient Video Recognition☆261Updated 2 years ago
- Visual Alignment Constraint for Continuous Sign Language Recognition (ICCV 2021)☆126Updated last year
- This is an official implementation for "Video Swin Transformers".☆1,497Updated last year
- Official Implementation of Visual Transformer Pooling for Lip reading☆40Updated 2 years ago
- Official implementation for "Ham2Pose: Animating Sign Language Notation into Pose Sequences" [CVPR 2023]☆48Updated 7 months ago
- Dual Swin Transformer for video-time-series fusion☆15Updated 5 months ago
- This repo contains the official code of our work SAM-SLR which won the CVPR 2021 Challenge on Large Scale Signer Independent Isolated Sig…☆215Updated 2 years ago
- Code release for ActionFormer (ECCV 2022)☆461Updated 10 months ago
- ☆50Updated 11 months ago
- ☆24Updated last year
- ☆12Updated last year