KSonPham / ViVit-a-Pytorch-implementationLinks
☆22Updated 3 years ago
Alternatives and similar repositories for ViVit-a-Pytorch-implementation
Users that are interested in ViVit-a-Pytorch-implementation are comparing it to the libraries listed below
Sorting:
- Implementation of ViViT: A Video Vision Transformer☆555Updated 4 years ago
- PyTorch implementation of a collections of scalable Video Transformer Benchmarks.☆305Updated 3 years ago
- ☆915Updated last year
- Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and T…☆629Updated 10 months ago
- ☆363Updated 9 months ago
- Generic PyTorch dataset implementation to load and augment VIDEOS for deep learning training loops.☆466Updated 2 years ago
- Exploring the applicability of Grad-CAM for explanation in video based dataset☆33Updated 2 years ago
- The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"☆1,795Updated last year
- Video Swin Transformer - PyTorch☆266Updated 3 years ago
- [CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking☆712Updated last year
- Code release for ActionFormer (ECCV 2022)☆528Updated last year
- Continuous Sign Language Recognition with Correlation Network (CVPR 2023)☆143Updated last week
- Code Release for MViTv2 on Image Recognition.☆447Updated last year
- This is an official implementation for "Video Swin Transformers".☆1,606Updated 2 years ago
- An Attention Based Approach to Sign Language Recognition | SOTA 2022 on WLASL Joints | https://arxiv.org/abs/2212.10746☆21Updated 6 months ago
- ☆30Updated 8 months ago
- ☆70Updated 4 years ago
- CorrNet+: Sign Language Recognition and Translation via Spatial-Temporal Correlation☆32Updated 10 months ago
- [CVPR 2024] Official implementation of the paper "TE-TAD: Towards Full End-to-End Temporal Action Detection via Time-Aligned Coordinate E…☆27Updated last year
- Sign Language Transformers (CVPR'20)☆282Updated last year
- Explainability for Vision Transformers☆1,019Updated 3 years ago
- Temporal Action Detection & Weakly Supervised Temporal Action Detection & Temporal Action Proposal Generation☆549Updated this week
- Repository accompanying the "Sign Pose-based Transformer for Word-level Sign Language Recognition" paper☆94Updated 2 years ago
- This is the official implement of paper "ActionCLIP: A New Paradigm for Action Recognition"☆592Updated last year
- avoskou / Stochastic-Transformer-Networks-with-Linear-Competing-Units-Application-to-end-to-end-SL-Translatio☆17Updated 3 years ago
- [ICLR2022] official implementation of UniFormer☆889Updated last year
- [NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training☆1,622Updated last year
- Paper list of sign language, including sign language recognition(SLR), sign language translation(SLT) and other work. Quick start your aw…☆174Updated last month
- 🌟 Code for ACL 2023 paper "GloFE: Gloss-Free End-to-End Sign Language Translation" (Oral)☆39Updated 2 years ago
- A curated list of awesome temporal action segmentation resources.☆228Updated last year