KSonPham / ViVit-a-Pytorch-implementation
☆21Updated 2 years ago
Alternatives and similar repositories for ViVit-a-Pytorch-implementation:
Users that are interested in ViVit-a-Pytorch-implementation are comparing it to the libraries listed below
- Implementation of ViViT: A Video Vision Transformer☆526Updated 3 years ago
- PyTorch implementation of a collections of scalable Video Transformer Benchmarks.☆292Updated 2 years ago
- ☆306Updated last month
- ☆13Updated 11 months ago
- avoskou / Stochastic-Transformer-Networks-with-Linear-Competing-Units-Application-to-end-to-end-SL-Translatio☆19Updated 2 years ago
- Continuous Sign Language Recognition with Correlation Network (CVPR 2023)☆119Updated 2 months ago
- [CVPR 2024] Official implementation of the paper "TE-TAD: Towards Full End-to-End Temporal Action Detection via Time-Aligned Coordinate E…☆19Updated 9 months ago
- Video Swin Transformer - PyTorch☆251Updated 3 years ago
- ☆54Updated last year
- ☆831Updated 10 months ago
- MoViNets PyTorch implementation: Mobile Video Networks for Efficient Video Recognition;☆265Updated 2 years ago
- ☆66Updated 3 years ago
- Temporal Action Detection & Weakly Supervised Temporal Action Detection & Temporal Action Proposal Generation☆476Updated this week
- Paper list of sign language, including sign language recognition(SLR), sign language translation(SLT) and other interesting work. Quick s…☆112Updated 2 weeks ago
- Self-Emphasizing Network for Continuous Sign Language Recognition (AAAI2023 Oral)☆48Updated 2 months ago
- An Attention Based Approach to Sign Language Recognition | SOTA on WLASL Joints | https://arxiv.org/abs/2212.10746☆15Updated last year
- Visual Alignment Constraint for Continuous Sign Language Recognition. ( ICCV 2021)☆129Updated 2 years ago
- ☆18Updated last week
- Code Release for MViTv2 on Image Recognition.☆421Updated 4 months ago
- This repo contains the official code of our work SAM-SLR which won the CVPR 2021 Challenge on Large Scale Signer Independent Isolated Sig…☆217Updated 2 years ago
- [ICCV2023] UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer☆307Updated 11 months ago
- Sign Language Transformers (CVPR'20)☆252Updated 8 months ago
- Code release for ActionFormer (ECCV 2022)☆474Updated 11 months ago
- Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and T…☆583Updated last month
- Generic PyTorch dataset implementation to load and augment VIDEOS for deep learning training loops.☆459Updated 2 years ago
- [CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv…☆123Updated last year
- Official code for "Learning Prompt-Enhanced Context features for Weakly-Supervised Video Anomlay Detection" (IEEE-TIP)☆83Updated 7 months ago
- This is an official implementation for "Video Swin Transformers".☆1,521Updated 2 years ago
- Temporal Lift Pooling for Continuous Sign Language Recognition (ECCV2022)☆22Updated 8 months ago
- 🌟 Code for ACL 2023 paper "GloFE: Gloss-Free End-to-End Sign Language Translation" (Oral)☆37Updated last year