Jiewen-Yang / RViT
The Code For "Recurring the Transformer for Video Action Recognition"
☆13Updated last year
Alternatives and similar repositories for RViT:
Users who are interested in RViT are comparing it to the libraries listed below:
- ☆19Updated 2 years ago
- Video Swin Transformer - PyTorch☆253Updated 3 years ago
- Code release for ActionFormer (ECCV 2022)☆483Updated last year
- fourierer / Video_Classification_ResNet3D_R2plus1D_ip-CSN_train-UCF101-HMDB51-Kinetics400-from-scratch: Using ResNet3D-50, R(2+1)D-50, and ip_CSN-50 to train UCF-101, HMDB-51, and Kinetics-400 from scratch.☆28Updated 4 years ago
- [CVPR2023] Code for the paper, TriDet: Temporal Action Detection with Relative Boundary Modeling☆181Updated last year
- I3D features extractor with resnet50 backbone☆74Updated 2 years ago
- Video feature extraction pipeline that supports diverse models including I3D, SlowFast, EgoVLP, and CLIP.☆13Updated last year
- ☆40Updated last year
- PyTorch implementation of a collection of scalable Video Transformer Benchmarks.☆294Updated 2 years ago
- ☆80Updated 5 years ago
- Official Repo for CVPR 2024 Paper "FACT: Frame-Action Cross-Attention Temporal Modeling for Efficient Fully-Supervised Action Segmentatio…☆59Updated 3 months ago
- Code for I3D Feature Extraction☆147Updated 5 years ago
- Implementation of ViViT: A Video Vision Transformer☆528Updated 3 years ago
- Code for Diffusion Action Segmentation (ICCV 2023)☆61Updated last year
- This is the official implementation of the paper "ActionCLIP: A New Paradigm for Action Recognition"☆550Updated last year
- Official repo for BMVC2021 paper ASFormer: Transformer for action segmentation☆105Updated 3 years ago
- Take an Emotion Walk: Perceiving Emotions from Gaits Using Hierarchical Attention Pooling and Affective Mapping☆12Updated 4 years ago
- [CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking☆623Updated 6 months ago
- A curated list of awesome temporal action segmentation resources.☆196Updated last year
- Spatial Temporal Graph Convolutional Networks for Emotion Perception from Gaits☆74Updated 2 years ago
- [ICCV2021] Official code for "Channel-wise Topology Refinement Graph Convolution for Skeleton-Based Action Recognition"☆300Updated 3 years ago
- [CVPR 2023] Neural Koopman Pooling: Control-Inspired Temporal Dynamics Encoding for Skeleton-Based Action Recognition☆37Updated 6 months ago
- Normalizing Flows for Human Pose Anomaly Detection [ICCV 2023]☆84Updated last year
- Awesome Action Quality Assessment (AQA)☆68Updated 2 weeks ago
- [ICCV2023] UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer☆309Updated last year
- Temporal Action Detection & Weakly Supervised Temporal Action Detection & Temporal Action Proposal Generation☆482Updated this week
- This is the repository for MMASD: A Multimodal Dataset for Autism Intervention Analysis.☆23Updated last year
- Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and T…☆591Updated 2 months ago
- Official implementation for "InfoGCN: Representation Learning for Human Skeleton-Based Action Recognition"☆125Updated 2 years ago
- (CVPR2024) Realigning Confidence with Temporal Saliency Information for Point-level Weakly-Supervised Temporal Action Localization☆18Updated 10 months ago