dominickrei / pi-vit
[CVPR 2024] Code and models for pi-ViT, a video transformer for understanding activities of daily living
☆19Updated last month
Alternatives and similar repositories for pi-vit:
Users that are interested in pi-vit are comparing it to the libraries listed below
- Actor-agnostic Multi-label Action Recognition with Multi-modal Query [ICCVW '23]☆23Updated last year
- [CVPR2024] Official implementation of the paper: Skeleton-in-Context: Unified Skeleton Sequence Modeling with In-Context Learning☆38Updated 10 months ago
- [ACMMM 2023] Skeleton-MixFormer: Multivariate Topology Representation for Skeleton-based Action Recognition☆21Updated last year
- [ECCV 2024] Official PyTorch implementation of TC-CLIP "Leveraging Temporal Contextualization for Video Action Recognition"☆54Updated last month
- The official project website of "Ske2Grid: Skeleton-to-Grid Representation Learning for Action Recognition" (The paper of Ske2Grid is pub…☆20Updated last year
- Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model☆49Updated 2 months ago
- Disentangled Pre-training for Human-Object Interaction Detection☆19Updated 4 months ago
- ☆50Updated last year
- ☆19Updated 11 months ago
- (CVPR 2023) Official implemention of the paper "Weakly Supervised Video Representation Learning with Unaligned Text for Sequential Videos…☆29Updated 11 months ago
- [ACMMM 2024] Implementation of the paper “Multi-Modality Co-Learning for Efficient Skeleton-based Action Recognition“.☆38Updated last week
- MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language Knowledge (ICCV 2023)☆30Updated last year
- EventEgo3D: 3D Human Motion Capture from Egocentric Event Streams [CVPR'24]☆24Updated 7 months ago
- Codebase for the paper: "TIM: A Time Interval Machine for Audio-Visual Action Recognition"☆39Updated 4 months ago
- [NeurIPS 2022 Spotlight] VideoMAE for Action Detection☆59Updated 2 years ago
- This repo contains source code for Glance and Focus: Memory Prompting for Multi-Event Video Question Answering (Accepted in NeurIPS 2023)☆26Updated 9 months ago
- Official Repo for CVPR 2024 Paper "FACT: Frame-Action Cross-Attention Temporal Modeling for Efficient Fully-Supervised Action Segmentatio…☆59Updated 2 months ago
- Official implementation of the ICCV 2023 paper "Masked Motion Predictors are Strong 3D Action Representation Learners"☆44Updated last year
- [CVPR 2023] Official PyTorch implementation of the paper "GAP: Post-Processing Temporal Action Detection"☆17Updated last year
- ☆33Updated 10 months ago
- ☆32Updated 2 years ago
- The official repository for ICLR2024 paper "FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition"☆74Updated 2 months ago
- CAPE using text-graphs☆19Updated last month
- IJCAI 2024 Shap-Mix: Shapley Value Guided Mixing for Long-Tailed Skeleton Based Action Recognition☆12Updated 4 months ago
- Official code for CVPR2024 “VideoMAC: Video Masked Autoencoders Meet ConvNets”☆9Updated last year
- [WACV 2021] Selective Spatio-Temporal Aggregation based Pose Refinement System: Towards understanding human activities in real-world vide…☆13Updated 3 years ago
- BEAR: a new BEnchmark on video Action Recognition☆42Updated 11 months ago
- SLIC: Self-Supervised Learning with Iterative Clustering for Human Action Videos [CVPR 2022]☆19Updated 2 years ago
- PoseRAC: Pose Saliency Transformer for Repetitive Action Counting☆15Updated last year
- This is the official implementation of our CVPR 2024 paper "BlockGCN: Redefine Topology Awareness for Skeleton-Based Action Recognition"☆87Updated 8 months ago