dominickrei / pi-vit
[CVPR 2024] Code and models for pi-ViT, a video transformer for understanding activities of daily living
☆12Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for pi-vit
- [CVPR2024] Official implementation of the paper: Skeleton-in-Context: Unified Skeleton Sequence Modeling with In-Context Learning☆35Updated 5 months ago
- Codebase for "Every Shot Counts: Using Exemplars for Repetition Counting in Videos"☆19Updated 3 weeks ago
- Codebase for the paper: "TIM: A Time Interval Machine for Audio-Visual Action Recognition"☆37Updated this week
- Official Repo for CVPR 2024 Paper "FACT: Frame-Action Cross-Attention Temporal Modeling for Efficient Fully-Supervised Action Segmentatio…☆32Updated 4 months ago
- Code release for the paper "Egocentric Video Task Translation" (CVPR 2023 Highlight)☆31Updated last year
- Code release for "EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone" [ICCV, 2023]☆90Updated 4 months ago
- ☆57Updated last year
- ☆42Updated 10 months ago
- ☆17Updated 7 months ago
- [ECCV 2024 Oral] ActionVOS: Actions as Prompts for Video Object Segmentation☆24Updated 3 weeks ago
- The official project website of "Ske2Grid: Skeleton-to-Grid Representation Learning for Action Recognition" (The paper of Ske2Grid is pub…☆20Updated last year
- Code and data release for the paper "Learning Object State Changes in Videos: An Open-World Perspective" (CVPR 2024)☆30Updated 2 months ago
- [CVPR'23 Highlight] AutoAD: Movie Description in Context.☆87Updated this week
- SLIC: Self-Supervised Learning with Iterative Clustering for Human Action Videos [CVPR 2022]☆19Updated last year
- CAPE using text-graphs☆12Updated 5 months ago
- Code implementation for paper titled "HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision"☆20Updated 6 months ago
- Actor-agnostic Multi-label Action Recognition with Multi-modal Query [ICCVW '23]☆22Updated last year
- Official implementation of "HowToCaption: Prompting LLMs to Transform Video Annotations at Scale." ECCV 2024☆45Updated last month
- 🔥 [ECCV 2024] Motion Mamba: Efficient and Long Sequence Motion Generation☆112Updated 3 weeks ago
- [ECCV2024, Oral, Best Paper Finalist]This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation …☆31Updated this week
- ☆52Updated 2 years ago
- Code for Diffusion Action Segmentation (ICCV 2023)☆52Updated last year
- PiTe: Pixel-Temporal Alignment for Large Video-Language Model☆12Updated last month
- ☆30Updated last month
- [CVPR 2023] HierVL Learning Hierarchical Video-Language Embeddings☆44Updated last year
- Official implementation of the ICCV 2023 paper "Masked Motion Predictors are Strong 3D Action Representation Learners"☆39Updated last year
- Official Implementation of the paper "Unified Fully and Timestamp Supervised Temporal Action Segmentation via Sequence to Sequence Transl…☆32Updated last year
- [TCSVT 2024] Temporally Consistent Referring Video Object Segmentation with Hybrid Memory☆12Updated 3 weeks ago
- [arXiv:2309.16669] Code release for "Training a Large Video Model on a Single Machine in a Day"☆114Updated 3 months ago
- Multimodal Video Understanding Framework (MVU)☆23Updated 5 months ago