[CVPR 2024] Code and models for pi-ViT, a video transformer for understanding activities of daily living
☆31Nov 12, 2025Updated 6 months ago
Alternatives and similar repositories for pi-vit
Users that are interested in pi-vit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for the paper Seeing the Pose in the Pixels: Learning Pose-Aware Representations in Vision Transformers☆21Aug 2, 2024Updated last year
- This is the offical repository of LLAVIDAL☆24Oct 4, 2025Updated 7 months ago
- IJCAI 2024 Shap-Mix: Shapley Value Guided Mixing for Long-Tailed Skeleton Based Action Recognition☆15Nov 25, 2024Updated last year
- [CVPR 2026] Official Repository of 'MS-Temba: Multi-Scale Temporal Mamba for Understanding Long Untrimmed Videos'☆45Jan 23, 2026Updated 4 months ago
- Pytorch I3D implmentation on Toyota Smarthome Dataset☆17Apr 23, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆48Nov 24, 2023Updated 2 years ago
- A code support using OpenCV, Yolo, SimCC, SMPLX, Open3D, FBX-SDK, Blender and Maya.☆46Oct 23, 2024Updated last year
- SI-MIL☆36Jan 3, 2025Updated last year
- OPSTL: Self-supervised Skeleton-based Action Recognition in Occluded Environments☆14Oct 25, 2023Updated 2 years ago
- ☆14Nov 15, 2023Updated 2 years ago
- Source code of "FlowWalker: A Memory-efficient and High-performance GPU-based Dynamic Graph Random Walk Framework"☆11Oct 23, 2024Updated last year
- ☆20Jan 29, 2023Updated 3 years ago
- Tools for Toyota Smarthome datasets☆14Nov 16, 2022Updated 3 years ago
- [ACMMM 2023] Skeleton-MixFormer: Multivariate Topology Representation for Skeleton-based Action Recognition☆24Sep 24, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- We have implemented Track # 1 for ICME 2024: Spatial Action Localization on Chaotic World dataset. Our mAP on the validation set reaches …☆12Nov 11, 2024Updated last year
- The Official PyTorch implementation of "Part Aware Contrastive Learning for Self-Supervised Action Recognition" in IJCAI 2023☆13Nov 9, 2023Updated 2 years ago
- [NeurIPS 2024] CHASE: Learning Convex Hull Adaptive Shift for Skeleton-based Multi-Entity Action Recognition☆16Nov 12, 2025Updated 6 months ago
- Code for NeurIPS 2022 paper "Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space"☆20Apr 20, 2023Updated 3 years ago
- This is a repo of extension of VPN for Recognition of Activities of Daily Living☆16May 17, 2021Updated 5 years ago
- Placeholder☆10Jul 17, 2023Updated 2 years ago
- [ICCV'23] PAINet: Parallel Attention Interaction Network for Few-shot Skeleton-based Action Recognition☆11Oct 14, 2023Updated 2 years ago
- Implementation of the paper: VG4D: Vision-Language Model Goes 4D Video Recognition(ICRA 2024)☆15Apr 23, 2024Updated 2 years ago
- ☆12Apr 28, 2018Updated 8 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- The official codebase of FineAction dataset. We will update the data and code of our FineAction.☆24Apr 10, 2025Updated last year
- [ICCV2023] Chaotic World: A Large and Challenging Benchmark for Human Behavior Understanding in Chaotic Events☆10Dec 7, 2024Updated last year
- [CAC2023] Bilateral Network with Residual U-blocks and Dual-Guided Attention for Real-time Semantic Segmentation☆11Nov 28, 2024Updated last year
- [AAAI 2025] Official Repository of 'SKI Models: Skeleton Induced Vision-Language Embeddings for Understanding Activities of Daily Living'☆23Sep 17, 2025Updated 8 months ago
- Deformable Graph Convolutional Networks (Author's PyTorch implementation for the AAAI 2022 paper)☆27Sep 22, 2022Updated 3 years ago
- Official Implementation of our ICML 2025 paper: "D-MoLE: Dynamic Mixture of Curriculum LoRA Experts for Continual Multimodal Instruction …☆27Jan 11, 2026Updated 4 months ago
- [Codes of paper]: Busy-Quiet Video Disentangling for Video Classification☆14Jan 17, 2022Updated 4 years ago
- A project about deploying a yolo server to support inferring image sent by different clients.☆10Mar 23, 2024Updated 2 years ago
- [CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv…☆135May 21, 2023Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Public code release for SIGGRAPH 2021 paper: ShapeMOD: Macro Operation Discovery for 3D Shape Programs☆13Sep 8, 2021Updated 4 years ago
- "Linear Regression vs. Deep Learning". The source code for a simple but effective baseline method for human body measurement estimation u…☆10Jan 19, 2023Updated 3 years ago
- [ACM MM 2024] Frequency Guidance Matters: Skeletal Action Recognition by Frequency-Aware Mixed Transformer☆21Apr 28, 2026Updated last month
- Fingers enrolled using the R305 module and stored in a database along with the person's name. Next just by typing the persons name, his/h…☆13Jul 8, 2018Updated 7 years ago
- Code for Static and Dynamic Concepts for Self-supervised Video Representation Learning.☆11Jul 28, 2022Updated 3 years ago
- [ICCV 2023] Latent Action Composition for Skeleton-based Action Segmentation☆22Oct 25, 2023Updated 2 years ago
- Tensorflow Implementation of Deep Metric Learning with Angular Loss☆17Dec 3, 2018Updated 7 years ago