[CVPR 2024] Code and models for pi-ViT, a video transformer for understanding activities of daily living
☆30Nov 12, 2025Updated 3 months ago
Alternatives and similar repositories for pi-vit
Users that are interested in pi-vit are comparing it to the libraries listed below
Sorting:
- This is the offical repository of LLAVIDAL☆23Oct 4, 2025Updated 5 months ago
- ☆47Nov 24, 2023Updated 2 years ago
- IJCAI 2024 Shap-Mix: Shapley Value Guided Mixing for Long-Tailed Skeleton Based Action Recognition☆14Nov 25, 2024Updated last year
- Pytorch I3D implmentation on Toyota Smarthome Dataset☆17Apr 23, 2022Updated 3 years ago
- A code support using OpenCV, Yolo, SimCC, SMPLX, Open3D, FBX-SDK, Blender and Maya.☆43Oct 23, 2024Updated last year
- ☆15Nov 15, 2023Updated 2 years ago
- ☆20Jan 29, 2023Updated 3 years ago
- [ACMMM 2024] Implementation of the paper “Multi-Modality Co-Learning for Efficient Skeleton-based Action Recognition“.☆44Mar 21, 2025Updated 11 months ago
- [CVPR 2026] Official Repository of 'MS-Temba: Multi-Scale Temporal Mamba for Understanding Long Untrimmed Videos'☆37Jan 23, 2026Updated last month
- The official codebase of FineAction dataset. We will update the data and code of our FineAction.☆22Apr 10, 2025Updated 11 months ago
- [CAC2023] Bilateral Network with Residual U-blocks and Dual-Guided Attention for Real-time Semantic Segmentation☆11Nov 28, 2024Updated last year
- [WACV 2024] Code for "Limited Data, Unlimited Potential: A Study on ViTs Augmented by Masked Autoencoders"☆25Aug 16, 2024Updated last year
- [ACMMM 2023] Skeleton-MixFormer: Multivariate Topology Representation for Skeleton-based Action Recognition☆24Sep 24, 2023Updated 2 years ago
- OPSTL: Self-supervised Skeleton-based Action Recognition in Occluded Environments☆14Oct 25, 2023Updated 2 years ago
- [CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv…☆135May 21, 2023Updated 2 years ago
- [ECCV 2024] The official repo for "SA-DVAE: Improving Zero-Shot Skeleton-Based Action Recognition by Disentangled Variational Autoencoder…☆37Jul 19, 2024Updated last year
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- Collaboration of two technologies (Machine Learning and TCAD) to improve the productivity in Semiconductor manufacturing industry☆10May 3, 2019Updated 6 years ago
- Deep Learning Development Documents☆10Feb 15, 2026Updated 3 weeks ago
- Repository for CHUNGUS (RA-L 2025)☆16May 2, 2025Updated 10 months ago
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer☆16Sep 7, 2024Updated last year
- [ICCV 2023] Hierarchically Decomposed Graph Convolutional Networks for Skeleton-Based Action Recognition☆160Oct 5, 2023Updated 2 years ago
- [WACV 2025] Official implementation of "Online-LoRA: Task-free Online Continual Learning via Low Rank Adaptation" by Xiwen Wei, Guihong L…☆56Aug 26, 2025Updated 6 months ago
- UVA-Human-Skeleton-Preprocessing☆10May 4, 2023Updated 2 years ago
- The official implementation of GCLSS (Generalized CLSS) and CLSS (NeurIPS 2023: Semi-Supervised Contrastive Learning for Deep Regression …☆13Aug 29, 2025Updated 6 months ago
- Easy to install Text to Speech system for Raspberry Pi 4☆14Mar 4, 2024Updated 2 years ago
- Official implementation of our CVPR'22 paper.☆13Nov 18, 2022Updated 3 years ago
- [ICLR 2025] Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception☆14Jul 4, 2025Updated 8 months ago
- Code for ACL 2023 main conference paper "Back Translation for Speech-to-text Translation Without Transcripts".☆12Oct 25, 2023Updated 2 years ago
- ☆18May 15, 2025Updated 9 months ago
- DiTASK: Multi-Task Fine-Tuning with Diffeomorphic Transformations (CVPR 2025)☆12Jun 1, 2025Updated 9 months ago
- We have implemented Track # 1 for ICME 2024: Spatial Action Localization on Chaotic World dataset. Our mAP on the validation set reaches …☆12Nov 11, 2024Updated last year
- Zero-Cost Whole-Body Teleoperation for Mobile Manipulation☆11Mar 4, 2025Updated last year
- [ICCV 2023] Code for "Multi-task View Synthesis with Neural Radiance Fields"☆11Oct 2, 2023Updated 2 years ago
- [TPAMI 2026] Enhancing MMDiT-Based Text-to-Image Models for Similar Subject Generation☆11Nov 30, 2025Updated 3 months ago
- Repository for evaluating Pegasus-1 and video-language foundation models☆14Nov 12, 2024Updated last year
- 🔥[T-ITS 2025, Official Code] for paper "Evidence-based Real-time Road Segmentation with RGB-D Data Augmentation". Official Weights and D…☆15Apr 29, 2025Updated 10 months ago
- [AAAI 2025] Official pytorch implementation of "Diffusion Model Patching via Mixture-of-Prompts"☆13Dec 12, 2024Updated last year
- Exposing Text-Image Inconsistency Using Diffusion Models (ICLR 2024)☆10Jun 15, 2024Updated last year