Code for the paper Seeing the Pose in the Pixels: Learning Pose-Aware Representations in Vision Transformers
☆21Aug 2, 2024Updated last year
Alternatives and similar repositories for PoseAwareVT
Users who are interested in PoseAwareVT are comparing it to the libraries listed below.
- Official Repository of "Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads"☆17Oct 6, 2025Updated 7 months ago
- Video + CLIP Baseline for Ego4D Long Term Action Anticipation Challenge (CVPR 2022)☆15Jul 4, 2022Updated 3 years ago
- This is the official repository of LLAVIDAL☆24Oct 4, 2025Updated 7 months ago
- [CVPR 2024] Code and models for pi-ViT, a video transformer for understanding activities of daily living☆31Nov 12, 2025Updated 6 months ago
- Code for NeurIPS 2022 paper "Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space"☆20Apr 20, 2023Updated 3 years ago
- [WACV 2024] Code for "Limited Data, Unlimited Potential: A Study on ViTs Augmented by Masked Autoencoders"☆25Aug 16, 2024Updated last year
- [WIP] Code for LangToMo☆21Mar 19, 2026Updated 2 months ago
- Environments for Active Vision Reinforcement Learning☆30Oct 10, 2024Updated last year
- ☆20Jan 29, 2023Updated 3 years ago
- Code for our ACL 2025 paper "Language Repository for Long Video Understanding"☆36Jun 17, 2024Updated last year
- [ICRA'24] Crossway Diffusion: Improving Diffusion-based Visuomotor Policy via Self-supervised Learning☆72Aug 4, 2024Updated last year
- Pose driven attention mechanism☆44Mar 31, 2022Updated 4 years ago
- WACV 2024: "PathLDM: Text conditioned Latent Diffusion Model for Histopathology"☆50Jul 7, 2024Updated last year
- Implementation of paper 'Helping Hands: An Object-Aware Ego-Centric Video Recognition Model'☆33Nov 7, 2023Updated 2 years ago
- 🤖 [ICLR'25] Multimodal Video Understanding Framework (MVU)☆56Jan 31, 2025Updated last year
- Code for NeurIPS 2023 paper "Active Vision Reinforcement Learning with Limited Visual Observability"☆54Oct 10, 2024Updated last year
- Tools for Toyota Smarthome datasets☆14Nov 16, 2022Updated 3 years ago
- A Decade of Action Quality Assessment: Largest Systematic Survey of Trends, Challenges, and Future Directions☆15Jan 22, 2026Updated 4 months ago
- The Official PyTorch implementation of "Part Aware Contrastive Learning for Self-Supervised Action Recognition" in IJCAI 2023☆13Nov 9, 2023Updated 2 years ago
- Code for LifelongMemory: Leveraging LLMs for Answering Queries in Long-form Egocentric Videos☆30Oct 27, 2025Updated 7 months ago
- PyTorch I3D implementation on Toyota Smarthome Dataset☆17Apr 23, 2022Updated 4 years ago
- Package (ROS 1 & ROS 2) for human keypoints identification, 3D reconstruction, tracking, and filtering in collaborative robotics.☆18Nov 20, 2025Updated 6 months ago
- [CAC2023] Bilateral Network with Residual U-blocks and Dual-Guided Attention for Real-time Semantic Segmentation☆11Nov 28, 2024Updated last year
- ☆14Jun 25, 2022Updated 3 years ago
- This code is provided for reproducibility of results in the paper: Multiview Aerial Visual Recognition (MAVREC): Can Multi-view Improve A…☆23Feb 6, 2025Updated last year
- Code for our WACV 2021 paper "Exploiting the Redundancy in Convolutional Filters for Parameter Reduction"☆11Jan 6, 2021Updated 5 years ago
- [AAAI 2025] Official Repository of 'SKI Models: Skeleton Induced Vision-Language Embeddings for Understanding Activities of Daily Living'☆23Sep 17, 2025Updated 8 months ago
- ☆25Dec 22, 2024Updated last year
- ☆18May 21, 2024Updated 2 years ago
- Parisian sidewalks☆14Mar 1, 2021Updated 5 years ago
- Repository for "Pose Forecasting in Industrial Human-Robot Collaboration" (ECCV 2022)☆36Nov 9, 2022Updated 3 years ago
- dist☆10Dec 14, 2018Updated 7 years ago
- [CVPR 2026] Official Repository of 'MS-Temba: Multi-Scale Temporal Mamba for Understanding Long Untrimmed Videos'☆45Jan 23, 2026Updated 4 months ago
- Code of "Learning Feature Recovery Transformer for Occluded Person Re-identification" (TIP)☆10Dec 28, 2022Updated 3 years ago
- CNNs for Ego-Motion Estimation of Micro Air Vehicles with a Downward-facing Camera☆17Apr 27, 2021Updated 5 years ago
- [ACM MM 2024] Frequency Guidance Matters: Skeletal Action Recognition by Frequency-Aware Mixed Transformer☆20Apr 28, 2026Updated last month
- ICLR 2021 i-Mix: A Domain-Agnostic Strategy for Contrastive Representation Learning☆80Dec 22, 2023Updated 2 years ago
- Train I3D on NTU-RGB+D dataset in keras☆11Feb 5, 2019Updated 7 years ago
- Theia: Distilling Diverse Vision Foundation Models for Robot Learning☆276Nov 6, 2025Updated 6 months ago