Code for the paper Seeing the Pose in the Pixels: Learning Pose-Aware Representations in Vision Transformers
☆21Aug 2, 2024Updated last year
Alternatives and similar repositories for PoseAwareVT
Users that are interested in PoseAwareVT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Video + CLIP Baseline for Ego4D Long Term Action Anticipation Challenge (CVPR 2022)☆15Jul 4, 2022Updated 3 years ago
- [CVPR 2024] Code and models for pi-ViT, a video transformer for understanding activities of daily living☆30Nov 12, 2025Updated 4 months ago
- ☆18Dec 17, 2022Updated 3 years ago
- [WACV 2024] Code for "Limited Data, Unlimited Potential: A Study on ViTs Augmented by Masked Autoencoders"☆25Aug 16, 2024Updated last year
- An unofficial pytorch dataloader for Open X-Embodiment Datasets https://github.com/google-deepmind/open_x_embodiment☆25Jan 9, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [ECCV 2024] Official Implementation of CoPT: Unsupervised Domain Adaptive Segmentation using Domain-Agnostic Text Embeddings☆11Feb 24, 2025Updated last year
- Environments for Active Vision Reinforcement Learning☆29Oct 10, 2024Updated last year
- ☆20Jan 29, 2023Updated 3 years ago
- Code for our ACL 2025 paper "Language Repository for Long Video Understanding"☆36Jun 17, 2024Updated last year
- [ICRA'24] Crossway Diffusion: Improving Diffusion-based Visuomotor Policy via Self-supervised Learning☆70Aug 4, 2024Updated last year
- Pose driven attention mechanism☆44Mar 31, 2022Updated 3 years ago
- Implementation of paper 'Helping Hands: An Object-Aware Ego-Centric Video Recognition Model'☆33Nov 7, 2023Updated 2 years ago
- 🤖 [ICLR'25] Multimodal Video Understanding Framework (MVU)☆57Jan 31, 2025Updated last year
- Tools for Toyota Smarthome datasets☆14Nov 16, 2022Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Code repository for Computer Vision Projects with Python 3, Published By Packt☆14Jan 15, 2021Updated 5 years ago
- Pytorch I3D implmentation on Toyota Smarthome Dataset☆17Apr 23, 2022Updated 3 years ago
- This is a repo of extension of VPN for Recognition of Activities of Daily Living☆16May 17, 2021Updated 4 years ago
- Peekaboo: Text to Image Diffusion Models are Zero-Shot Segmentors☆31Jun 2, 2024Updated last year
- Test-Time Personalization with a Transformer for Human Pose Estimation, NeurIPS 2021☆45Oct 23, 2021Updated 4 years ago
- Package (ROS 1 & ROS 2) for human keypoints identification, 3D reconstruction, tracking, and filtering in collaborative robotics.☆16Nov 20, 2025Updated 4 months ago
- [CAC2023] Bilateral Network with Residual U-blocks and Dual-Guided Attention for Real-time Semantic Segmentation☆11Nov 28, 2024Updated last year
- ☆14Jun 25, 2022Updated 3 years ago
- Code for our WACV 2021 paper "Exploiting the Redundancy in Convolutional Filters for Parameter Reduction"☆11Jan 6, 2021Updated 5 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- ☆23Dec 22, 2024Updated last year
- [CVPR 2026] Official Repository of 'MS-Temba: Multi-Scale Temporal Mamba for Understanding Long Untrimmed Videos'☆39Jan 23, 2026Updated 2 months ago
- [ACM MM 2024] Frequency Guidance Matters: Skeletal Action Recognition by Frequency-Aware Mixed Transformer☆20Aug 15, 2025Updated 7 months ago
- ICLR 2021 i-Mix: A Domain-Agnostic Strategy for Contrastive Representation Learning☆80Dec 22, 2023Updated 2 years ago
- Pseudo Labelling on MNIST dataset in Tensorflow 2.x☆10Jul 12, 2022Updated 3 years ago
- Train I3D on NTU-RGB+D dataset in keras☆11Feb 5, 2019Updated 7 years ago
- Theia: Distilling Diverse Vision Foundation Models for Robot Learning☆272Nov 6, 2025Updated 4 months ago
- [ECML-PKDD 2025] Official Implementation of "Trajectory Imputation in Multi-Agent Sports with Derivative-Accumulating Self-Ensemble".☆14Jun 20, 2025Updated 9 months ago
- Perceptual Grouping in Contrastive Vision-Language Models (ICCV'23)☆37Jan 1, 2024Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆13Jan 22, 2025Updated last year
- ☆12Dec 4, 2020Updated 5 years ago
- Human Co-Parsing Guided Alignment for Occluded Person Re-identification(IEEE T-IP 23)☆14Aug 30, 2024Updated last year
- ☆20Jun 3, 2020Updated 5 years ago
- Object detection using Single-Shot-Detection architecture using MobileNet as the basenet☆15Jul 5, 2019Updated 6 years ago
- Mutual Distillation Learning For Person Re-identification☆19Jan 30, 2024Updated 2 years ago
- PyTorch training at CSCS☆20Jul 4, 2025Updated 8 months ago