Code for the paper Seeing the Pose in the Pixels: Learning Pose-Aware Representations in Vision Transformers
☆21Aug 2, 2024Updated last year
Alternatives and similar repositories for PoseAwareVT
Users that are interested in PoseAwareVT are comparing it to the libraries listed below
Sorting:
- Official Repository of "Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads"☆17Oct 6, 2025Updated 5 months ago
- Video + CLIP Baseline for Ego4D Long Term Action Anticipation Challenge (CVPR 2022)☆15Jul 4, 2022Updated 3 years ago
- ☆18Dec 17, 2022Updated 3 years ago
- Code for NeurIPS 2022 paper "Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space"☆20Apr 20, 2023Updated 2 years ago
- [WIP] Code for LangToMo☆20Jun 25, 2025Updated 8 months ago
- This repository contains the implementation for our work "TopoDiffusionNet: A Topology-aware Diffusion Model", accepted to ICLR 2025.☆21Apr 17, 2025Updated 10 months ago
- Environments for Active Vision Reinforcement Learning☆28Oct 10, 2024Updated last year
- [ECCV 2024] Official Implementation of CoPT: Unsupervised Domain Adaptive Segmentation using Domain-Agnostic Text Embeddings☆11Feb 24, 2025Updated last year
- [WACV 2024] Code for "Limited Data, Unlimited Potential: A Study on ViTs Augmented by Masked Autoencoders"☆25Aug 16, 2024Updated last year
- Code for our ACL 2025 paper "Language Repository for Long Video Understanding"☆34Jun 17, 2024Updated last year
- [ICRA'24] Crossway Diffusion: Improving Diffusion-based Visuomotor Policy via Self-supervised Learning☆70Aug 4, 2024Updated last year
- [CAC2023] Bilateral Network with Residual U-blocks and Dual-Guided Attention for Real-time Semantic Segmentation☆11Nov 28, 2024Updated last year
- 🤖 [ICLR'25] Multimodal Video Understanding Framework (MVU)☆56Jan 31, 2025Updated last year
- Code for NeurIPS 2023 paper "Active Vision Reinforcement Learning with Limited Visual Observability"☆54Oct 10, 2024Updated last year
- Code for LifelongMemory: Leveraging LLMs for Answering Queries in Long-form Egocentric Videos☆28Oct 27, 2025Updated 4 months ago
- Peekaboo: Text to Image Diffusion Models are Zero-Shot Segmentors☆31Jun 2, 2024Updated last year
- Implementation of paper 'Helping Hands: An Object-Aware Ego-Centric Video Recognition Model'☆33Nov 7, 2023Updated 2 years ago
- Repository for "Pose Forecasting in Industrial Human-Robot Collaboration" (ECCV 2022)☆35Nov 9, 2022Updated 3 years ago
- Perceptual Grouping in Contrastive Vision-Language Models (ICCV'23)☆37Jan 1, 2024Updated 2 years ago
- Collaboration of two technologies (Machine Learning and TCAD) to improve the productivity in Semiconductor manufacturing industry☆10May 3, 2019Updated 6 years ago
- Train I3D on NTU-RGB+D dataset in keras☆12Feb 5, 2019Updated 7 years ago
- This is a python library. Install with "python3 -m pip install rp" then run with "python3 -m rp" or just "rp". Requires python≥3.5☆13Feb 16, 2026Updated 3 weeks ago
- Repository for CHUNGUS (RA-L 2025)☆16May 2, 2025Updated 10 months ago
- YOLOv5 in Pytorch and TensorRT with ROS system implementation☆10Mar 6, 2022Updated 4 years ago
- 🔥[T-ITS 2025, Official Code] for paper "Evidence-based Real-time Road Segmentation with RGB-D Data Augmentation". Official Weights and D…☆15Apr 29, 2025Updated 10 months ago
- ☆15Dec 2, 2025Updated 3 months ago
- Code of "Learning Feature Recovery Transformer for Occluded Person Re-identification" (TIP)☆10Dec 28, 2022Updated 3 years ago
- Official implementation of our CVPR'22 paper.☆13Nov 18, 2022Updated 3 years ago
- Official Repository for CSR - ICML 2025 Oral☆21Feb 28, 2026Updated last week
- Zero-Cost Whole-Body Teleoperation for Mobile Manipulation☆11Mar 4, 2025Updated last year
- Cosine Similarity Fusion Network for Real-Time RGB-X Semantic Segmentation of Driving Scenes☆18Aug 24, 2025Updated 6 months ago
- Modeling Human Motion Using Binary Latent Variables☆10Sep 30, 2017Updated 8 years ago
- dist☆10Dec 14, 2018Updated 7 years ago
- ☆14Dec 6, 2023Updated 2 years ago
- ☆11May 24, 2023Updated 2 years ago
- ☆14Nov 10, 2024Updated last year
- ☆12Apr 14, 2025Updated 10 months ago
- ☆10Aug 25, 2022Updated 3 years ago
- 使用3d激光雷达数据进行障碍物的聚类感知和跟踪,主要参照autoware.ai和跟踪算法(https://github.com/k0suke-murakami/object_tracking)☆12May 27, 2022Updated 3 years ago