junkunyuan / HAP
[NeurIPS 2023] HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception
☆40 · Updated last year
Alternatives and similar repositories for HAP:
Users who are interested in HAP are comparing it to the repositories listed below.
- The official code of "PLIP: Language-Image Pre-training for Person Representation Learning" ☆108 · Updated 3 months ago
- Official PyTorch implementation of UniHCP ☆156 · Updated last year
- (TPAMI 2024) Official implementation of the paper "A Versatile Framework for Multi-scene Person Re-identification" ☆38 · Updated 11 months ago
- PersonViT: Large-scale Self-supervised Vision Transformer for Person Re-Identification ☆20 · Updated last year
- [CVPR2024] UFineBench: Towards Text-based Person Retrieval with Ultra-fine Granularity ☆59 · Updated 6 months ago
- [ECCV2022] PASS: Part-Aware Self-Supervised Pre-Training for Person Re-Identification ☆62 · Updated 2 years ago
- [NeurIPS2024] Cross-video Identity Correlating for Person Re-identification Pre-training ☆78 · Updated 3 months ago
- TF-CLIP: Learning Text-Free CLIP for Video-Based Person Re-identification (AAAI2024) ☆46 · Updated 11 months ago
- Large-Scale Pre-training for Person Re-identification with Noisy Labels (LUPerson-NL) ☆75 · Updated 2 years ago
- Learning Clothing and Pose Invariant 3D Shape Representation for Long-Term Person Re-Identification (ICCV 2023) ☆21 · Updated last year
- Code for Harnessing the Power of MLLMs for Transferable Text-to-Image Person ReID (CVPR 2024) ☆57 · Updated 8 months ago
- Code for "LocLLM: Exploiting Generalizable Human Keypoint Localization via Large Language Model", CVPR 2024 Highlight ☆40 · Updated 9 months ago
- [ICCV 2023] The official PyTorch code for Group Pose: A Simple Baseline for End-to-End Multi-person Pose Estimation ☆86 · Updated last year
- [arXiv'21, ICASSP'23] Global-Local Context Network for Person Search. ☆29 · Updated 8 months ago
- Recognize Any Regions ☆122 · Updated 3 months ago
- ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning ☆41 · Updated last year
- Official implementation of "A Simple Visual-Textual Baseline for Pedestrian Attribute Recognition" [TCSVT 2022] ☆30 · Updated last year
- [NeurIPS 2023] Official implementation of the paper "DAC-DETR: Divide the Attention Layers and Conquer". ☆54 · Updated 9 months ago
- The official code of "Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search Benchmark" ☆151 · Updated 7 months ago
- InstaGen: Enhancing Object Detection by Training on Synthetic Dataset, CVPR2024 ☆80 · Updated 11 months ago
- [NeurIPS 2022 Spotlight] VideoMAE for Action Detection ☆59 · Updated 2 years ago
- MMPD Dataset from ECCV'2024 "When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset" ☆15 · Updated 8 months ago