Event-AHU / VTF_PAR
[CVPR-2023 Workshop@NFVLR] Official PyTorch implementation of Learning CLIP Guided Visual-Text Fusion Transformer for Video-based Pedestrian Attribute Recognition
☆23 Updated 5 months ago
Related projects
Alternatives and complementary repositories for VTF_PAR
- 【CVPR2024】Magic Tokens: Select Diverse Tokens for Multi-modal Object Re-Identification ☆86 Updated last month
- TF-CLIP: Learning Text-Free CLIP for Video-Based Person Re-identification (AAAI2024) ☆41 Updated 7 months ago
- Official pytorch implementation of the ICML2024 main conference paper: Pedestrian Attribute Recognition as Label-balanced Multi-label Lea… ☆9 Updated 4 months ago
- The official code of "PLIP: Language-Image Pre-training for Person Representation Learning" ☆102 Updated last year
- Official implementation of "A Simple Visual-Textual Baseline for Pedestrian Attribute Recognition" [TCSVT 2022] ☆29 Updated 10 months ago
- (TPAMI2024) Official implementation of Paper "A Versatile Framework for Multi-scene Person Re-identification" ☆34 Updated 7 months ago
- 【AAAI2024】TOP-ReID: Multi-spectral Object Re-Identification with Token Permutation ☆51 Updated last month
- [CVPR2024] UFineBench: Towards Text-based Person Retrieval with Ultra-fine Granularity ☆54 Updated last month
- [NeurIPS2024] Cross-video Identity Correlating for Person Re-identification Pre-training ☆66 Updated last month
- InstaGen: Enhancing Object Detection by Training on Synthetic Dataset, CVPR2024 ☆73 Updated 7 months ago
- Code for Harnessing the Power of MLLMs for Transferable Text-to-Image Person ReID (CVPR 2024) ☆39 Updated 4 months ago
- [NeurIPS 2023] HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception ☆39 Updated 8 months ago
- [ECCV2024] Official implementation of the paper "When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dat… ☆46 Updated 4 months ago
- [NeurIPS 2024 Spotlight ⭐️] Parameter-Inverted Image Pyramid Networks (PIIP) ☆58 Updated this week
- The official code of "Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search Benchmark" ☆139 Updated 3 months ago
- [CVPR 2023] CrowdCLIP: Unsupervised Crowd Counting via Vision-Language Model ☆74 Updated last year
- [ECCV 2024] SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding ☆40 Updated last month
- View-decoupled Transformer for Person Re-identification under Aerial-ground Camera Network (CVPR'24) ☆35 Updated 7 months ago
- [CVPR2024] Day-Night Cross-domain Vehicle Re-identification ☆22 Updated 3 weeks ago
- [OpenPAR] An open-source framework for Pedestrian Attribute Recognition, based on PyTorch ☆83 Updated 2 weeks ago
- [ICLR 2024 poster] Efficient Modulation for Vision Networks ☆48 Updated 4 months ago
- [ACM MM23] CLIP-Count: Towards Text-Guided Zero-Shot Object Counting ☆91 Updated 8 months ago
- [ICCV 2023] Integrally Migrating Pre-trained Transformer Encoder-decoders for Visual Object Detection ☆66 Updated last month
- PersonViT: Large-scale Self-supervised Vision Transformer for Person Re-Identification ☆13 Updated last year
- Improving Mamba performance on Video Understanding tasks ☆32 Updated last month
- Code for "Prototypical Contrastive Learning-based CLIP Fine-tuning for Object Re-identification". ☆18 Updated 7 months ago