Event-AHU / VTF_PAR
[CVPR-2023 Workshop@NFVLR] Official PyTorch implementation of Learning CLIP Guided Visual-Text Fusion Transformer for Video-based Pedestrian Attribute Recognition
☆23Updated 8 months ago
Alternatives and similar repositories for VTF_PAR:
Users that are interested in VTF_PAR are comparing it to the libraries listed below
- [CVPR2024] UFineBench: Towards Text-based Person Retrieval with Ultra-fine Granularity☆58Updated 4 months ago
- Official implementation of "A Simple Visual-Textual Baseline for Pedestrian Attribute Recognition" [TCSVT 2022]☆30Updated last year
- Code for Harnessing the Power of MLLMs for Transferable Text-to-Image Person ReID (CVPR 2024)☆52Updated 7 months ago
- Official pytorch implementation of the ICML2024 main conference paper: Pedestrian Attribute Recognition as Label-balanced Multi-label Lea…☆11Updated 7 months ago
- 【AAAI2024】TOP-ReID: Multi-spectral Object Re-Identification with Token Permutation☆54Updated 3 months ago
- (TPAMI2024) Official implementation of Paper ''A Versatile Framework for Multi-scene Person Re-identification''☆36Updated 10 months ago
- 【CVPR2024】Magic Tokens: Select Diverse Tokens for Multi-modal Object Re-Identification☆94Updated 3 months ago
- TF-CLIP: Learning Text-Free CLIP for Video-Based Person Re-identification (AAAI2024)☆44Updated 10 months ago
- ☆10Updated last year
- [NeurIPS 2023] HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception☆40Updated 10 months ago
- The official code of "Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search Benchmark"☆149Updated 6 months ago
- [ECCV 2024] SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding☆49Updated 4 months ago
- OvarNet official implement of the paper "OvarNet: Towards Open-vocabulary Object Attribute Recognition"☆98Updated last year
- Improving Mamaba performance on Video Understanding task☆35Updated 4 months ago
- The official code of "PLIP: Language-Image Pre-training for Person Representation Learning"☆106Updated 2 months ago
- 【AAAI 2024】An Empirical Study of CLIP for Text-based Person Search☆55Updated 10 months ago
- ☆28Updated last year
- [ACM MM23] CLIP-Count: Towards Text-Guided Zero-Shot Object Counting☆101Updated 11 months ago
- LP-OVOD: Open-Vocabulary Object Detection by Linear Probing (WACV 2024)☆23Updated 6 months ago
- ☆36Updated last month
- Code for "Prototypical Contrastive Learning-based CLIP Fine-tuning for Object Re-identification".☆22Updated 10 months ago
- CounTR: Transformer-based Generalised Visual Counting☆104Updated 7 months ago
- PersonViT: Large-scale Self-supervised Vision Transformer for Person Re-Identification☆20Updated last year
- [ICLR 2024 poster] Efficient Modulation for Vision Networks☆51Updated 7 months ago
- CLIP-Driven Fine-grained Text-Image Person Re-identification☆41Updated last year
- [CVPR2024]Day-Night Cross-domain Vehicle Re-identification☆33Updated 3 months ago
- ☆15Updated 9 months ago
- [NeurIPS 2024] OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling.☆18Updated 3 weeks ago
- [OpenPAR] An open-source framework for Pedestrian Attribute Recognition, based on PyTorch☆107Updated this week
- This is the official PyTorch implementation of ASAG (ICCV 2023).☆18Updated last year