mondalanindya / OmniCount
[AAAI 2025] Official code for "OmniCount: Multi-label Object Counting with Semantic-Geometric Priors"
☆19 · Updated last month
Alternatives and similar repositories for OmniCount
Users who are interested in OmniCount are comparing it to the repositories listed below.
- [CVPR 2023] CrowdCLIP: Unsupervised Crowd Counting via Vision-Language Model ☆89 · Updated 2 years ago
- Official implementation of the CVPR 2025 paper "T2ICount: Enhancing Cross-modal Understanding for Zero-shot Counting" ☆18 · Updated 4 months ago
- TF-CLIP: Learning Text-Free CLIP for Video-Based Person Re-identification (AAAI 2024) ☆54 · Updated last year
- 【AAAI2024】TOP-ReID: Multi-spectral Object Re-Identification with Token Permutation ☆64 · Updated 10 months ago
- Code for "Harnessing the Power of MLLMs for Transferable Text-to-Image Person ReID" (CVPR 2024) ☆75 · Updated last year
- PyTorch implementation of the paper "DR.VIC: Decomposition and Reasoning for Video Individual Counting" (CVPR 2022) ☆58 · Updated 2 years ago
- Official implementation of the paper "OvarNet: Towards Open-vocabulary Object Attribute Recognition" ☆105 · Updated 2 years ago
- [CVPR2024] UFineBench: Towards Text-based Person Retrieval with Ultra-fine Granularity ☆66 · Updated 11 months ago
- [ICCV 2023] Integrally Migrating Pre-trained Transformer Encoder-decoders for Visual Object Detection ☆73 · Updated 10 months ago
- The official code of "PLIP: Language-Image Pre-training for Person Representation Learning" ☆120 · Updated 8 months ago
- The official code of "Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search Benchmark" ☆162 · Updated last month
- ☆12 · Updated last year
- ☆14 · Updated last year
- [NeurIPS2024] Cross-video Identity Correlating for Person Re-identification Pre-training ☆91 · Updated 2 months ago
- (TPAMI 2024) Official implementation of the paper "A Versatile Framework for Multi-scene Person Re-identification" ☆44 · Updated last year
- A DETR-style framework for open-vocabulary detection (OVD). CVPR 2023 ☆199 · Updated 2 years ago
- [ACM MM23] CLIP-Count: Towards Text-Guided Zero-Shot Object Counting ☆113 · Updated last year
- View-decoupled Transformer for Person Re-identification under Aerial-ground Camera Network (CVPR'24) ☆43 · Updated last year
- [ACM MM 2024] Hierarchical Multimodal Fine-grained Modulation for Visual Grounding. ☆55 · Updated 3 weeks ago
- Code for "Prototypical Contrastive Learning-based CLIP Fine-tuning for Object Re-identification". ☆31 · Updated last year
- [AAAI 2024] VLCounter: Text-aware Visual Representation for Zero-Shot Object Counting ☆42 · Updated 9 months ago
- ☆125 · Updated last year
- Human-centered Interactive Learning via MLLMs for Text-to-Image Person Re-identification (CVPR 2025 PyTorch code) ☆26 · Updated last month
- Composed Person Retrieval (CPR) is a new cross-modal retrieval task that aims to identify individuals in large-scale person image databas… ☆26 · Updated 3 months ago
- Multi-Granularity Language-Guided Multi-Object Tracking ☆23 · Updated 3 weeks ago
- ☆17 · Updated last year
- ☆41 · Updated last year
- ☆20 · Updated last year
- ☆23 · Updated 6 months ago
- 【CVPR2024】Magic Tokens: Select Diverse Tokens for Multi-modal Object Re-Identification ☆109 · Updated 10 months ago