cha15yq / T2ICount
Official implementation of the CVPR 2025 paper: "T2ICount: Enhancing Cross-modal Understanding for zero-shot Counting"
☆21 · Updated 8 months ago
Alternatives and similar repositories for T2ICount
Users who are interested in T2ICount are comparing it to the repositories listed below
- [CVPR 2023] CrowdCLIP: Unsupervised Crowd Counting via Vision-Language Model ☆91 · Updated 2 years ago
- [AAAI 2024] VLCounter: Text-aware Visual Representation for Zero-Shot Object Counting ☆43 · Updated last year
- [ICCV 2023] Integrally Migrating Pre-trained Transformer Encoder-decoders for Visual Object Detection ☆73 · Updated last year
- CLIP the Gap CVPR 2023 ☆82 · Updated 2 years ago
- [ECCV2024] Official implementation of Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded Scenes ☆94 · Updated 7 months ago
- [ACM MM23] CLIP-Count: Towards Text-Guided Zero-Shot Object Counting ☆121 · Updated last year
- CLIMB-ReID: A Hybrid CLIP-Mamba Framework for Person Re-Identification(AAAI2025) ☆40 · Updated last month
- Official implementation of paper: Masked Retraining Teacher-student Framework for domain adaptive object detection. (ICCV2023) ☆44 · Updated 2 years ago
- 【AAAI2024】TOP-ReID: Multi-spectral Object Re-Identification with Token Permutation ☆69 · Updated last year
- Code release for "Active Teacher for Semi-Supervised Object Detection", CVPR2022 ☆84 · Updated 2 years ago
- [CVPR 2024] Exploring Orthogonality in Open World Object Detection ☆55 · Updated 7 months ago
- Source code of the paper Fine-Grained Visual Classification via Internal Ensemble Learning Transformer ☆54 · Updated last year
- 【CVPR2024】Magic Tokens: Select Diverse Tokens for Multi-modal Object Re-Identification ☆113 · Updated last year
- (CVPR2023/TPAMI2024) Integrally Pre-Trained Transformer Pyramid Networks -- A Hierarchical Vision Transformer for Masked Image Modeling ☆210 · Updated last year
- Code release for Dilated-Scale-Aware Category-Attention ConvNet for Multi-Class Object Counting ☆23 · Updated 2 years ago
- [ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction ☆200 · Updated last year
- Official Implementation of "ConfMix: Unsupervised Domain Adaptation for Object Detection via Confidence-based Mixing", WACV 2023 ☆67 · Updated 2 years ago
- [ACM MM 2024] Hierarchical Multimodal Fine-grained Modulation for Visual Grounding. ☆57 · Updated last month
- [ICCV 2023] Point-Query Quadtree for Crowd Counting, Localization, and More ☆80 · Updated 8 months ago
- 【CVPR2025】IDEA: Inverted Text with Cooperative Deformable Aggregation for Multi-modal Object Re-Identification ☆43 · Updated 8 months ago
- NTIRE 2025 Challenge on 1-st Cross-Domain Few-Shot Object Detection @ CVPR 2025 ☆65 · Updated 8 months ago
- SeCap: Self-Calibrating and Adaptive Prompts for Cross-view Person Re-Identification in Aerial-Ground Networks (CVPR'25) ☆19 · Updated 5 months ago
- CVPR-2023 paper "Optimal Transport Minimization: Crowd Localization on Density Maps for Semi-Supervised Counting" ☆26 · Updated 2 years ago
- [CVPR 2024] The official implementation for "MS-DETR: Efficient DETR Training with Mixed Supervision" ☆119 · Updated last year