cha15yq / T2ICountLinks
Official implement of CVPR2025 paper: "T2ICount: Enhancing Cross-modal Understanding for zero-shot Counting"
☆21Updated 6 months ago
Alternatives and similar repositories for T2ICount
Users that are interested in T2ICount are comparing it to the libraries listed below
Sorting:
- [AAAI 2024] VLCounter: Text-aware Visual Representation for Zero-Shot Object Counting☆42Updated 11 months ago
- [CVPR 2023] CrowdCLIP: Unsupervised Crowd Counting via Vision-Language Model☆90Updated 2 years ago
- 【AAAI2024】TOP-ReID: Multi-spectral Object Re-Identification with Token Permutation☆65Updated last year
- [CVPR 2024] Hybrid Proposal Refiner: Revisiting DETR Series from the Faster R-CNN Perspective☆20Updated last year
- [ICCV 2023] Integrally Migrating Pre-trained Transformer Encoder-decoders for Visual Object Detection☆73Updated last year
- CLIP the Gap CVPR 2023☆82Updated 2 years ago
- [CVPR 2023] CMT: Contrastive Mean Teacher for Domain Adaptive Object Detectors☆44Updated 2 years ago
- 【CVPR2024】Magic Tokens: Select Diverse Tokens for Multi-modal Object Re-Identification☆111Updated last year
- Code release for "Active Teacher for Semi-Supervised Object Detection", CVPR2022☆84Updated 2 years ago
- [CVPR 2024] The official implementation for "MS-DETR: Efficient DETR Training with Mixed Supervision"☆118Updated last year
- ☆29Updated last year
- Code for Harnessing the Power of MLLMs for Transferable Text-to-Image Person ReID (CVPR 2024)☆77Updated last year
- [ACM MM 2024] Hierarchical Multimodal Fine-grained Modulation for Visual Grounding.☆54Updated 2 months ago
- (CVPR2023/TPAMI2024) Integrally Pre-Trained Transformer Pyramid Networks -- A Hierarchical Vision Transformer for Masked Image Modeling☆206Updated last year
- [CVPR 2024] Exploring Orthogonality in Open World Object Detection☆54Updated 5 months ago
- 【CVPR2025】IDEA: Inverted Text with Cooperative Deformable Aggregation for Multi-modal Object Re-Identification☆39Updated 6 months ago
- ☆87Updated 2 years ago
- [CVPR 2023] Official implementation of the paper "Semi-DETR: Semi-Supervised Object Detection with Detection Transformers"☆98Updated 11 months ago
- [NeurIPS2024] PLIP: Language-Image Pre-training for Person Representation Learning☆124Updated 10 months ago
- [ECCV2024] Official implementation of Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded Scenes☆90Updated 5 months ago
- Distribution Prototype Diffusion Learning for Open-set Supervised Anomaly Detection CVPR 2025☆19Updated 8 months ago
- Source code of the paper Fine-Grained Visual Classification via Internal Ensemble Learning Transformer☆53Updated last year
- [CVPR 2023] Adaptive Sparse Pairwise Loss for Object Re-Identification☆60Updated 2 years ago
- View-decoupled Transformer for Person Re-identification under Aerial-ground Camera Network (CVPR'24)☆45Updated last year
- [ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction☆195Updated last year
- (TPAMI 2024) Official implementation of Paper ''A Versatile Framework for Multi-scene Person Re-identification''☆48Updated last year
- ☆21Updated last year
- CLIP-Driven Fine-grained Text-Image Person Re-identification☆53Updated last year
- TF-CLIP: Learning Text-Free CLIP for Video-Based Person Re-identification (AAAI2024)☆59Updated last year
- ☆26Updated 2 years ago