cha15yq / T2ICountLinks
Official implement of CVPR2025 paper: "T2ICount: Enhancing Cross-modal Understanding for zero-shot Counting"
☆19Updated 5 months ago
Alternatives and similar repositories for T2ICount
Users that are interested in T2ICount are comparing it to the libraries listed below
Sorting:
- [CVPR 2023] CrowdCLIP: Unsupervised Crowd Counting via Vision-Language Model☆90Updated 2 years ago
- [AAAI 2024] VLCounter: Text-aware Visual Representation for Zero-Shot Object Counting☆42Updated 10 months ago
- CLIP the Gap CVPR 2023☆81Updated 2 years ago
- [AAAI2025] Code Release of OV-DQUO: Open-Vocabulary DETR with Denoising Text Query Training and Open-World Unknown Objects Supervision☆28Updated 9 months ago
- ☆29Updated last year
- [ICCV 2023] Integrally Migrating Pre-trained Transformer Encoder-decoders for Visual Object Detection☆73Updated 11 months ago
- (CVPR2023/TPAMI2024) Integrally Pre-Trained Transformer Pyramid Networks -- A Hierarchical Vision Transformer for Masked Image Modeling☆203Updated last year
- [ACM MM 2024] Hierarchical Multimodal Fine-grained Modulation for Visual Grounding.☆55Updated last month
- [CVPR 2024] Hybrid Proposal Refiner: Revisiting DETR Series from the Faster R-CNN Perspective☆20Updated last year
- [CVPR 2024] The official implementation for "MS-DETR: Efficient DETR Training with Mixed Supervision"☆116Updated last year
- 【CVPR2024】Magic Tokens: Select Diverse Tokens for Multi-modal Object Re-Identification☆110Updated 11 months ago
- [CVPR 2023] CMT: Contrastive Mean Teacher for Domain Adaptive Object Detectors☆45Updated 2 years ago
- [CVPR 2023] Official implementation of the paper "Semi-DETR: Semi-Supervised Object Detection with Detection Transformers"☆98Updated 10 months ago
- [ECCV2024] Official implementation of Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded Scenes☆90Updated 4 months ago
- [ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction☆193Updated last year
- ☆21Updated last year
- Source code of the paper Fine-Grained Visual Classification via Internal Ensemble Learning Transformer☆52Updated last year
- [ACM MM23] CLIP-Count: Towards Text-Guided Zero-Shot Object Counting☆116Updated last year
- [CVPR 2024] Exploring Orthogonality in Open World Object Detection☆52Updated 5 months ago
- [WACV 2025] Official code for our paper "Enhancing Novel Object Detection via Cooperative Foundational Models"☆81Updated 6 months ago
- [ECCV 2024] Official implementation of "LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction"☆83Updated 5 months ago
- Code release for "Active Teacher for Semi-Supervised Object Detection", CVPR2022☆84Updated 2 years ago
- A DETR-style framework for open-vocabulary detection (OVD). CVPR 2023☆199Updated 2 years ago
- 【IJCAI 2023】RaSa: Relation and Sensitivity Aware Representation Learning for Text-based Person Search☆65Updated 2 years ago
- Official implementation of paper: Masked Retraining Teacher-student Framework for domain adaptive object detection. (ICCV2023)☆41Updated last year
- Pytorch implementation of "Fine-grained Visual Classification with High-temperature Refinement and Background Suppression"☆108Updated last year
- [NeurIPS2024] - SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion☆92Updated 4 months ago
- Official Implementation of "ConfMix: Unsupervised Domain Adaptation for Object Detection via Confidence-based Mixing", WACV 2023☆64Updated 2 years ago
- Project Page for "Multi-Task Dense Prediction via Mixture of Low-Rank Experts"☆82Updated 3 months ago
- [CVPR'23] A Simple Framework for Text-Supervised Semantic Segmentation☆60Updated 8 months ago