[ACM MM23] CLIP-Count: Towards Text-Guided Zero-Shot Object Counting
☆121Mar 20, 2024Updated 2 years ago
Alternatives and similar repositories for CLIP-Count
Users that are interested in CLIP-Count are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Includes FSC-147-D and the code for training and testing the CounTX model from the paper Open-world Text-specified Object Counting.☆41Sep 27, 2024Updated last year
- CounTR: Transformer-based Generalised Visual Counting☆124Jul 11, 2024Updated last year
- [CVPR 2023] CrowdCLIP: Unsupervised Crowd Counting via Vision-Language Model☆92Jul 28, 2023Updated 2 years ago
- [AAAI 2024] VLCounter: Text-aware Visual Representation for Zero-Shot Object Counting☆44Nov 19, 2024Updated last year
- Learning to Count without Annotations☆23May 24, 2024Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆17Jul 26, 2023Updated 2 years ago
- Zero-shot Object Counting with Good Exemplars[ECCV 2024]☆27Sep 27, 2024Updated last year
- This is the official implementation of: Learning to Count Anything: Reference-less Class-agnostic Counting with Weak Supervision Michael …☆38Jul 12, 2024Updated last year
- LOCA - A Low-Shot Object Counting Network With Iterative Prototype Adaptation (ICCV 2023)☆63Jul 3, 2024Updated last year
- Includes the code for training and testing the CountGD model from the paper CountGD: Multi-Modal Open-World Counting.☆321Jun 25, 2025Updated 11 months ago
- Official PyTorch implementation of FusionCount: Efficient Crowd Counting via Multiscale Feature Fusion☆13Oct 25, 2022Updated 3 years ago
- The codes for ACM Multimedia 2023 paper 'DAOT: Domain-Agnostically Aligned Optimal Transport for Domain-Adaptive Crowd Counting. '☆13Jan 12, 2024Updated 2 years ago
- Spatio-channel Attention Blocks for Cross-modal Crowd Counting -- Official Pytorch Implementation (ACCV'22, Oral)☆27Dec 4, 2023Updated 2 years ago
- [ECCV 2022] An End-to-End Transformer Model for Crowd Localization☆114Mar 20, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆16Sep 6, 2024Updated last year
- ☆441Nov 30, 2023Updated 2 years ago
- Official implementation of Vision Transformer Off-the-Shelf: A Surprising Baseline for Few-Shot Class-Agnostic Counting☆24May 3, 2024Updated 2 years ago
- [WACV 2023] Few-shot Object Counting with Similarity-Aware Feature Enhancement☆142Oct 10, 2023Updated 2 years ago
- ☆28Feb 21, 2025Updated last year
- ☆29Jun 10, 2024Updated 2 years ago
- This is a repository about NWPU-MOC dataset and code.☆23Jan 24, 2024Updated 2 years ago
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!☆11May 24, 2023Updated 3 years ago
- GeckoNum Benchmark for T2I Model Eval.☆15Dec 5, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Disentangled Graph Variational Auto-Encoder for Multimodal Recommendation with Interpretability, IEEE TMM☆16Jun 3, 2025Updated last year
- an empirical study on few-shot counting using segment anything (SAM)☆96Apr 25, 2023Updated 3 years ago
- STEERER: Resolving Scale Variations for Counting and Localization via Selective Inheritance Learning, ICCV, 2023☆61Mar 1, 2024Updated 2 years ago
- Examples of Verbalized Machine Learning (VML)☆16Mar 16, 2025Updated last year
- Code Reproduction☆72Feb 28, 2022Updated 4 years ago
- [CVPR 2024] Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Fine-grained Understanding☆56Apr 7, 2025Updated last year
- (CVPR 2024) Point, Segment and Count: A Generalized Framework for Object Counting☆125Nov 12, 2024Updated last year
- PSGCNet: A Pyramidal Scale and Global Context Guided Network for Dense Object Counting in Remote-Sensing Images☆20Jun 13, 2022Updated 4 years ago
- This method uses Segment Anything and CLIP to ground and count any object that matches a custom text prompt, without requiring any point …☆182Apr 22, 2023Updated 3 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- ☆17Nov 29, 2024Updated last year
- Official Implement of ECCV 2024 paper "Multi-modal Crowd Counting via a Broker Modality"☆18Mar 19, 2026Updated 2 months ago
- SVL-Adapter: Self-Supervised Adapter for Vision-Language Pretrained Models☆21Jan 11, 2024Updated 2 years ago
- ☆142Apr 2, 2024Updated 2 years ago
- ☆15Feb 24, 2023Updated 3 years ago
- MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language Knowledge (ICCV 2023)☆31Sep 5, 2023Updated 2 years ago
- 使用ONNXRuntime部署Detic检测2万1千种类别的物体,包含C++和Python两个版本的程序☆17Aug 29, 2023Updated 2 years ago