[ACM MM23] CLIP-Count: Towards Text-Guided Zero-Shot Object Counting
☆121Mar 20, 2024Updated 2 years ago
Alternatives and similar repositories for CLIP-Count
Users that are interested in CLIP-Count are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Includes FSC-147-D and the code for training and testing the CounTX model from the paper Open-world Text-specified Object Counting.☆41Sep 27, 2024Updated last year
- CounTR: Transformer-based Generalised Visual Counting☆124Jul 11, 2024Updated last year
- [AAAI 2024] VLCounter: Text-aware Visual Representation for Zero-Shot Object Counting☆44Nov 19, 2024Updated last year
- Learning to Count without Annotations☆23May 24, 2024Updated last year
- ☆16Jul 26, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Zero-shot Object Counting with Good Exemplars[ECCV 2024]☆27Sep 27, 2024Updated last year
- CVPR2023 Zero-shot Counting☆60Mar 23, 2025Updated last year
- This is the official implementation of: Learning to Count Anything: Reference-less Class-agnostic Counting with Weak Supervision Michael …☆39Jul 12, 2024Updated last year
- LOCA - A Low-Shot Object Counting Network With Iterative Prototype Adaptation (ICCV 2023)☆60Jul 3, 2024Updated last year
- Includes the code for training and testing the CountGD model from the paper CountGD: Multi-Modal Open-World Counting.☆315Jun 25, 2025Updated 10 months ago
- The official implementation of the crowd counting model CLIP-EBC.☆93Jul 17, 2024Updated last year
- Official PyTorch implementation of FusionCount: Efficient Crowd Counting via Multiscale Feature Fusion☆13Oct 25, 2022Updated 3 years ago
- Spatio-channel Attention Blocks for Cross-modal Crowd Counting -- Official Pytorch Implementation (ACCV'22, Oral)☆27Dec 4, 2023Updated 2 years ago
- [ECCV 2022] An End-to-End Transformer Model for Crowd Localization☆113Mar 20, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆16Sep 6, 2024Updated last year
- ☆436Nov 30, 2023Updated 2 years ago
- Official implementation of Vision Transformer Off-the-Shelf: A Surprising Baseline for Few-Shot Class-Agnostic Counting☆23May 3, 2024Updated 2 years ago
- [WACV 2023] Few-shot Object Counting with Similarity-Aware Feature Enhancement☆142Oct 10, 2023Updated 2 years ago
- ☆29Jun 10, 2024Updated last year
- PyTorch implementations of the paper: "DR.VIC: Decomposition and Reasoning for Video Individual Counting, CVPR, 2022"☆58Jun 12, 2023Updated 2 years ago
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!☆11May 24, 2023Updated 2 years ago
- GeckoNum Benchmark for T2I Model Eval.☆15Dec 5, 2024Updated last year
- an empirical study on few-shot counting using segment anything (SAM)☆95Apr 25, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Examples of Verbalized Machine Learning (VML)☆16Mar 16, 2025Updated last year
- Code Reproduction☆72Feb 28, 2022Updated 4 years ago
- [CVPR 2024] Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Fine-grained Understanding☆56Apr 7, 2025Updated last year
- Few-shot Object Counting and Detection (ECCV 2022)☆83Nov 12, 2024Updated last year
- (CVPR 2024) Point, Segment and Count: A Generalized Framework for Object Counting☆125Nov 12, 2024Updated last year
- PSGCNet: A Pyramidal Scale and Global Context Guided Network for Dense Object Counting in Remote-Sensing Images☆20Jun 13, 2022Updated 3 years ago
- Official Pytorch implementation for "AttentionHand: Text-driven Controllable Hand Image Generation for 3D Hand Reconstruction in the Wild…☆12Jun 26, 2025Updated 10 months ago
- This method uses Segment Anything and CLIP to ground and count any object that matches a custom text prompt, without requiring any point …☆178Apr 22, 2023Updated 3 years ago
- Official repo for CVPR2024 paper "Single Domain Generalization for Crowd Counting"☆97Mar 31, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆17Nov 29, 2024Updated last year
- Official Implement of ECCV 2024 paper "Multi-modal Crowd Counting via a Broker Modality"☆17Mar 19, 2026Updated last month
- SVL-Adapter: Self-Supervised Adapter for Vision-Language Pretrained Models☆21Jan 11, 2024Updated 2 years ago
- ☆138Apr 2, 2024Updated 2 years ago
- MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language Knowledge (ICCV 2023)☆31Sep 5, 2023Updated 2 years ago
- ☆15Feb 24, 2023Updated 3 years ago
- [TIP 2023] Redesigning Multi-Scale Neural Network for Crowd Counting☆23Jul 2, 2024Updated last year