niki-amini-naieni / CounTXLinks
Includes FSC-147-D and the code for training and testing the CounTX model from the paper Open-world Text-specified Object Counting.
☆41Updated last year
Alternatives and similar repositories for CounTX
Users that are interested in CounTX are comparing it to the libraries listed below
Sorting:
- [ACM MM23] CLIP-Count: Towards Text-Guided Zero-Shot Object Counting☆122Updated last year
- CounTR: Transformer-based Generalised Visual Counting☆120Updated last year
- LOCA - A Low-Shot Object Counting Network With Iterative Prototype Adaptation (ICCV 2023)☆54Updated last year
- CVPR2023 Zero-shot Counting☆60Updated 9 months ago
- Official repo for our ICML 23 paper: "Multi-Modal Classifiers for Open-Vocabulary Object Detection"☆95Updated 2 years ago
- Recognize Any Regions☆122Updated last year
- [ECCV2024] Official implementation of Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded Scenes☆95Updated 7 months ago
- ☆53Updated 2 years ago
- ☆52Updated last year
- [CVPR 2024 Highlight] Official repository of the paper "The devil is in the fine-grained details: Evaluating open-vocabulary object detec…☆64Updated 9 months ago
- [ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction☆201Updated last year
- ☆25Updated 10 months ago
- OvarNet official implement of the paper "OvarNet: Towards Open-vocabulary Object Attribute Recognition"☆105Updated 2 years ago
- [ECCV 2024] The official PyTorch implementation of the "Plain-Det: A Plain Multi-Dataset Object Detector".☆29Updated last year
- Official implementation for CVPR 2022 paper "Represent, Compare, and Learn: A Similarity-Aware Framework for Class-Agnostic Counting".☆73Updated 2 years ago
- ☆30Updated last year
- [ICLR 2024] FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition☆94Updated 11 months ago
- [WACV 2023] Few-shot Object Counting with Similarity-Aware Feature Enhancement☆142Updated 2 years ago
- [CVPR 2023] CrowdCLIP: Unsupervised Crowd Counting via Vision-Language Model☆91Updated 2 years ago
- [CVPR2023] Code Release of Aligning Bag of Regions for Open-Vocabulary Object Detection☆184Updated 2 years ago
- Official implementation of Vision Transformer Off-the-Shelf: A Surprising Baseline for Few-Shot Class-Agnostic Counting☆21Updated last year
- [CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection☆188Updated 9 months ago
- InstaGen: Enhancing Object Detection by Training on Synthetic Dataset, CVPR2024☆87Updated last year
- Official repository of paper "WeDetect: Fast Open-Vocabulary Object Detection as Retrieval"☆58Updated 2 weeks ago
- [ECCV 2024] SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding☆64Updated last year
- (NeurIPS2023) CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection☆123Updated last year
- [CVPR 2023] implementation of Towards All-in-one Pre-training via Maximizing Multi-modal Mutual Information.☆91Updated 2 years ago
- [ECCV 2024] Official implementation of "LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction"☆88Updated last week
- Few-shot Object Counting and Detection (ECCV 2022)☆81Updated last year
- (CVPR2023/TPAMI2024) Integrally Pre-Trained Transformer Pyramid Networks -- A Hierarchical Vision Transformer for Masked Image Modeling☆211Updated last year