Xu3XiWang / CACViT-AAAI24
Official implementation of Vision Transformer Off-the-Shelf: A Surprising Baseline for Few-Shot Class-Agnostic Counting
☆21 · Updated last year
Alternatives and similar repositories for CACViT-AAAI24
Users who are interested in CACViT-AAAI24 are comparing it to the repositories listed below.
- [ACM MM23] CLIP-Count: Towards Text-Guided Zero-Shot Object Counting ☆117 · Updated last year
- [ECCV2024] Official implementation of Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded Scenes ☆90 · Updated 5 months ago
- CounTR: Transformer-based Generalised Visual Counting ☆118 · Updated last year
- [CVPR 2023] CrowdCLIP: Unsupervised Crowd Counting via Vision-Language Model ☆90 · Updated 2 years ago
- LOCA - A Low-Shot Object Counting Network With Iterative Prototype Adaptation (ICCV 2023) ☆54 · Updated last year
- ☆80 · Updated last year
- [CVPR 2024] PEM: Prototype-based Efficient MaskFormer for Image Segmentation ☆121 · Updated 7 months ago
- ☆29 · Updated last year
- [CVPR 2024] Official implementation of "VRP-SAM: SAM with Visual Reference Prompt" ☆159 · Updated last year
- [CVPR 2024] Code for "Improving the Generalization of Segmentation Foundation Model under Distribution Shift via Weakly Supervised Adapta… ☆175 · Updated last year
- Official implementation of paper FiLo: Zero-Shot Anomaly Detection by Fine-Grained Description and High-Quality Localization (ACM MM 2024… ☆74 · Updated last year
- This repository contains the implementation for the paper "Revisiting Few Shot Object Detection with Vision-Language Models" ☆80 · Updated 5 months ago
- ☆52 · Updated last year
- Code release for "Weakly Supervised Open-Vocabulary Object Detection", AAAI2024 ☆35 · Updated last year
- [CVPR2024] Official Pytorch Implementation of SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation. ☆180 · Updated last year
- Official implementation of the CVPR 2024 paper ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense… ☆329 · Updated 9 months ago
- Official implementation of CVPR2025 paper: "T2ICount: Enhancing Cross-modal Understanding for zero-shot Counting" ☆21 · Updated 6 months ago
- [CVPR 2024] The official implementation for "MS-DETR: Efficient DETR Training with Mixed Supervision" ☆118 · Updated last year
- ☆74 · Updated 8 months ago
- [WACV 2025] Official code for our paper "Enhancing Novel Object Detection via Cooperative Foundational Models" ☆83 · Updated this week
- [ECCV2024 Oral] Official implementation of the paper "Relation DETR: Exploring Explicit Position Relation Prior for Object Detection" ☆242 · Updated 11 months ago
- PA-SAM: Prompt Adapter SAM for High-quality Image Segmentation ☆97 · Updated last year
- [CVPR2024] Magic Tokens: Select Diverse Tokens for Multi-modal Object Re-Identification ☆111 · Updated last year
- [ECCV 2024] Official implementation of "LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction" ☆85 · Updated 6 months ago
- [CVPR 2024] Official implementation of <Stronger, Fewer, & Superior: Harnessing Vision Foundation Models for Domain Generalized Semantic Segme… ☆375 · Updated 2 months ago
- Official implementation of CVPR2023 ZegCLIP: Towards Adapting CLIP for Zero-shot Semantic Segmentation ☆249 · Updated 2 years ago
- [CVPR 2023 & TPAMI 2025] Explicit Visual Prompting for Low-Level Structure Segmentations ☆215 · Updated last week
- Official Code for 'Referring Camouflaged Object Detection (指向性伪装物体检测)' (TPAMI 2025) ☆108 · Updated 9 months ago
- CLIP-Mamba: CLIP Pretrained Mamba Models with OOD and Hessian Evaluation ☆76 · Updated last year
- InstaGen: Enhancing Object Detection by Training on Synthetic Dataset, CVPR2024 ☆87 · Updated last year