Xu3XiWang / CACViT-AAAI24
Official implementation of Vision Transformer Off-the-Shelf: A Surprising Baseline for Few-Shot Class-Agnostic Counting
☆20 · Updated last year
Alternatives and similar repositories for CACViT-AAAI24
Users who are interested in CACViT-AAAI24 compare it to the repositories listed below.
- [ACM MM23] CLIP-Count: Towards Text-Guided Zero-Shot Object Counting ☆110 · Updated last year
- CounTR: Transformer-based Generalised Visual Counting ☆116 · Updated last year
- [ECCV2024] Official implementation of Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded Scenes ☆89 · Updated 3 months ago
- [WACV 2025] Official code for our paper "Enhancing Novel Object Detection via Cooperative Foundational Models" ☆80 · Updated 5 months ago
- LOCA - A Low-Shot Object Counting Network With Iterative Prototype Adaptation (ICCV 2023) ☆53 · Updated last year
- ☆52 · Updated last year
- [CVPR 2024] Official implementation of "VRP-SAM: SAM with Visual Reference Prompt" ☆151 · Updated 10 months ago
- ☆27 · Updated last year
- [CVPR 2024] Exploring Orthogonality in Open World Object Detection ☆48 · Updated 3 months ago
- Code release for "Weakly Supervised Open-Vocabulary Object Detection", AAAI2024 ☆35 · Updated 11 months ago
- [CVPR 2023] CrowdCLIP: Unsupervised Crowd Counting via Vision-Language Model ☆88 · Updated 2 years ago
- [AAAI 2024] VLCounter: Text-aware Visual Representation for Zero-Shot Object Counting ☆42 · Updated 8 months ago
- [CVPR2024] Official PyTorch Implementation of SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation. ☆176 · Updated last year
- ☆77 · Updated last year
- Official PyTorch code for Open World Object Detection in the Era of Foundation Models ☆79 · Updated last year
- [CVPR 2024] PEM: Prototype-based Efficient MaskFormer for Image Segmentation ☆113 · Updated 4 months ago
- This repository contains the implementation for the paper "Revisiting Few Shot Object Detection with Vision-Language Models" ☆73 · Updated 2 months ago
- [ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction ☆191 · Updated last year
- CVPR2023 Zero-shot Counting ☆57 · Updated 4 months ago
- CLIP-Mamba: CLIP Pretrained Mamba Models with OOD and Hessian Evaluation ☆75 · Updated 11 months ago
- Official implementation of CVPR2023 ZegCLIP: Towards Adapting CLIP for Zero-shot Semantic Segmentation ☆242 · Updated last year
- Multi-Class Few-Shot Semantic Segmentation with Visual Prompts ☆57 · Updated last week
- [CVPR2025] Project for "HyperSeg: Towards Universal Visual Segmentation with Large Language Model". ☆160 · Updated 7 months ago
- Includes FSC-147-D and the code for training and testing the CounTX model from the paper Open-world Text-specified Object Counting. ☆38 · Updated 10 months ago
- Code Implementation of "Unsupervised Recognition of Unknown Objects for Open-World Object Detection" ☆28 · Updated last year
- ☆68 · Updated 5 months ago
- [CVPR 2024] The repository contains the official implementation of "Open-Vocabulary Segmentation with Semantic-Assisted Calibration" ☆72 · Updated 10 months ago
- Official implementation of ICML2024 Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation ☆51 · Updated 11 months ago
- [CVPR 2024] Dual Prototype Attention for Unsupervised Video Object Segmentation ☆33 · Updated last year
- Official implementation for CVPR 2022 paper "Represent, Compare, and Learn: A Similarity-Aware Framework for Class-Agnostic Counting". ☆72 · Updated last year