UX-Decoder / DINOv
[CVPR 2024] Official implementation of the paper "Visual In-context Learning"
☆393Updated 7 months ago
Related projects ⓘ
Alternatives and complementary repositories for DINOv
- [ICLR'24] Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching☆448Updated 3 months ago
- [ICCV 2023] Official implementation of the paper "A Simple Framework for Open-Vocabulary Segmentation and Detection"☆653Updated 9 months ago
- [NeurIPS 2023] This repo contains the code for our paper Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convoluti…☆287Updated 9 months ago
- [NeurIPS2023] Code release for "Hierarchical Open-vocabulary Universal Image Segmentation"☆271Updated 7 months ago
- CoRL 2024☆345Updated 3 weeks ago
- CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks☆361Updated last year
- Open-vocabulary Semantic Segmentation☆315Updated last month
- Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion☆250Updated 2 months ago
- [ECCV 2024] Tokenize Anything via Prompting☆534Updated 4 months ago
- [NeurIPS 2024] Code release for "Segment Anything without Supervision"☆420Updated last month
- A collection of project, papers, and source code for Meta AI's Segment Anything Model (SAM) and related studies.☆328Updated this week
- [ICCV2023] VLPart: Going Denser with Open-Vocabulary Part Segmentation☆357Updated last year
- [ECCV2024] This is an official implementation for "PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model"☆193Updated this week
- Official PyTorch implementation of "Extract Free Dense Labels from CLIP" (ECCV 22 Oral)☆405Updated 2 years ago
- [ICCV 2023] Official implementation of the paper "Detection Transformer with Stable Matching"☆219Updated 6 months ago
- using clip and sam to segment any instance you specify with text prompt of any instance names☆172Updated last year
- [CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception☆489Updated 6 months ago
- Includes the code for training and testing the CountGD model from the paper CountGD: Multi-Modal Open-World Counting.☆90Updated last month
- ☆211Updated 4 months ago
- [CVPR 2024] Official implementation of "VRP-SAM: SAM with Visual Reference Prompt"☆97Updated last month
- Official repository for "AM-RADIO: Reduce All Domains Into One"☆808Updated 2 weeks ago
- A curated list of papers, datasets and resources pertaining to open vocabulary object detection.☆284Updated 4 months ago
- [ECCV2024 Oral🔥] Official Implementation of "GiT: Towards Generalist Vision Transformer through Universal Language Interface"☆308Updated last month
- Connecting segment-anything's output masks with the CLIP model; Awesome-Segment-Anything-Works☆178Updated last month
- Downstream-Dino-V2: A GitHub repository featuring an easy-to-use implementation of the DINOv2 model by Facebook for downstream tasks such…☆196Updated last year
- This is the third party implementation of the paper Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detectio…☆430Updated 4 months ago
- Experiment on combining CLIP with SAM to do open-vocabulary image segmentation.☆342Updated last year
- [NeurIPS2023] DatasetDM:Synthesizing Data with Perception Annotations Using Diffusion Models☆306Updated last year
- Recognize Any Regions☆118Updated last month