jianzongwu / Awesome-Open-Vocabulary
(TPAMI 2024) A Survey on Open Vocabulary Learning
☆845Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for Awesome-Open-Vocabulary
- A curated publication list on open vocabulary semantic segmentation and related area (e.g. zero-shot semantic segmentation) resources..☆461Updated this week
- Adapting Meta AI's Segment Anything to Downstream Tasks with Adapters and Prompts☆1,057Updated 3 months ago
- A curated list of papers, datasets and resources pertaining to open vocabulary object detection.☆285Updated 4 months ago
- [T-PAMI-2024] Transformer-Based Visual Segmentation: A Survey☆699Updated 2 months ago
- [CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception☆489Updated 6 months ago
- [ECCV 2024] The official code of paper "Open-Vocabulary SAM".☆950Updated 3 months ago
- Open-vocabulary Semantic Segmentation☆315Updated last month
- [CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want☆706Updated 3 months ago
- This is the third party implementation of the paper Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detectio…☆434Updated 4 months ago
- [ICLR'24] Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching☆449Updated 3 months ago
- [ICCV 2023] Official implementation of the paper "A Simple Framework for Open-Vocabulary Segmentation and Detection"☆653Updated 10 months ago
- ☆463Updated 2 weeks ago
- ☆557Updated 11 months ago
- Official PyTorch implementation of "Extract Free Dense Labels from CLIP" (ECCV 22 Oral)☆406Updated 2 years ago
- Code release for our CVPR 2023 paper "Detecting Everything in the Open World: Towards Universal Object Detection".☆538Updated last year
- Collection of AWESOME vision-language models for vision tasks☆2,513Updated 2 weeks ago
- This is the official PyTorch implementation of the paper Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP.☆693Updated last year
- [CVPR 2024] Official implementation of the paper "Visual In-context Learning"☆393Updated 7 months ago
- ❄️🔥 Visual Prompt Tuning [ECCV 2022] https://arxiv.org/abs/2203.12119☆1,044Updated last year
- A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring E…☆204Updated this week
- CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks☆364Updated last year
- Project Page for "LISA: Reasoning Segmentation via Large Language Model"☆1,875Updated 4 months ago
- [CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses tha…☆783Updated 5 months ago
- This repository is for the first comprehensive survey on Meta AI's Segment Anything Model (SAM).☆843Updated this week
- Recent LLM-based CV and related works. Welcome to comment/contribute!☆842Updated 5 months ago
- [CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale☆1,085Updated last month
- [ECCV 2024] Tokenize Anything via Prompting☆534Updated 4 months ago
- [CVPR 2022] Official code for "RegionCLIP: Region-based Language-Image Pretraining"☆713Updated 8 months ago
- A curated list of awesome prompt/adapter learning methods for vision-language models like CLIP.☆300Updated last month
- [CVPR'23] Universal Instance Perception as Object Discovery and Retrieval☆1,503Updated last year