343gltysprk / ovow
☆19Updated last month
Alternatives and similar repositories for ovow:
Users who are interested in ovow are comparing it to the repositories listed below
- [ECCV2024] ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation☆68Updated 4 months ago
- (ICCV 2023) MasQCLIP for Open-Vocabulary Universal Image Segmentation☆36Updated last year
- [IEEE TCSVT] Official Pytorch Implementation of CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation.☆37Updated this week
- state-of-the-art open vocabulary detector on COCO/LVIS/V3Det☆29Updated 8 months ago
- Project for "LaSagnA: Language-based Segmentation Assistant for Complex Queries".☆50Updated 8 months ago
- 🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"☆30Updated 6 months ago
- (CVPR 2024) ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning☆44Updated 3 weeks ago
- [NeurIPS 2024] official code release for our paper "Revisiting the Integration of Convolution and Attention for Vision Backbone".☆30Updated last month
- DiverGen (CVPR 2024) & BSGAL (ICML 2024)☆40Updated 2 months ago
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference☆73Updated 4 months ago
- [AAAI2024] Code Release of CLIM: Contrastive Language-Image Mosaic for Region Representation☆26Updated 11 months ago
- InstaGen: Enhancing Object Detection by Training on Synthetic Dataset, CVPR2024☆73Updated 9 months ago
- [WACV 2025] Official code for our paper "Enhancing Novel Object Detection via Cooperative Foundational Models"☆61Updated 2 months ago
- [CVPR 2024] The repository contains the official implementation of "Open-Vocabulary Segmentation with Semantic-Assisted Calibration"☆66Updated 3 months ago
- [NeurIPS'24] Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation (Diffews)☆23Updated last week
- A Large Multimodal Model for Pixel-Level Visual Grounding in Videos☆37Updated 3 weeks ago
- Official implementation of ICML 2024 Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation☆46Updated 4 months ago
- Harnessing CLIP, DINO and SAM for Open Vocabulary Segmentation☆38Updated last week
- (ECCV 2024) Can OOD Object Detectors Learn from Foundation Models?☆23Updated last month
- [ACM MM 2024] WeakSAM: Segment Anything Meets Weakly-supervised Instance-level Recognition☆44Updated 3 months ago
- Official Repo for PosSAM: Panoptic Open-vocabulary Segment Anything☆51Updated 9 months ago
- Text4Seg: Reimagining Image Segmentation as Text Generation☆33Updated last week
- [ECCV 2024] Unified Embedding Alignment for Open-Vocabulary Video Instance Segmentation☆34Updated this week
- [NeurIPS 2024] Official PyTorch implementation of LoTLIP: Improving Language-Image Pre-training for Long Text Understanding☆39Updated 3 weeks ago
- Sambor: Boosting Segment Anything Model Towards Open-Vocabulary Learning☆30Updated last year