vpulab / ovam
Code for the paper Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models @ CVPR 2024
☆57Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for ovam
- FreeDA: Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation (CVPR 2024)☆29Updated 2 months ago
- Official Repo for PosSAM: Panoptic Open-vocabulary Segment Anything☆46Updated 7 months ago
- Official PyTorch Implementation for Diffusion Hyperfeatures, NeurIPS 2023☆95Updated last month
- ☆32Updated last month
- [CVPR 2024] The repository contains the official implementation of "Open-Vocabulary Segmentation with Semantic-Assisted Calibration"☆61Updated last month
- [CVPR 2024] Official implementation of "Universal Segmentation at Arbitrary Granularity with Language Instruction"☆78Updated 8 months ago
- Dataset Diffusion: Diffusion-based Synthetic Data Generation for Pixel-Level Semantic Segmentation (NeurIPS2023)☆105Updated 2 months ago
- [CVPR24] Official Implementation of GEM (Grounding Everything Module)☆86Updated 3 weeks ago
- (ICCV 2023) MasQCLIP for Open-Vocabulary Universal Image Segmentation☆34Updated last year
- Large-Vocabulary Video Instance Segmentation dataset☆76Updated 4 months ago
- [NeurlPS 2024] One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos☆33Updated 2 weeks ago
- [NeurIPS 2024] Official PyTorch implementation of LoTLIP: Improving Language-Image Pre-training for Long Text Understanding☆30Updated this week
- [ECCV'24] Official PyTorch implementation of In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation☆35Updated last month
- 1-shot image segmentation using Stable Diffusion☆127Updated 8 months ago
- [ECCV 2024] Official implementation of the paper "Towards Latent Masked Image Modeling for Self-Supervised Visual Representation Learning…☆22Updated 3 months ago
- Text-Image Alignment for Diffusion-based Perception (TADP) - CVPR 2024☆24Updated 2 months ago
- Official implementation of 'CLIP-DINOiser: Teaching CLIP a few DINO tricks' paper.☆214Updated 3 weeks ago
- A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting [ECCV 2024]☆56Updated 9 months ago
- Open-vocabulary Video Instance Segmentation Codebase built upon Detectron2, which is really easy to use.☆17Updated 8 months ago
- Code Release of F-LMM: Grounding Frozen Large Multimodal Models☆47Updated 3 months ago
- [ICCV2023] DiffuMask: Synthesizing Images with Pixel-level Annotations for Semantic Segmentation Using Diffusion Models☆153Updated last year
- Official Implementation for CVPR 2024 paper: CLIP as RNN: Segment Countless Visual Concepts without Training Endeavor☆100Updated 4 months ago
- Diffusion-TTA improves pre-trained discriminative models such as image classifiers or segmentors using pre-trained generative models.☆57Updated 7 months ago
- [NeurIPS 2023] OV-PARTS: Towards Open-Vocabulary Part Segmentation☆72Updated 4 months ago
- Official implementation of "Adversarial Supervision Makes Layout-to-Image Diffusion Models Thrive" (ICLR 2024)☆52Updated 2 months ago
- [CVPR 2024] Official Implementation of Collaborating Foundation models for Domain Generalized Semantic Segmentation☆61Updated 4 months ago
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference☆65Updated 3 months ago
- ☆26Updated last year
- official repository of CVPR 2024 paper, RMem: Restricted Memory Banks Improve Video Object Segmentation☆34Updated 2 months ago
- Official repository of paper titled "How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs".☆42Updated 2 months ago