vpulab/ovam

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/vpulab/ovam)

vpulab / ovam

Code for the paper Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models @ CVPR 2024

☆70

Alternatives and similar repositories for ovam

Users that are interested in ovam are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

letitiabanana / PnP-OVSS
View on GitHub
[CVPR'24] Code for Emergent Open-Vocabulary Semantic Segmentation from Off-the-shelf Vision-Language Models
☆18Jul 22, 2024Updated 2 years ago
aimagelab / freeda
View on GitHub
FreeDA: Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation (CVPR 2024)
☆50Aug 28, 2024Updated last year
Yqcca / CMLoRA
View on GitHub
Cached Multi-Lora Composition for Multi-Concept Image Generation
☆17Jun 13, 2025Updated last year
jdg900 / MMR
View on GitHub
[ICLR 2025] Official Pytorch Implementation of MMR: A Large-scale Benchmark Dataset for Multi-target and Multi-granularity Reasoning Segm…
☆28Apr 3, 2025Updated last year
Valkyrja3607 / MaskDiffusion
View on GitHub
Code for ''MaskDiffusion: Exploiting Pre-trained Diffusion Models for Semantic Segmentation''
☆36Mar 23, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Kunhao-Liu / 3D-OVS
View on GitHub
[NeurIPS 2023] Weakly Supervised 3D Open-vocabulary Segmentation
☆128May 5, 2026Updated 2 months ago
slonetime / EBSeg
View on GitHub
[CVPR2024] Open-Vocabulary Semantic Segmentation with Image Embedding Balancing
☆41Jan 12, 2026Updated 6 months ago
xiaotianqing / InstDiffEdit
View on GitHub
The code of Towards Efficient Diffusion-Based Image Editing with Instant Attention Masks
☆27Apr 10, 2024Updated 2 years ago
hectorcarrion / FEDD
View on GitHub
Data & Code for FEDD published @ MICCAI 23
☆12Oct 11, 2023Updated 2 years ago
Monalissaa / DisenDiff
View on GitHub
[CVPR`2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalization
☆111Apr 10, 2024Updated 2 years ago
sakharok13 / Aligning-Stable-Diffusion-with-Noise-Conditioned-Perception
View on GitHub
☆17Aug 13, 2024Updated last year
lucabarsellotti / awesome-open-vocabulary-semantic-segmentation
View on GitHub
☆15May 7, 2024Updated 2 years ago
yunzhuC / MCSD
View on GitHub
Multi-consistency for Semi-Supervised medical Image Segmentation with Diffusion Model
☆10Feb 23, 2025Updated last year
buxiangzhiren / VD-IT
View on GitHub
Code for the paper "Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation", ECCV 2024
☆48Sep 28, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ChangyaoTian / ADDP
View on GitHub
The official implementation of ADDP (ICLR 2024)
☆12Mar 27, 2024Updated 2 years ago
helblazer811 / ConceptAttention
View on GitHub
ConceptAttention: A method for interpreting multi-modal diffusion transformers.
☆461Jan 16, 2026Updated 6 months ago
riiid / PPAP
View on GitHub
Official pytorch implementation of "Towards Practical Plug-and-Play Diffusion Models" in CVPR2023
☆22Jul 22, 2023Updated 3 years ago
fudan-zvg / GSS
View on GitHub
[CVPR 2023] Official repository of Generative Semantic Segmentation
☆222Sep 3, 2023Updated 2 years ago
JasonQSY / AffordanceLLM
View on GitHub
Code for "AffordanceLLM: Grounding Affordance from Vision Language Models"
☆14Oct 18, 2024Updated last year
castorini / daam
View on GitHub
Diffusion attentive attribution maps for interpreting Stable Diffusion.
☆801Apr 5, 2024Updated 2 years ago
VDIGPKU / NAS-BNN
View on GitHub
The official implementation of "NAS-BNN: Neural Architecture Search for Binary Neural Networks"
☆14Aug 30, 2024Updated last year
jianzongwu / Awesome-Open-Vocabulary
View on GitHub
(TPAMI 2024) A Survey on Open Vocabulary Learning
☆998May 12, 2026Updated 2 months ago
xb534 / SED
View on GitHub
[TPAMI2025&CVPR2024] Official Pytorch Implementation of SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation.
☆199May 30, 2024Updated 2 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
Lipurple / Grounded-Diffusion
View on GitHub
Open-vocabulary Object Segmentation with Diffusion Models
☆184Aug 15, 2023Updated 2 years ago
google / diffseg
View on GitHub
DiffSeg is an unsupervised zero-shot segmentation method using attention information from a stable-diffusion model. This repo implements …
☆330Jul 9, 2024Updated 2 years ago
MIPAL-SNU / Tutorial
View on GitHub
For prospective and new joiners
☆10Oct 25, 2024Updated last year
tmtuan1307 / NAYER
View on GitHub
[CVPR-2024] NAYER: Noisy Layer Data Generation for Efficient and Effective Data-free Knowledge Distillation
☆16Oct 19, 2024Updated last year
Qinying-Liu / Awesome-Open-Vocabulary-Semantic-Segmentation
View on GitHub
A curated publication list on open vocabulary semantic segmentation and related area (e.g. zero-shot semantic segmentation) resources..
☆892May 20, 2026Updated 2 months ago
OpenGVLab / DiffAgent
View on GitHub
[CVPR 2024] DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model
☆19Apr 16, 2024Updated 2 years ago
VCG-team / DiffSegmenter
View on GitHub
(TIP 2025, CCF-A) Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter
☆55Jul 5, 2026Updated 2 weeks ago
lhaof / Nudiff
View on GitHub
Diffusion-based Data Augmentation for Nuclei Image Segmentation (MICCAI 2023)
☆23Oct 30, 2023Updated 2 years ago
BIT-DYN / OpenObj
View on GitHub
[RAL 2024] OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding
☆32Feb 17, 2025Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
L-YeZhu / BoundaryDiffusion
View on GitHub
[NeurIPS2023] BoundaryDiffusion: A learning-free method for semantic control with Diffusion Models
☆40Nov 1, 2023Updated 2 years ago
Fsoft-AIC / Z-GMOT
View on GitHub
[NAACL 2024] Z-GMOT: Zero-shot Generic Multiple Object Tracking
☆12May 19, 2026Updated 2 months ago
jacobmarks / pytesseract-ocr-plugin
View on GitHub
Run optical character recognition with PyTesseract from the FiftyOne App!
☆11Apr 5, 2024Updated 2 years ago
Video-MAC / VideoMAC
View on GitHub
Official code for CVPR2024 “VideoMAC: Video Masked Autoencoders Meet ConvNets”
☆16May 12, 2026Updated 2 months ago
jiaosiyu1999 / MAFT-Plus
View on GitHub
☆60Sep 14, 2024Updated last year
NVlabs / ODISE
View on GitHub
Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]
☆945Jul 6, 2024Updated 2 years ago
sinahmr / NACLIP
View on GitHub
PyTorch Implementation of NACLIP in "Pay Attention to Your Neighbours: Training-Free Open-Vocabulary Semantic Segmentation"
☆79Sep 23, 2024Updated last year