Code for the paper Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models @ CVPR 2024
☆71Jun 14, 2024Updated last year
Alternatives and similar repositories for ovam
Users that are interested in ovam are comparing it to the libraries listed below
Sorting:
- [CVPR'24] Code for Emergent Open-Vocabulary Semantic Segmentation from Off-the-shelf Vision-Language Models☆18Jul 22, 2024Updated last year
- FreeDA: Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation (CVPR 2024)☆49Aug 28, 2024Updated last year
- [ICLR 2025] Official Pytorch Implementation of MMR: A Large-scale Benchmark Dataset for Multi-target and Multi-granularity Reasoning Segm…☆24Apr 3, 2025Updated 11 months ago
- [CVPR2024] Open-Vocabulary Semantic Segmentation with Image Embedding Balancing☆40Jan 12, 2026Updated last month
- [NeurIPS 2023] Weakly Supervised 3D Open-vocabulary Segmentation☆125Jan 11, 2024Updated 2 years ago
- Official pytorch implementation of "Towards Practical Plug-and-Play Diffusion Models" in CVPR2023☆22Jul 22, 2023Updated 2 years ago
- Training recipe for SpatialReasoner☆38Sep 21, 2025Updated 5 months ago
- This is the project for 'USG'.☆36Apr 7, 2025Updated 10 months ago
- ☆23Feb 12, 2026Updated 3 weeks ago
- Multi-consistency for Semi-Supervised medical Image Segmentation with Diffusion Model☆10Feb 23, 2025Updated last year
- ☆12Jun 27, 2022Updated 3 years ago
- Hunting Attributes: Context Prototype-Aware Learning for Weakly Supervised Semantic Segmentation (CVPR 2024)☆46Oct 17, 2024Updated last year
- Code for the paper "Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation", ECCV 2024☆47Sep 28, 2024Updated last year
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"☆18Mar 15, 2024Updated last year
- Code for "AffordanceLLM: Grounding Affordance from Vision Language Models"☆14Oct 18, 2024Updated last year
- official code for "3D Question Answering via only 2D Vision-Language Models"☆23Jan 15, 2026Updated last month
- ViGiL3D: A Linguistically Diverse Dataset for 3D Visual Grounding☆17Aug 8, 2025Updated 6 months ago
- [NAACL 2024] Z-GMOT: Zero-shot Generic Multiple Object Tracking☆13May 3, 2024Updated last year
- Official PyTorch implementation of “MaskRIS: Semantic Distortion-aware Data Augmentation for Referring Image Segmentation”☆18Dec 5, 2024Updated last year
- [CVPR`2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalization☆109Apr 10, 2024Updated last year
- Official implementation for "Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter"☆53Sep 26, 2025Updated 5 months ago
- The official implementation of ADDP (ICLR 2024)☆12Mar 27, 2024Updated last year
- Official code for CVPR2024 “VideoMAC: Video Masked Autoencoders Meet ConvNets”☆12Mar 4, 2024Updated 2 years ago
- ☆15May 7, 2024Updated last year
- 🔥GrabS in PyTorch (ICLR 2025 Spotlight)☆19Aug 26, 2025Updated 6 months ago
- [CVPR-2024] NAYER: Noisy Layer Data Generation for Efficient and Effective Data-free Knowledge Distillation☆16Oct 19, 2024Updated last year
- The official implementation of "NAS-BNN: Neural Architecture Search for Binary Neural Networks"☆13Aug 30, 2024Updated last year
- Codebase for the Paper: Learning Visual Styles from Audio-Visual Associations (ECCV 2022, in PyTorch)☆15Jan 26, 2023Updated 3 years ago
- Data & Code for FEDD published @ MICCAI 23☆12Oct 11, 2023Updated 2 years ago
- Cached Multi-Lora Composition for Multi-Concept Image Generation☆16Jun 13, 2025Updated 8 months ago
- ☆33Sep 27, 2024Updated last year
- Official code implementation for the paper "Do Vision & Language Decoders use Images and Text equally? How Self-consistent are their Expl…☆12Apr 4, 2025Updated 11 months ago
- ☆28Jan 15, 2026Updated last month
- ☆17Aug 13, 2024Updated last year
- ☆59Sep 14, 2024Updated last year
- [CVPR 2024 Highlight] OpenESS: Event-Based Semantic Scene Understanding with Open Vocabularies☆72Aug 22, 2025Updated 6 months ago
- [CVPR 2024] Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding☆62Aug 3, 2024Updated last year
- Open-vocabulary Object Segmentation with Diffusion Models☆183Aug 15, 2023Updated 2 years ago
- [CVPR 2024] The official implementation of paper "Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training"☆36Apr 21, 2024Updated last year