Dinghow / UIMLinks
The official pytorch implementation of Exploring the Interactive Guidance for Unified and Effective Image Matting [TOMM 2025]
β24Updated 2 months ago
Alternatives and similar repositories for UIM
Users that are interested in UIM are comparing it to the libraries listed below
Sorting:
- [NeurIPS'24] A Simple Image Segmentation Framework via In-Context Examplesβ65Updated last year
- [NeurIPS 2025 π₯] FakeVLM: Advancing Synthetic Image Detection through Explainable Multimodal Models and Fine-Grained Artifact Analysisβ111Updated 4 months ago
- [ICCV2023] DiffuMask: Synthesizing Images with Pixel-level Annotations for Semantic Segmentation Using Diffusion Modelsβ191Updated 2 years ago
- A curated list of publications on image and video segmentation leveraging Multimodal Large Language Models (MLLMs), highlighting state-ofβ¦β188Updated 2 weeks ago
- [CVPR 2023 & TPAMI 2025] Explicit Visual Prompting for Low-Level Structure Segmentationsβ220Updated 3 months ago
- [ICCV25 Highlight] The official implementation of the paper "LEGION: Learning to Ground and Explain for Synthetic Image Detection"β73Updated 3 months ago
- [CVPR2025] SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectoriesβ89Updated 6 months ago
- β28Updated last year
- [CVPR2025] Project for "HyperSeg: Towards Universal Visual Segmentation with Large Language Model".β179Updated last year
- [ICCV 2025] HQ-CLIP: Leveraging Large Vision-Language Models to Create High-Quality Image-Text Datasetsβ62Updated 6 months ago
- Code for ''MaskDiffusion: Exploiting Pre-trained Diffusion Models for Semantic Segmentation''β35Updated last year
- HiMTok: Learning Hierarchical Mask Tokens for Image Segmentation with Large Multimodal Modelβ86Updated 6 months ago
- [ICLR 2025] SAMRefiner: Taming Segment Anything Model for Universal Mask Refinementβ82Updated 9 months ago
- [ECCV 2024] SAM4MLLM: Enhance Multi-Modal Large Language Model for Referring Expression Segmentation,β49Updated 10 months ago
- Official implementation for "Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter"β53Updated 4 months ago
- Official repository for Scone (Subject-driven Composition and Distinction Enhancement) model, designed to support multi-subject compositiβ¦β28Updated 3 weeks ago
- [ICCV 2025] Official implementation of "InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models"β53Updated 11 months ago
- [NIPS 2025 DB Oral] Official Repository of paper: Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editingβ140Updated this week
- [ICLR2025] Text4Seg: Reimagining Image Segmentation as Text Generationβ163Updated 3 months ago
- [ICCV 2025] Official implementation of LLaVA-KD: A Framework of Distilling Multimodal Large Language Modelsβ124Updated 3 months ago
- β32Updated last year
- [ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Predictionβ201Updated 2 years ago
- β41Updated last year
- [ICCV'25] When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruningβ46Updated 6 months ago
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inferenceβ97Updated 10 months ago
- Official implement of ICML2024 Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentationβ57Updated last year
- β77Updated last year
- [AAAI 2025] Official Implementation of "FOCUS: Towards Universal Foreground Segmentation"β55Updated 7 months ago
- [CVPR 2024] LAKE-RED: Camouflaged Images Generation by Latent Background Knowledge Retrieval-Augmented Diffusion.β51Updated last year
- β59Updated last year