Dinghow / UIMLinks
The official pytorch implementation of Exploring the Interactive Guidance for Unified and Effective Image Matting [TOMM 2025]
☆24Updated 2 months ago
Alternatives and similar repositories for UIM
Users that are interested in UIM are comparing it to the libraries listed below
Sorting:
- [ICCV25 Highlight] The official implementation of the paper "LEGION: Learning to Ground and Explain for Synthetic Image Detection"☆73Updated 3 months ago
- A curated list of publications on image and video segmentation leveraging Multimodal Large Language Models (MLLMs), highlighting state-of…☆188Updated 2 weeks ago
- [NeurIPS 2025 🔥] FakeVLM: Advancing Synthetic Image Detection through Explainable Multimodal Models and Fine-Grained Artifact Analysis☆111Updated 4 months ago
- [CVPR2025] Project for "HyperSeg: Towards Universal Visual Segmentation with Large Language Model".☆179Updated last year
- [CVPR 2023 & TPAMI 2025] Explicit Visual Prompting for Low-Level Structure Segmentations☆220Updated 3 months ago
- [ICLR2025] Text4Seg: Reimagining Image Segmentation as Text Generation☆163Updated 2 months ago
- [NeurIPS'24] A Simple Image Segmentation Framework via In-Context Examples☆65Updated last year
- [ICCV2023] DiffuMask: Synthesizing Images with Pixel-level Annotations for Semantic Segmentation Using Diffusion Models☆191Updated 2 years ago
- [ECCV2024] This is an official implementation for "PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model"☆269Updated last year
- ☆59Updated last year
- ☆28Updated last year
- Towards Training-free Open-world Segmentation via Image Prompt Foundation Models,☆18Updated last year
- [CVPR2025] SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories☆89Updated 5 months ago
- HiMTok: Learning Hierarchical Mask Tokens for Image Segmentation with Large Multimodal Model☆86Updated 6 months ago
- [ICLR 2025] SAMRefiner: Taming Segment Anything Model for Universal Mask Refinement☆82Updated 9 months ago
- [ICCV 2025] HQ-CLIP: Leveraging Large Vision-Language Models to Create High-Quality Image-Text Datasets☆62Updated 6 months ago
- [NeurIPS-W 2025] Official Implementation of "Seg-R1: Segmentation Can Be Surprisingly Simple with Reinforcement Learning"☆58Updated 7 months ago
- [ICLR 2025 Spotlight] The official implementation of the paper “LOKI:A Comprehensive Synthetic Data Detection Benchmark using Large Multi…☆174Updated 10 months ago
- ☆77Updated last year
- Make Large Multimodal Models excel in object detection, ICCV 2025☆62Updated 6 months ago
- SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process☆211Updated 2 years ago
- [ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction☆201Updated 2 years ago
- [AAAI 2025] Official Implementation of "FOCUS: Towards Universal Foreground Segmentation"☆55Updated 6 months ago
- [CVPR 2025] DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception☆148Updated 3 weeks ago
- Code for ''MaskDiffusion: Exploiting Pre-trained Diffusion Models for Semantic Segmentation''☆35Updated last year
- Official implementation for "Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter"☆53Updated 4 months ago
- Official Repo for PosSAM: Panoptic Open-vocabulary Segment Anything☆70Updated last year
- PyTorch Implementation of "BOOTPLACE: Bootstrapped Object Placement with Detection Transformers", CVPR 2025☆24Updated 5 months ago
- [NeurIPS2025 Spotlight 🔥 ] Official implementation of 🛸 "UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Langu…☆265Updated 3 months ago
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference☆97Updated 10 months ago