Dinghow / UIM
The official PyTorch implementation of "Exploring the Interactive Guidance for Unified and Effective Image Matting" [arXiv]
☆23 Updated last year
Alternatives and similar repositories for UIM
Users who are interested in UIM are comparing it to the libraries listed below.
- FakeVLM: Advancing Synthetic Image Detection through Explainable Multimodal Models and Fine-Grained Artifact Analysis ☆47 Updated last month
- The official implementation of the paper "LEGION: Learning to Ground and Explain for Synthetic Image Detection" ☆36 Updated last week
- The official implementation of the paper “Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm” ☆47 Updated 2 months ago
- [NeurIPS'24] A Simple Image Segmentation Framework via In-Context Examples ☆53 Updated 6 months ago
- [AAAI 2025] This repo contains evaluation code for the paper “UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in… ☆29 Updated last month
- [ICLR 2025 Spotlight] The official implementation of the paper “LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multi… ☆149 Updated last month
- When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning ☆22 Updated 3 weeks ago
- Project for "HyperSeg: Towards Universal Visual Segmentation with Large Language Model". ☆139 Updated 5 months ago
- [ICLR 2025] Text4Seg: Reimagining Image Segmentation as Text Generation ☆91 Updated last month
- Official Repository of paper: Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing ☆46 Updated last month
- [ICLR 2025] SAMRefiner: Taming Segment Anything Model for Universal Mask Refinement ☆46 Updated 3 weeks ago
- [ECCV 2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference ☆82 Updated last month
- [ICCV 2023] DiffuMask: Synthesizing Images with Pixel-level Annotations for Semantic Segmentation Using Diffusion Models ☆176 Updated last year
- [CVPR 2024 Highlight] Official GraCo: Granularity-Controllable Interactive Segmentation. ☆56 Updated 2 months ago
- Official implementation of "InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models" ☆37 Updated 3 months ago
- [NeurIPS'24] Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation (Diffews) ☆36 Updated last month
- A curated list of publications on image and video segmentation leveraging Multimodal Large Language Models (MLLMs), highlighting state-of… ☆63 Updated 3 weeks ago
- [CVPR 2023] The models, datasets (satellite & street view) and correlative config files of OmniCity-v1.0 project. ☆28 Updated last month
- [CVPR 2024] Official implementation of "Universal Segmentation at Arbitrary Granularity with Language Instruction" ☆86 Updated last year
- Official Repo for PosSAM: Panoptic Open-vocabulary Segment Anything ☆62 Updated last year
- [CVPR 2025] Official repository of the paper "Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation" ☆88 Updated 2 months ago
- [CVPR 2025] SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories ☆38 Updated 2 months ago
- DiverGen (CVPR 2024) & BSGAL (ICML 2024) ☆46 Updated last month
- The official repository of Real Text Manipulation (RTM) ☆35 Updated last month
- [CVPR 2023] Explicit Visual Prompting for Low-Level Structure Segmentations ☆204 Updated last year
- Official implementation of 🛸 "UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface" ☆184 Updated last month