[ICLR 2025] Official PyTorch Implementation of MMR: A Large-scale Benchmark Dataset for Multi-target and Multi-granularity Reasoning Segmentation
☆24 · Apr 3, 2025 · Updated 10 months ago
Alternatives and similar repositories for MMR
Users who are interested in MMR are comparing it to the repositories listed below
- Paper list for LLM/MLLM-based image segmentation ☆47 · Dec 24, 2025 · Updated 2 months ago
- ☆23 · Jan 24, 2024 · Updated 2 years ago
- Dynamic, high-resolution poverty measurement in data-scarce environments ☆10 · Dec 8, 2024 · Updated last year
- Code release for "SegLLM: Multi-round Reasoning Segmentation" ☆127 · Feb 20, 2025 · Updated last year
- [TGRS'25] AirSpatialBot: A Spatially-Aware Aerial Agent for Fine-Grained Vehicle Attribute Recognization and Retrieval ☆29 · Jan 6, 2026 · Updated last month
- [ICCV 2025] Official implementation of "InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models" ☆53 · Feb 10, 2025 · Updated last year
- Landsat-Bench: Datasets and Benchmarks for Landsat Foundation Models ☆18 · Jun 18, 2025 · Updated 8 months ago
- An up-to-date & curated list of awesome layout to image papers, methods & resources. ☆13 · Jun 28, 2024 · Updated last year
- ☆18 · Jan 5, 2026 · Updated last month
- DescribeEarth: Describe Anything for Remote Sensing Images ☆23 · Feb 24, 2026 · Updated last week
- [CVPR 2025] Your Large Vision-Language Model Only Needs A Few Attention Heads For Visual Grounding ☆16 · Oct 4, 2025 · Updated 4 months ago
- Code and updates for the ScoreRS project. ☆40 · Sep 19, 2025 · Updated 5 months ago
- ☆28 · Sep 2, 2025 · Updated 6 months ago
- RS-MTDF: Multi-Teacher Distillation and Fusion for Remote Sensing Semi-Supervised Semantic Segmentation ☆19 · Jun 15, 2025 · Updated 8 months ago
- [TIP 2025] Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation ☆58 · Dec 22, 2025 · Updated 2 months ago
- [CVPR-2023] Semantic-Promoted Debiasing and Background Disambiguation for Zero-Shot Instance Segmentation ☆18 · Jul 2, 2023 · Updated 2 years ago
- Paper List on Earth Observation in the Foundation Model Era ☆28 · Dec 25, 2025 · Updated 2 months ago
- ☆16 · Oct 14, 2025 · Updated 4 months ago
- ThinkGeo is a Comprehensive Benchmark to evaluate Tool-Augmented Agents for Remote Sensing Tasks ☆59 · Feb 20, 2026 · Updated last week
- Project Page For "Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement" ☆604 · Jan 17, 2026 · Updated last month
- ☆41 · Dec 10, 2024 · Updated last year
- Open Source Road Datasets ☆18 · Aug 30, 2024 · Updated last year
- Official implementation and checkpoints of GeoLink remote sensing foundation model in NeurIPS2025. ☆53 · Oct 6, 2025 · Updated 4 months ago
- Code for the paper Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models @ CVPR 2024 ☆71 · Jun 14, 2024 · Updated last year
- [AAAI2025] Official implementation of the paper "RAP-SR: RestorAtion Prior Enhancement in Diffusion Models for Realistic Image Super-Reso… ☆19 · Mar 22, 2025 · Updated 11 months ago
- EagleVision: Object-level Attribute Multimodal LLM for Remote Sensing ☆20 · May 29, 2025 · Updated 9 months ago
- Official PyTorch implementation of CorrespondentDream: Enhancing 3D Fidelity of Text-to-3D using Cross-View Correspondences (CVPR 2024 Po… ☆19 · Apr 29, 2024 · Updated last year
- Official repo for "TiMo: Spatiotemporal Foundation Model for Satellite Image Time Series" ☆28 · May 14, 2025 · Updated 9 months ago
- Related papers about Referring Image Segmentation (RIS) ☆16 · Dec 26, 2023 · Updated 2 years ago
- The official implementation of "PixelThink: Towards Efficient Chain-of-Pixel Reasoning" (arXiv 2025) ☆40 · May 30, 2025 · Updated 9 months ago
- A curated list of publications on image and video segmentation leveraging Multimodal Large Language Models (MLLMs), highlighting state-of… ☆192 · Jan 21, 2026 · Updated last month
- [ECCV'24] Official PyTorch implementation of In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation ☆49 · Sep 24, 2024 · Updated last year
- The official repo for the technical report "Scalable Mask Annotation for Video Text Spotting" ☆16 · May 3, 2023 · Updated 2 years ago
- ViCaS: A Dataset for Combining Holistic and Pixel-level Video Understanding using Captions with Grounded Segmentation (CVPR'25) ☆18 · Apr 2, 2025 · Updated 11 months ago
- [ICCV2025] Harnessing CLIP, DINO and SAM for Open Vocabulary Segmentation ☆107 · Nov 22, 2025 · Updated 3 months ago
- [ICLR2025] Text4Seg: Reimagining Image Segmentation as Text Generation ☆163 · Nov 8, 2025 · Updated 3 months ago
- The P3 Dataset: Pixels, Points and Polygons for Multimodal Building Vectorization ☆38 · Dec 11, 2025 · Updated 2 months ago
- [ICCV 2025] RAGNet: Large-scale Reasoning-based Affordance Segmentation Benchmark towards General Grasping ☆36 · Nov 21, 2025 · Updated 3 months ago
- [AAAI 2026 Oral] DynamicEarth: How Far are We from Open-Vocabulary Change Detection? ☆109 · Dec 23, 2025 · Updated 2 months ago