[CVPR 2026] STAMP: Better, Stronger, Faster: Tackling the Trilemma in MLLM-based Segmentation with Simultaneous Textual Mask Prediction
☆34Feb 21, 2026Updated last week
Alternatives and similar repositories for STAMP
Users that are interested in STAMP are comparing it to the libraries listed below
Sorting:
- UGround: Towards Unified Visual Grounding with Unrolled Transformers☆21Feb 15, 2026Updated 2 weeks ago
- Official Implementation of "Geometrically-Constrained Agent for Spatial Reasoning"☆60Dec 18, 2025Updated 2 months ago
- Rui Qian, Xin Yin, Dejing Dou†: Reasoning to Attend: Try to Understand How <SEG> Token Works (CVPR 2025)☆51Feb 4, 2026Updated last month
- The official implementation of the paper "Large Scale Knowledge Washing"☆10Jun 12, 2024Updated last year
- Official implementation of the RSE paper mKGR.☆20Jan 15, 2026Updated last month
- ICTNet: a novel network for semantic segmentation with the underlying architecture of a fully convolutional network, infused with feature…☆10May 27, 2020Updated 5 years ago
- ☆11Feb 5, 2024Updated 2 years ago
- Code for paper DNAS: Decoupling Neural Architecture Search for High-Resolution Remote Sensing Image Semantic Segmentation.☆12Sep 20, 2023Updated 2 years ago
- Landsat-Bench: Datasets and Benchmarks for Landsat Foundation Models☆18Jun 18, 2025Updated 8 months ago
- [AAAI 2026 Oral] LENS: Learning to Segment Anything with Unified Reinforced Reasoning☆106Dec 3, 2025Updated 3 months ago
- Official implementation of the ICCV 2025 paper HoliTracer.☆40Jan 13, 2026Updated last month
- ☆15Feb 26, 2025Updated last year
- [AAAI2026 demo] Official repo of “AirNavigation: Let UAV Navigation Tells Its Own Story”☆18Nov 1, 2025Updated 4 months ago
- Repository for the paper "Unsupervised Representation Learning of Spatial Data via Multimodal Embedding"☆12Dec 5, 2019Updated 6 years ago
- [WWW24-UrbanCLIP] A comprehensive toolkit designed to facilitate the collection, processing, and integration of satellite imagery and ass…☆17Oct 6, 2024Updated last year
- [ISPRS P&RS'25] Official repository of the paper Cross-View Geo-Localization with Panoramic Street-View and VHR Satellite Imagery in Dece…☆20Nov 10, 2025Updated 3 months ago
- [www2025]DSFNet: Learning Disentangled Scenario Factorization for Multi-Scenario Route Ranking. This paper’s open dataset and implementat…☆29Sep 9, 2025Updated 5 months ago
- Official repository for “Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space”☆18Jan 27, 2026Updated last month
- [NeurIPS 2025] The official repo of "DynamicVL: Benchmarking Multimodal Large Language Models for Dynamic City Understanding".☆25Feb 7, 2026Updated 3 weeks ago
- ☆19Aug 11, 2025Updated 6 months ago
- Official repo for "DynaMITe: Dynamic Query Bootstrapping for Multi-object Interactive Segmentation Transformer"☆19Sep 29, 2023Updated 2 years ago
- [CVPR 2025] iSegMan: Interactive Segment-and-Manipulate 3D Gaussians 🔥🔥🔥☆24Mar 12, 2025Updated 11 months ago
- [MM2024] FusionOcc: Multi-Modal Fusion for 3D Occupancy Prediction☆21Dec 6, 2024Updated last year
- The official implementation code for Plug-and-Play PPO: An Adaptive Point Prompt Optimizer Making SAM Greater.☆28Jan 28, 2026Updated last month
- ☆29Sep 20, 2025Updated 5 months ago
- Knowledge Graph Large Language Model (KG-LLM)☆36Jun 23, 2024Updated last year
- The Official benchmark for continual learning for deepfake audio detection☆21Sep 26, 2024Updated last year
- Continual Learning Method RWM for AAAI 2024☆22Sep 26, 2024Updated last year
- ☆39Jun 25, 2025Updated 8 months ago
- A web map showing the location and names of the Sentinel-2 grids☆44Feb 19, 2026Updated 2 weeks ago
- ☆23Apr 19, 2024Updated last year
- ☆23Apr 13, 2023Updated 2 years ago
- ☆30Jan 18, 2026Updated last month
- Satellite-Ground Fusion for 3D Semantic Scene Completion☆28Sep 8, 2025Updated 5 months ago
- ☆25Jun 16, 2024Updated last year
- ☆26Sep 30, 2025Updated 5 months ago
- FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)☆26Feb 22, 2024Updated 2 years ago
- Official Repository of VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents☆99Feb 2, 2026Updated last month
- Official code of "ALIM: Adjusting Label Importance Mechanism for Noisy Partial Label Learning"☆24Sep 25, 2023Updated 2 years ago