congvvc/InstructSeg

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/congvvc/InstructSeg)

congvvc / InstructSeg

[ICCV 2025] Official implementation of "InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models"

☆56

Alternatives and similar repositories for InstructSeg

Users that are interested in InstructSeg are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

cilinyan / ReVOS-api
View on GitHub
[ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model
☆22Jul 20, 2024Updated 2 years ago
cilinyan / VISA
View on GitHub
[ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model
☆213Aug 5, 2024Updated last year
congvvc / LaSagnA
View on GitHub
Project for "LaSagnA: Language-based Segmentation Assistant for Complex Queries".
☆63Apr 29, 2024Updated 2 years ago
showlab / VideoLISA
View on GitHub
[NeurlPS 2024] One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos
☆148Dec 26, 2024Updated last year
rkzheng99 / ViLLa
View on GitHub
Video Reasoning Segmentation
☆26Nov 29, 2024Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
berkeley-hipie / segllm
View on GitHub
Code release for "SegLLM: Multi-round Reasoning Segmentation"
☆129Feb 20, 2025Updated last year
dengandong / GroundMoRe
View on GitHub
☆18May 18, 2026Updated 2 months ago
zamling / PSALM
View on GitHub
[ECCV2024] This is an official implementation for "PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model"
☆269Dec 30, 2024Updated last year
jdg900 / MMR
View on GitHub
[ICLR 2025] Official Pytorch Implementation of MMR: A Large-scale Benchmark Dataset for Multi-target and Multi-granularity Reasoning Segm…
☆28Apr 3, 2025Updated last year
lizhou-cs / mglmm
View on GitHub
☆32Jun 14, 2026Updated last month
linsun449 / iseg.code
View on GitHub
This repo is the official implementation of iSeg: An Iterative Refinement-based Framework for Training-free Segmentation.
☆42May 25, 2026Updated last month
MaverickRen / PixelLM
View on GitHub
[CVPR 2024] PixelLM is an effective and efficient LMM for pixel-level reasoning and understanding.
☆273Feb 11, 2025Updated last year
Shengcao-Cao / groundLMM
View on GitHub
Emergent Visual Grounding in Large Multimodal Models Without Grounding Supervision
☆47Oct 19, 2025Updated 9 months ago
JIA-Lab-research / Seg-Zero
View on GitHub
Project Page For "Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement"
☆635Jan 17, 2026Updated 6 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Geo-R1 / geo-r1
View on GitHub
☆16Sep 25, 2025Updated 9 months ago
rui-qian / READ
View on GitHub
Rui Qian, Xin Yin, Dejing Dou†: Reasoning to Attend: Try to Understand How <SEG> Token Works (CVPR 2025)
☆54Feb 4, 2026Updated 5 months ago
Becomebright / MTV
View on GitHub
Revisiting Multi-Task Visual Representation Learning
☆22Jan 21, 2026Updated 6 months ago
Robertwyq / Object-Affinity
View on GitHub
[TPAMI 2023] Object Affinity Learning: Towards Annotation-free Instance Segmentation
☆14Sep 14, 2023Updated 2 years ago
As-Time-Goes-By / OmniSegNet
View on GitHub
☆19Apr 11, 2026Updated 3 months ago
songw-zju / PixelThink
View on GitHub
The official implementation of "PixelThink: Towards Efficient Chain-of-Pixel Reasoning" (ICML 2026)
☆43Jul 4, 2026Updated 2 weeks ago
mc-lan / Text4Seg
View on GitHub
[ICLR2025] Text4Seg: Reimagining Image Segmentation as Text Generation
☆176Nov 8, 2025Updated 8 months ago
earth-insights / RS-MTDF
View on GitHub
RS-MTDF: Multi-Teacher Distillation and Fusion for Remote Sensing Semi-Supervised Semantic Segmentation
☆22Jun 15, 2025Updated last year
1e12Leon / RemoteReasoner
View on GitHub
[AAAI 26] Official repo of "RemoteReasoner: Towards Unifying Geospatial Reasoning Workflow"
☆16Nov 24, 2025Updated 7 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
SitongGong / VRS-HQ
View on GitHub
High Quality Video Reasoning Segmentation
☆151Nov 24, 2025Updated 7 months ago
hustvl / MaskAdapter
View on GitHub
[CVPR 2025] Official repository of the paper "Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation"
☆135Oct 23, 2025Updated 8 months ago
baoxiaoyi / CoReS
View on GitHub
code for the paper "CoReS: Orchestrating the Dance of Reasoning and Segmentation"
☆23Nov 24, 2025Updated 7 months ago
rkzheng99 / TMT-VIS
View on GitHub
Taxonomy-aware Multi-dataset Joint Training for Video Instance Segmentation (NeurIPS 23)
☆12May 7, 2025Updated last year
xuliu-cyber / RSUniVLM
View on GitHub
☆46Apr 16, 2026Updated 3 months ago
ylingfeng / Add-SD
View on GitHub
Official implementation of Add-SD: Rational Generation without Manual Reference.
☆28Aug 19, 2024Updated last year
wzp8023391 / Interactive-CD-tool
View on GitHub
☆10Dec 12, 2023Updated 2 years ago
dahyun-kang / lavg
View on GitHub
[ECCV'24] Official PyTorch implementation of In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation
☆51Sep 24, 2024Updated last year
mc-lan / ProxyCLIP
View on GitHub
[ECCV2024] ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation
☆120Mar 26, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ysj9909 / StAR
View on GitHub
[ECCV 2026] StAR: Segment Anything Reasoner
☆25Apr 2, 2026Updated 3 months ago
Hansxsourse / VRMDiff
View on GitHub
☆11Mar 11, 2025Updated last year
earth-insights / GeoPlan-bench
View on GitHub
GeoPlan-bench is a benchmark platform for evaluating agents in remote sensing task planning. The platform provides a complete workflow fo…
☆24Dec 10, 2025Updated 7 months ago
wusize / F-LMM
View on GitHub
[CVPR2025] Code Release of F-LMM: Grounding Frozen Large Multimodal Models
☆115May 29, 2025Updated last year
pipilurj / perceptionGPT
View on GitHub
☆18Aug 7, 2024Updated last year
zhu-xlab / GlobalBuildingMap
View on GitHub
☆16Dec 15, 2025Updated 7 months ago
LeapLabTHU / GSVA
View on GitHub
[CVPR2024] GSVA: Generalized Segmentation via Multimodal Large Language Models
☆166Sep 12, 2024Updated last year