geshang777/Seg-R1

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/geshang777/Seg-R1)

geshang777 / Seg-R1

[NeurIPS-W 2025] Official Implementation of "Seg-R1: Segmentation Can Be Surprisingly Simple with Reinforcement Learning"

☆72

Alternatives and similar repositories for Seg-R1

Users that are interested in Seg-R1 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

aim-uofa / SegAgent
View on GitHub
[CVPR2025] SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories
☆106Aug 8, 2025Updated 11 months ago
AI-Application-and-Integration-Lab / SAM4MLLM
View on GitHub
[ECCV 2024] SAM4MLLM: Enhance Multi-Modal Large Language Model for Referring Expression Segmentation
☆51Mar 20, 2025Updated last year
geshang777 / pix2cap
View on GitHub
Official Implementation of "Pix2Cap-COCO: Advancing Visual Comprehension via Pixel-Level Captioning"
☆28Dec 16, 2025Updated 7 months ago
songw-zju / PixelThink
View on GitHub
The official implementation of "PixelThink: Towards Efficient Chain-of-Pixel Reasoning" (ICML 2026)
☆43Jul 4, 2026Updated 2 weeks ago
1e12Leon / RemoteReasoner
View on GitHub
[AAAI 26] Official repo of "RemoteReasoner: Towards Unifying Geospatial Reasoning Workflow"
☆16Nov 24, 2025Updated 7 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
rui-qian / UGround
View on GitHub
Rui Qian, Xin Yin, Chuanhang Deng, et al.: UGround: Towards Unified Visual Grounding with Unrolled Transformers (ICML 2026)
☆29Jun 18, 2026Updated last month
JIA-Lab-research / VisionReasoner
View on GitHub
[ICLR 2026] VisionReasoner: Unified Reasoning-Integrated Visual Perception via Reinforcement Learning
☆348Feb 9, 2026Updated 5 months ago
JIA-Lab-research / Seg-Zero
View on GitHub
Project Page For "Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement"
☆635Jan 17, 2026Updated 6 months ago
yayafengzi / ALToLLM
View on GitHub
ALTo: Adaptive-Length Tokenizer for Autoregressive Mask Generation
☆30May 27, 2025Updated last year
likaiucas / DragOSM
View on GitHub
TPAMI Underreview paper: DragOSM
☆19Feb 26, 2026Updated 4 months ago
earth-insights / Advanced-Earth-Observation
View on GitHub
Paper List on Earth Observation in the Foundation Model Era
☆31Jun 15, 2026Updated last month
wanghao9610 / X-SAM
View on GitHub
[AAAI2026] X-SAM: From Segment Anything to Any Segmentation
☆382Jul 14, 2026Updated last week
eVI-group-SCU / Dr-Seg
View on GitHub
[CVPR'26] Dr. Seg: Revisiting GRPO Training for Visual Large Language Models through Perception-Oriented Design
☆31Mar 7, 2026Updated 4 months ago
suikei-wang / RESAnything
View on GitHub
[NeurIPS 2025] RESAnything: Attribute Prompting for Arbitrary Referring Segmentation
☆19May 26, 2026Updated last month
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
mc-lan / Awesome-MLLM-Segmentation
View on GitHub
A curated list of publications on image and video segmentation leveraging Multimodal Large Language Models (MLLMs), highlighting state-of…
☆229Jun 28, 2026Updated 3 weeks ago
StriveZs / ALPS
View on GitHub
ALPS: An Auto-Labeling and Pre-training Scheme for Remote Sensing Segmentation With Segment Anything Model
☆21Aug 20, 2024Updated last year
Yanhui-Lee / IAD-R1
View on GitHub
We propose IAD-R1, a universal post-training framework that enhances Vision-Language Models for industrial anomaly detection through a tw…
☆95Dec 9, 2025Updated 7 months ago
guangqian-guo / GleSAM
View on GitHub
The official code of our CVPR2025 paper: "Segment Any-Quality Images with Generative Latent Space Enhancement".
☆44Sep 27, 2025Updated 9 months ago
echo840 / LIRA
View on GitHub
[ICCV 2025] LIRA
☆22Nov 25, 2025Updated 7 months ago
PolyU-ChenLab / UniPixel
View on GitHub
🔮 UniPixel: Unified Object Referring and Segmentation for Pixel-Level Visual Reasoning (NeurIPS 2025)
☆247Jan 4, 2026Updated 6 months ago
zhangzilongc / SOFS
View on GitHub
☆42Jan 30, 2025Updated last year
Inha-CVAI / M2SFormer_ICCV2025
View on GitHub
ICCV2025 Accepted Paper "M2SFormer: Multi-Spectral and Multi-Scale Attention with Edge-Aware Difficulty Guidance for Image Forgery Locali…
☆18Jul 2, 2026Updated 2 weeks ago
yayafengzi / LMM-HiMTok
View on GitHub
HiMTok: Learning Hierarchical Mask Tokens for Image Segmentation with Large Multimodal Model
☆97Jul 17, 2025Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Kyyle2114 / Convolutional-Adapter-for-Segment-Anything
View on GitHub
CAD - Memory Efficient Convolutional Adapter for Segment Anything
☆12Oct 4, 2024Updated last year
letitiabanana / PnP-OVSS
View on GitHub
[CVPR'24] Code for Emergent Open-Vocabulary Semantic Segmentation from Off-the-shelf Vision-Language Models
☆18Jul 22, 2024Updated last year
HKUST-LongGroup / STAMP
View on GitHub
[CVPR 2026] STAMP: Better, Stronger, Faster: Tackling the Trilemma in MLLM-based Segmentation with Simultaneous Textual Mask Prediction
☆39Feb 21, 2026Updated 5 months ago
Yxxxb / LAVT-RS
View on GitHub
[CVPR'2022, TPAMI'2024] LAVT: Language-Aware Vision Transformer for Referring Segmentation
☆26Jan 21, 2025Updated last year
arnaudjudge / RL4Seg
View on GitHub
Domain adaptation framework for segmentation via reinforcement learning.
☆16Updated this week
SuhoPark0706 / FCP
View on GitHub
Official code for Foreground-Covering Prototype Generation and Matching for SAM-Aided Few-Shot Segmentation
☆39Jan 22, 2025Updated last year
jerrywyn / MEET_code
View on GitHub
☆17Feb 26, 2025Updated last year
StevenMsy / DirectSAM-RS
View on GitHub
official code for "DirectSAM-RS"
☆83Nov 18, 2025Updated 8 months ago
HKUST-LongGroup / DyME
View on GitHub
[ICLR 2026] Empowering Small VLMs to Think with Dynamic Memorization and Exploration
☆18Mar 18, 2026Updated 4 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
guanwei49 / EMIT
View on GitHub
EMIT: Enhancing MLLMs for Industrial Anomaly Detection via Difficulty-Aware GRPO
☆27Jan 24, 2026Updated 5 months ago
MIGHTYEZ / Inversion-DPO
View on GitHub
☆19Jul 22, 2025Updated 11 months ago
saccharomycetes / mllms_know
View on GitHub
[ICLR'25] Official code for the paper 'MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs'
☆380Apr 20, 2025Updated last year
KupynOrest / s3od
View on GitHub
[ICLR 2026] Official repo for S3OD: Towards Generalizable Salient Object Detection with Synthetic Data
☆43Jun 3, 2026Updated last month
JacobSRPage / super-res-dynamical
View on GitHub
☆12Dec 13, 2024Updated last year
debby-0527 / SAM3-I
View on GitHub
Official code and resources for SAM3-I.
☆175Apr 14, 2026Updated 3 months ago
VisionXLab / Awesome-RS-VL-Data
View on GitHub
Awesome Remote Sensing Vision-Language Datasets
☆95Jun 25, 2026Updated 3 weeks ago