[NeurIPS-W 2025] Official Implementation of "Seg-R1: Segmentation Can Be Surprisingly Simple with Reinforcement Learning"
☆64Jul 1, 2025Updated 8 months ago
Alternatives and similar repositories for Seg-R1
Users that are interested in Seg-R1 are comparing it to the libraries listed below
Sorting:
- Official Implementation of "Pix2Cap-COCO: Advancing Visual Comprehension via Pixel-Level Captioning"☆26Dec 16, 2025Updated 3 months ago
- RESAnything: Attribute Prompting for Arbitrary Referring Segmentation☆17Nov 28, 2025Updated 3 months ago
- Paper List on Earth Observation in the Foundation Model Era☆30Updated this week
- [AAAI 2025] Official Implementation of "FOCUS: Towards Universal Foreground Segmentation"☆57Jul 8, 2025Updated 8 months ago
- [CVPR2026 🌟] The first attempt to Marine Open Vocabulary Instance Segmentation☆43Feb 24, 2026Updated 3 weeks ago
- [ECCV 2024] SAM4MLLM: Enhance Multi-Modal Large Language Model for Referring Expression Segmentation☆49Mar 20, 2025Updated last year
- CAD - Memory Efficient Convolutional Adapter for Segment Anything☆12Oct 4, 2024Updated last year
- [CVPR'2022, TPAMI'2024] LAVT: Language-Aware Vision Transformer for Referring Segmentation☆24Jan 21, 2025Updated last year
- ☆41Jan 30, 2025Updated last year
- Code for "YOLOv8-SMOT: An Efficient and Robust Framework for Real-Time Small Object Tracking via Slice-Assisted Training and Adaptive Ass…☆20Nov 5, 2025Updated 4 months ago
- [ACM MM 25] Official repo of "RemoteSAM: Towards Segment Anything for Earth Observation"☆216Jan 4, 2026Updated 2 months ago
- (ICCV 2025) DictAS: A Framework for Class-Generalizable Few-Shot Anomaly Segmentation via Dictionary Lookup☆57Dec 13, 2025Updated 3 months ago
- Official code for Foreground-Covering Prototype Generation and Matching for SAM-Aided Few-Shot Segmentation☆36Jan 22, 2025Updated last year
- ☆31Sep 19, 2025Updated 6 months ago
- [CVPR 2025] SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation☆237Oct 16, 2025Updated 5 months ago
- This is the official code repository for the paper: Towards General Continuous Memory for Vision-Language Models.☆23Jul 3, 2025Updated 8 months ago
- A paper list of self-supervised pretrain method☆22Aug 15, 2025Updated 7 months ago
- Qwen-SAM is a reasoning-based segmentation model that integrates Qwen 2.5 VL 7B with the Segment Anything Model (SAM), enabling fine-grai…☆27Jun 4, 2025Updated 9 months ago
- Parameter Efficient Fine-Tuning of Segment Anything Model☆32Oct 8, 2025Updated 5 months ago
- VisionReasoner: Unified Reasoning-Integrated Visual Perception via Reinforcement Learning☆325Feb 9, 2026Updated last month
- EMIT: Enhancing MLLMs for Industrial Anomaly Detection via Difficulty-Aware GRPO☆21Jan 24, 2026Updated last month
- UICrit is a dataset containing human-generated natural language design critiques, corresponding bounding boxes for each critique, and des…☆26Nov 19, 2024Updated last year
- [MM 2023] Toward High Quality Facial Representation Learning☆19Oct 30, 2023Updated 2 years ago
- Project Page For "Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement"☆614Jan 17, 2026Updated 2 months ago
- ☆36Apr 14, 2023Updated 2 years ago
- ☆18Jan 5, 2026Updated 2 months ago
- ☆14May 26, 2025Updated 9 months ago
- [ACM Multimedia 2025🎉] The project for the paper titled "MediSee: Reasoning-based Pixel-level Perception in Medical Images"☆26Nov 19, 2025Updated 4 months ago
- SAM Adaptation using SVD☆12Jul 13, 2025Updated 8 months ago
- ☆27Sep 13, 2022Updated 3 years ago
- SPICE: A Synergistic, Precise, Iterative, and Customizable Image Editing Workflow☆24Feb 6, 2026Updated last month
- Efficient Semantic Fine-grained Prior Generation and Refinement Decoder Based on SAM for Improved Multi-organ Segmentation☆21Mar 26, 2025Updated 11 months ago
- FNIN: A Fourier Neural Operator-based Numerical Integration Network for Surface-form-gradients☆13Jan 22, 2025Updated last year
- SDGAN: Disentangling Semantic Manipulation for Facial Attribute Editing☆13Apr 21, 2024Updated last year
- ☆35Oct 27, 2025Updated 4 months ago
- ☆18Jul 22, 2025Updated 7 months ago
- [CVPR 2026] ZoomEarth: Active Perception for Ultra-High-Resolution Geospatial Vision-Language Tasks☆32Mar 10, 2026Updated last week
- Concealed Scene Understanding, Visual Intelligence (VI), 2023☆70Aug 15, 2025Updated 7 months ago
- ☆54Feb 9, 2026Updated last month