geshang777 / Seg-R1Links
[arXiv'25] Official Implementation of "Seg-R1: Segmentation Can Be Surprisingly Simple with Reinforcement Learning"
☆32Updated last month
Alternatives and similar repositories for Seg-R1
Users that are interested in Seg-R1 are comparing it to the libraries listed below
Sorting:
- ☆78Updated 2 months ago
- [CVPR2025] Project for "HyperSeg: Towards Universal Visual Segmentation with Large Language Model".☆162Updated 7 months ago
- [CVPR 2024 Highlight] Official GraCo: Granularity-Controllable Interactive Segmentation.☆58Updated 5 months ago
- New generation of CLIP with fine grained discrimination capability, ICML2025☆263Updated 2 weeks ago
- [ECCV 2024] SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding☆60Updated 9 months ago
- [ECCV 2024] SAM4MLLM: Enhance Multi-Modal Large Language Model for Referring Expression Segmentation,☆37Updated 4 months ago
- [ECCV2024] Official implementation of Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded Scenes☆89Updated 3 months ago
- ☆45Updated 7 months ago
- Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection☆89Updated 4 months ago
- ☆93Updated last year
- [AAAI2025] Official Implementation of "FOCUS: Towards Universal Foreground Segmentation"☆40Updated last month
- [ECCV2024] This is an official implementation for "PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model"☆247Updated 7 months ago
- [ICLR 2024 poster] Efficient Modulation for Vision Networks☆55Updated last year
- Official implementation of 🛸 "UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface"☆213Updated 2 months ago
- ☆41Updated last month
- ☆57Updated 10 months ago
- ☆70Updated last year
- Vision Manus: Your versatile Visual AI assistant☆245Updated last week
- Code release for "Weakly Supervised Open-Vocabulary Object Detection", AAAI2024☆35Updated 11 months ago
- InstaGen: Enhancing Object Detection by Training on Synthetic Dataset, CVPR2024☆81Updated last year
- Train InternViT-6B in MMSegmentation and MMDetection with DeepSpeed☆96Updated 9 months ago
- ☆143Updated last year
- [ICCV 2025] Official implementation of LLaVA-KD: A Framework of Distilling Multimodal Large Language Models☆91Updated last month
- ☆27Updated last year
- [AAAI 2025] Official implementation of the paper "EOV-Seg: Efficient Open-Vocabulary Panoptic Segmentation"☆29Updated 7 months ago
- [ECCV 2024] Official implementation of "LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction"☆82Updated 4 months ago
- [ICCV 2025] HQ-CLIP: Leveraging Large Vision-Language Models to Create High-Quality Image-Text Datasets☆45Updated this week
- This is the official pytorch implementation of DIS-SAM.☆14Updated 3 months ago
- [NeurIPS 2024 Spotlight ⭐️ & TPAMI 2025] Parameter-Inverted Image Pyramid Networks (PIIP)☆98Updated last week
- ☆124Updated last year