[NeurIPS-W 2025] Official Implementation of "Seg-R1: Segmentation Can Be Surprisingly Simple with Reinforcement Learning"
☆60Jul 1, 2025Updated 8 months ago
Alternatives and similar repositories for Seg-R1
Users that are interested in Seg-R1 are comparing it to the libraries listed below
Sorting:
- Official Implementation of "Pix2Cap-COCO: Advancing Visual Comprehension via Pixel-Level Captioning"☆25Dec 16, 2025Updated 2 months ago
- Code for "YOLOv8-SMOT: An Efficient and Robust Framework for Real-Time Small Object Tracking via Slice-Assisted Training and Adaptive Ass…☆19Nov 5, 2025Updated 3 months ago
- [AAAI 2025] Official Implementation of "FOCUS: Towards Universal Foreground Segmentation"☆56Jul 8, 2025Updated 7 months ago
- CAD - Memory Efficient Convolutional Adapter for Segment Anything☆12Oct 4, 2024Updated last year
- RESAnything: Attribute Prompting for Arbitrary Referring Segmentation☆17Nov 28, 2025Updated 3 months ago
- ☆40Jan 30, 2025Updated last year
- Official code for Foreground-Covering Prototype Generation and Matching for SAM-Aided Few-Shot Segmentation☆36Jan 22, 2025Updated last year
- Paper List on Earth Observation in the Foundation Model Era☆28Dec 25, 2025Updated 2 months ago
- ☆31Sep 19, 2025Updated 5 months ago
- RecConv: Efficient Recursive Convolutions for Multi-Frequency Representations☆19Oct 13, 2025Updated 4 months ago
- [CVPR'2022, TPAMI'2024] LAVT: Language-Aware Vision Transformer for Referring Segmentation☆24Jan 21, 2025Updated last year
- A paper list of self-supervised pretrain method☆22Aug 15, 2025Updated 6 months ago
- UICrit is a dataset containing human-generated natural language design critiques, corresponding bounding boxes for each critique, and des…☆26Nov 19, 2024Updated last year
- [CVPR 2025] SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation☆225Oct 16, 2025Updated 4 months ago
- (ICCV 2025) DictAS: A Framework for Class-Generalizable Few-Shot Anomaly Segmentation via Dictionary Lookup☆54Dec 13, 2025Updated 2 months ago
- Official PyTorch implementation of the paper "Recycling Discriminator: Towards Opinion-Unaware Image Quality Assessment Using Wasserstein…☆25Oct 28, 2021Updated 4 years ago
- ☆27Sep 13, 2022Updated 3 years ago
- [CVPR2025] SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories☆92Aug 8, 2025Updated 6 months ago
- ☆28Sep 9, 2023Updated 2 years ago
- [ACM MM 25] Official repo of "RemoteSAM: Towards Segment Anything for Earth Observation"☆207Jan 4, 2026Updated last month
- Guide to build FFmpeg from source with Netflix's libvmaf on Ubuntu 18.04☆11Oct 12, 2020Updated 5 years ago
- ☆18Sep 23, 2025Updated 5 months ago
- Official repository of paper titled "CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications…☆87Jan 15, 2026Updated last month
- [ICCV2025] SeaS: Few-shot Industrial Anomaly Image Generation with Separation and Sharing Fine-tuning. Paper is available at https://arxi…☆130Aug 4, 2025Updated 6 months ago
- ☆36Apr 14, 2023Updated 2 years ago
- We propose IAD-R1, a universal post-training framework that enhances Vision-Language Models for industrial anomaly detection through a tw…☆66Dec 9, 2025Updated 2 months ago
- VisionReasoner: Unified Reasoning-Integrated Visual Perception via Reinforcement Learning☆321Feb 9, 2026Updated 2 weeks ago
- [TIP2025] The implementation of "Uncertainty Guided Refinement for Fine-grained Salient Object Detection"☆15Apr 20, 2025Updated 10 months ago
- ☆57Mar 6, 2025Updated 11 months ago
- [NeurIPS 2025 Spotlight] Transformer Copilot: Learning from The Mistake Log in LLM Fine-tuning☆17Nov 14, 2025Updated 3 months ago
- 肺部CT图像分割系统(MySQL,TKinter)By U-Net☆13May 28, 2024Updated last year
- SpeedVision is an AI-powered tool that detects and calculates vehicle speed from video footage using YOLO-based object detection and fram…☆10Sep 22, 2024Updated last year
- ☆15Nov 11, 2024Updated last year
- source code for ICCV2021 paper "MFNet: Multi-filter Directive Network for Weakly Supervised Salient Object Detection"☆11Jul 17, 2022Updated 3 years ago
- The official code of "Thinking With Videos: Multimodal Tool-Augmented Reinforcement Learning for Long Video Reasoning"☆82Oct 15, 2025Updated 4 months ago
- The official repo for "Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes", ECCV 2024☆50Oct 12, 2025Updated 4 months ago
- [CVPR2025] AnomalyNCD: Towards Novel Anomaly Class Discovery in Industrial Scenarios. Paper is available at https://arxiv.org/abs/2410.14…☆147Sep 1, 2025Updated 5 months ago
- ☆37Oct 29, 2025Updated 4 months ago
- Self-Supervised Multi-Scale Transformer with Attention-Guided Fusion for Efficient Crack Detection☆24Jan 17, 2026Updated last month