Ferry-Li / SI-SODLinks
ICML2024: Size-invariance Matters: Rethinking Metrics and Losses for Imbalanced Multi-object Salient Object Detection
☆31Updated last year
Alternatives and similar repositories for SI-SOD
Users that are interested in SI-SOD are comparing it to the libraries listed below
Sorting:
- [ICCV 2025] HQ-CLIP: Leveraging Large Vision-Language Models to Create High-Quality Image-Text Datasets☆55Updated 3 months ago
- [CVPR 2024 Highlight] Official GraCo: Granularity-Controllable Interactive Segmentation.☆61Updated 8 months ago
- DiverGen (CVPR 2024) & BSGAL (ICML 2024)☆49Updated 4 months ago
- code for paper: Simultaneous Image to Zero and Zero to Noise: Diffusion Models with Analytical Image Attenuation☆58Updated this week
- ☆94Updated last year
- ☆59Updated last year
- [CVPR 2025] DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception☆144Updated 5 months ago
- [NeurIPS 2023] FreeMask: Synthetic Images with Dense Annotations Make Stronger Segmentation Models☆133Updated last year
- ☆52Updated 11 months ago
- ☆194Updated 6 months ago
- RESAnything: Attribute Prompting for Arbitrary Referring Segmentation☆16Updated last week
- [AAAI 2025] Official Implementation of "FOCUS: Towards Universal Foreground Segmentation"☆55Updated 4 months ago
- Official PyTorch implementation for TCSVT 23 "Detect Any Shadow: Segment Anything for Video Shadow Detection"☆65Updated last year
- ☆71Updated last year
- ☆130Updated last year
- Official Implementation for CVPR 2024 paper: CLIP as RNN: Segment Countless Visual Concepts without Training Endeavor☆110Updated last year
- [NeurIPS-W 2025] Official Implementation of "Seg-R1: Segmentation Can Be Surprisingly Simple with Reinforcement Learning"☆53Updated 4 months ago
- [CVPR2025] Project for "HyperSeg: Towards Universal Visual Segmentation with Large Language Model".☆177Updated 11 months ago
- Official Implementation of "VAU-R1: Advancing Video Anomaly Understanding via Reinforcement Fine-Tuning".☆56Updated last week
- [ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction☆198Updated last year
- 🔮 UniPixel: Unified Object Referring and Segmentation for Pixel-Level Visual Reasoning (NeurIPS 2025)☆190Updated last month
- Recognize Any Regions☆122Updated 11 months ago
- ☆52Updated 2 years ago
- Video Reasoning Segmentation☆27Updated last year
- A simple baseline for image composition using text-guided inpainting model☆21Updated 4 months ago
- [CVPR 2025] Official repository of the paper "Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation"☆116Updated last month
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference☆95Updated 8 months ago
- [CVPR 2024] LAKE-RED: Camouflaged Images Generation by Latent Background Knowledge Retrieval-Augmented Diffusion.☆49Updated 10 months ago
- Rex-Thinker: Grounded Object Refering via Chain-of-Thought Reasoning☆127Updated 5 months ago
- ☆96Updated 3 months ago