[CVPR 2024] Code release for "Unsupervised Universal Image Segmentation"
☆230May 7, 2024Updated last year
Alternatives and similar repositories for U2Seg
Users that are interested in U2Seg are comparing it to the libraries listed below
Sorting:
- Code release for "Cut and Learn for Unsupervised Object Detection and Instance Segmentation" and "VideoCutLER: Surprisingly Simple Unsupe…☆1,059Jun 4, 2025Updated 9 months ago
- [NeurIPS 2024] Code release for "Segment Anything without Supervision"☆498Nov 20, 2025Updated 3 months ago
- Unsupervised Semantic Segmentation by Distilling Feature Correspondences☆785Mar 24, 2023Updated 2 years ago
- [CVPRW'23 Best Paper Award] Zero-shot Unsupervised Transfer Instance Segmentation☆24Aug 22, 2023Updated 2 years ago
- [NeurIPS 2023] SmooSeg: Smoothness Prior for Unsupervised Semantic Segmentation☆26Dec 5, 2023Updated 2 years ago
- [CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses tha…☆945Aug 5, 2025Updated 7 months ago
- [NeurIPS 2023] This repo contains the code for our paper Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convoluti…☆337Feb 5, 2024Updated 2 years ago
- 🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"☆47Jun 16, 2024Updated last year
- [NeurIPS 2023] FreeMask: Synthetic Images with Dense Annotations Make Stronger Segmentation Models☆131Dec 3, 2023Updated 2 years ago
- Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation☆37Sep 12, 2023Updated 2 years ago
- [ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction☆201Feb 5, 2024Updated 2 years ago
- Official implementation of 'CLIP-DINOiser: Teaching CLIP a few DINO tricks' paper.☆275Oct 26, 2024Updated last year
- Official Repo For OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]☆1,342Oct 15, 2025Updated 4 months ago
- This repo contains the code for our TMLR paper: A Simple Video Segmenter by Tracking Objects Along Axial Trajectories☆27Mar 20, 2025Updated 11 months ago
- ☆37Oct 18, 2023Updated 2 years ago
- A summary of recent unsupervised semantic segmentation methods☆100May 8, 2023Updated 2 years ago
- EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything☆17Dec 6, 2023Updated 2 years ago
- Unsupervised Semantic Segmentation by Contrasting Object Mask Proposals. [ICCV 2021]☆413Jun 14, 2022Updated 3 years ago
- [ECCV 2024] The official code of paper "Open-Vocabulary SAM".☆1,029Aug 4, 2025Updated 7 months ago
- Unsupervised Hierarchical Semantic Segmentation with Multiview Cosegmentation and Clustering Transformers☆74Apr 2, 2024Updated last year
- [ICCV'23] Cascade-DETR: Delving into High-Quality Universal Object Detection☆99Sep 12, 2023Updated 2 years ago
- a PyTorch re-implementation of ECCV 2022 paper based on Detectron2: k-means mask Transformer.☆81Jul 28, 2023Updated 2 years ago
- [CVPR'24] MiKASA: Multi-Key-Anchor & Scene-Aware Transformer for 3D Visual Grounding☆17Dec 13, 2024Updated last year
- FreeSOLO for unsupervised instance segmentation, CVPR 2022☆318Jan 16, 2023Updated 3 years ago
- Code For Our Work: DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries [ECCV-2024]☆14Jul 11, 2024Updated last year
- This is the official code release for our work, Denoising Vision Transformers.☆394Nov 13, 2024Updated last year
- [ECCV'24] Official PyTorch implementation of In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation☆49Sep 24, 2024Updated last year
- [CVPR 2023] OneFormer: One Transformer to Rule Universal Image Segmentation☆1,703Oct 3, 2024Updated last year
- ☆32Jun 1, 2023Updated 2 years ago
- [ICCV 2023] PointDC: Unsupervised Semantic Segmentation of 3D Point Clouds via Cross-modal Distillation and Super-Voxel Clustering☆36Nov 27, 2024Updated last year
- ☆18Nov 15, 2024Updated last year
- ☆17Dec 13, 2023Updated 2 years ago
- [ICLR 2025 oral] RMP-SAM: Towards Real-Time Multi-Purpose Segment Anything☆268Apr 11, 2025Updated 10 months ago
- [CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language☆1,343Oct 5, 2023Updated 2 years ago
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference☆97Mar 26, 2025Updated 11 months ago
- [CVPR 2024 Highlight] SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers☆73Jun 11, 2024Updated last year
- The official repository of paper "ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection" (N…☆50Oct 23, 2023Updated 2 years ago
- Instance-wise Occlusion and Depth Orders in Natural Scenes (CVPR 2022)☆45Apr 6, 2022Updated 3 years ago
- This repo contains the code for our paper Towards Open-Ended Visual Recognition with Large Language Model☆99Jul 15, 2024Updated last year