frank-xwang/UnSAM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/frank-xwang/UnSAM)

frank-xwang / UnSAM

[NeurIPS 2024] Code release for "Segment Anything without Supervision"

☆503

Alternatives and similar repositories for UnSAM

Users that are interested in UnSAM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

facebookresearch / CutLER
View on GitHub
Code release for "Cut and Learn for Unsupervised Object Detection and Instance Segmentation" and "VideoCutLER: Surprisingly Simple Unsupe…
☆1,071Apr 14, 2026Updated 3 months ago
HarborYuan / ovsam
View on GitHub
[ECCV 2024] The official code of paper "Open-Vocabulary SAM".
☆1,031Aug 4, 2025Updated 11 months ago
u2seg / U2Seg
View on GitHub
[CVPR 2024] Code release for "Unsupervised Universal Image Segmentation"
☆233May 7, 2024Updated 2 years ago
UX-Decoder / DINOv
View on GitHub
[CVPR 2024] Official implementation of the paper "Visual In-context Learning"
☆542Apr 8, 2024Updated 2 years ago
NVlabs / RADIO
View on GitHub
Official repository for "AM-RADIO: Reduce All Domains Into One"
☆1,901May 29, 2026Updated last month
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
UX-Decoder / Semantic-SAM
View on GitHub
[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"
☆2,853Jul 10, 2025Updated last year
yujunwei04 / UnSAMv2
View on GitHub
Code release for "UnSAMv2: Self-Supervised Learning Enables Segment Anything at Any Granularity"
☆82Feb 1, 2026Updated 5 months ago
siyuanliii / masa
View on GitHub
Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything
☆1,375May 1, 2025Updated last year
lxtGH / OMG-Seg
View on GitHub
Official Repo For OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]
☆1,351Oct 15, 2025Updated 9 months ago
fanq15 / Stable-SAM
View on GitHub
☆73Dec 6, 2023Updated 2 years ago
facebookresearch / sam2
View on GitHub
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…
☆19,595May 30, 2026Updated last month
xushilin1 / RMP-SAM
View on GitHub
[ICLR 2025 oral] RMP-SAM: Towards Real-Time Multi-Purpose Segment Anything
☆271Apr 11, 2025Updated last year
yformer / EfficientSAM
View on GitHub
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
☆2,487Dec 24, 2024Updated last year
SysCV / sam-hq
View on GitHub
Segment Anything in High Quality [NeurIPS 2023]
☆4,246Sep 12, 2025Updated 10 months ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
NVlabs / ODISE
View on GitHub
Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]
☆945Jul 6, 2024Updated 2 years ago
berkeley-hipie / segllm
View on GitHub
Code release for "SegLLM: Multi-round Reasoning Segmentation"
☆129Feb 20, 2025Updated last year
facebookresearch / perception_models
View on GitHub
State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!
☆2,329Apr 13, 2026Updated 3 months ago
baaivision / tokenize-anything
View on GitHub
[ECCV 2024] Tokenize Anything via Prompting
☆601Dec 11, 2024Updated last year
qianduoduolr / DecoMotion
View on GitHub
[ECCV 2024] Decomposition Betters Tracking Everything Everywhere
☆112Jul 10, 2024Updated 2 years ago
FoundationVision / LlamaGen
View on GitHub
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
☆1,960Aug 15, 2024Updated last year
OpenGVLab / VisionLLM
View on GitHub
VisionLLM Series
☆1,153Feb 27, 2025Updated last year
robustsam / RobustSAM
View on GitHub
RobustSAM: Segment Anything Robustly on Degraded Images (CVPR 2024 Highlight)
☆368Aug 31, 2024Updated last year
apple / ml-4m
View on GitHub
4M: Massively Multimodal Masked Modeling
☆1,808Jun 2, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
FoundationVision / GLEE
View on GitHub
[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale
☆1,172Oct 21, 2024Updated last year
para-lost / ECHO
View on GitHub
Echo: "Constantly Improving Image Models Need Constantly Improving Benchmarks" (ICLR 2026)
☆20Jan 29, 2026Updated 5 months ago
Jiahao000 / MosaicFusion
View on GitHub
[IJCV 2024] MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation
☆129Oct 8, 2024Updated last year
mhamilton723 / FeatUp
View on GitHub
Official code for "FeatUp: A Model-Agnostic Frameworkfor Features at Any Resolution" ICLR 2024
☆1,652Jun 28, 2024Updated 2 years ago
earth-insights / awesome-layout-to-image
View on GitHub
An up-to-date & curated list of awesome layout to image papers, methods & resources.
☆13Jun 28, 2024Updated 2 years ago
hkchengrex / Tracking-Anything-with-DEVA
View on GitHub
[ICCV 2023] Tracking Anything with Decoupled Video Segmentation
☆1,506Apr 26, 2025Updated last year
UX-Decoder / Segment-Everything-Everywhere-All-At-Once
View on GitHub
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
☆4,795Aug 19, 2024Updated last year
Haiyang-W / GiT
View on GitHub
[ECCV2024 Oral🔥] Official Implementation of "GiT: Towards Generalist Vision Transformer through Universal Language Interface"
☆364Jan 14, 2025Updated last year
UX-Decoder / FIND
View on GitHub
[NeurIPS 2024] Official implementation of the paper "Interfacing Foundation Models' Embeddings"
☆132Aug 21, 2024Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
callsys / ControlCap
View on GitHub
[ECCV 2024] ControlCap: Controllable Region-level Captioning
☆81Oct 25, 2024Updated last year
OliverRensu / D-iGPT
View on GitHub
[ICML 2024] This repository includes the official implementation of our paper "Rejuvenating image-GPT as Strong Visual Representation Lea…
☆99May 3, 2024Updated 2 years ago
facebookresearch / dinov2
View on GitHub
PyTorch code and models for the DINOv2 self-supervised learning method.
☆13,157Jun 3, 2026Updated last month
IDEA-Research / Grounding-DINO-1.5-API
View on GitHub
Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
☆1,139Jan 21, 2025Updated last year
hustvl / EVF-SAM
View on GitHub
Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"
☆505Mar 17, 2025Updated last year
Jiawei-Yang / Denoising-ViT
View on GitHub
This is the official code release for our work, Denoising Vision Transformers.
☆399Nov 13, 2024Updated last year
baaivision / Emu3
View on GitHub
Next-Token Prediction is All You Need
☆2,433Jan 12, 2026Updated 6 months ago