facebookresearch/Mask2Former

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/facebookresearch/Mask2Former)

facebookresearch / Mask2Former

Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"

☆3,408

Alternatives and similar repositories for Mask2Former

Users that are interested in Mask2Former are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

facebookresearch / MaskFormer
View on GitHub
Per-Pixel Classification is Not All You Need for Semantic Segmentation (NeurIPS 2021, spotlight)
☆1,462Mar 11, 2022Updated 4 years ago
IDEA-Research / MaskDINO
View on GitHub
[CVPR 2023] Official implementation of the paper "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segme…
☆1,541Dec 20, 2023Updated 2 years ago
SHI-Labs / OneFormer
View on GitHub
[CVPR 2023] OneFormer: One Transformer to Rule Universal Image Segmentation
☆1,730Oct 3, 2024Updated last year
czczup / ViT-Adapter
View on GitHub
[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions
☆1,502Jun 3, 2025Updated last year
open-mmlab / mmsegmentation
View on GitHub
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
☆9,877Aug 13, 2024Updated last year
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
NVlabs / SegFormer
View on GitHub
Official PyTorch implementation of SegFormer
☆3,581Aug 2, 2024Updated last year
microsoft / X-Decoder
View on GitHub
[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language
☆1,346Oct 5, 2023Updated 2 years ago
ZwwWayne / K-Net
View on GitHub
[NeurIPS2021] Code Release of K-Net: Towards Unified Image Segmentation
☆484Dec 16, 2021Updated 4 years ago
fundamentalvision / Deformable-DETR
View on GitHub
Deformable DETR: Deformable Transformers for End-to-End Object Detection.
☆3,996May 16, 2024Updated 2 years ago
facebookresearch / dinov2
View on GitHub
PyTorch code and models for the DINOv2 self-supervised learning method.
☆13,109Jun 3, 2026Updated last month
microsoft / Swin-Transformer
View on GitHub
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
☆15,996Jul 24, 2024Updated last year
facebookresearch / detr
View on GitHub
End-to-End Object Detection with Transformers
☆15,336Mar 12, 2024Updated 2 years ago
UX-Decoder / Semantic-SAM
View on GitHub
[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"
☆2,848Jul 10, 2025Updated last year
facebookresearch / ConvNeXt
View on GitHub
Code release for ConvNeXt model
☆6,413Jan 8, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
IDEA-Research / DINO
View on GitHub
[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"
☆2,825Jul 31, 2024Updated last year
UX-Decoder / Segment-Everything-Everywhere-All-At-Once
View on GitHub
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
☆4,794Aug 19, 2024Updated last year
facebookresearch / segment-anything
View on GitHub
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoi…
☆54,550Sep 18, 2024Updated last year
NVlabs / ODISE
View on GitHub
Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]
☆945Jul 6, 2024Updated 2 years ago
IDEA-Research / Grounded-Segment-Anything
View on GitHub
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and …
☆17,666Sep 5, 2024Updated last year
facebookresearch / mae
View on GitHub
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
☆8,364Jul 23, 2024Updated last year
facebookresearch / detectron2
View on GitHub
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
☆34,599Jun 7, 2026Updated last month
facebookresearch / dino
View on GitHub
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
☆7,600Jul 3, 2024Updated 2 years ago
YuqingWang1029 / VisTR
View on GitHub
[CVPR2021 Oral] End-to-End Video Instance Segmentation with Transformers
☆757Jul 15, 2021Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
facebookresearch / Detic
View on GitHub
Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".
☆2,007Mar 21, 2024Updated 2 years ago
huggingface / pytorch-image-models
View on GitHub
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --…
☆36,986Updated this week
facebookresearch / sam2
View on GitHub
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…
☆19,533May 30, 2026Updated last month
wjf5203 / SeqFormer
View on GitHub
SeqFormer: Sequential Transformer for Video Instance Segmentation (ECCV 2022 Oral)
☆350Aug 2, 2022Updated 3 years ago
IDEA-Research / GroundingDINO
View on GitHub
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
☆10,394Aug 12, 2024Updated last year
zzubqh / Mask2Former-Simplify
View on GitHub
☆178Dec 6, 2023Updated 2 years ago
aim-uofa / AdelaiDet
View on GitHub
AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.
☆3,483Aug 23, 2024Updated last year
OpenGVLab / InternImage
View on GitHub
[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
☆2,835Mar 25, 2025Updated last year
open-mmlab / mmdetection
View on GitHub
OpenMMLab Detection Toolbox and Benchmark
☆32,813Aug 21, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ShoufaChen / DiffusionDet
View on GitHub
[ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)
☆2,259Dec 22, 2022Updated 3 years ago
FoundationVision / VNext
View on GitHub
Next-generation Video instance recognition framework on top of Detectron2 which supports InstMove (CVPR 2023), SeqFormer(ECCV Oral), and…
☆617Feb 21, 2024Updated 2 years ago
IDEA-Research / detrex
View on GitHub
detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.
☆2,302Sep 11, 2025Updated 10 months ago
NVlabs / GroupViT
View on GitHub
Official PyTorch implementation of GroupViT: Semantic Segmentation Emerges from Text Supervision, CVPR 2022.
☆788May 10, 2022Updated 4 years ago
IDEA-Research / OpenSeeD
View on GitHub
[ICCV 2023] Official implementation of the paper "A Simple Framework for Open-Vocabulary Segmentation and Detection"
☆762Jan 22, 2024Updated 2 years ago
microsoft / GLIP
View on GitHub
Grounded Language-Image Pre-training
☆2,604Jan 24, 2024Updated 2 years ago
JIA-Lab-research / LISA
View on GitHub
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
☆2,660Feb 16, 2025Updated last year