OpenGVLab/InternImage

[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions

☆2,836

Alternatives and similar repositories for InternImage

Users who are interested in InternImage are comparing it to the libraries listed below.

czczup / ViT-Adapter
[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions
☆1,503 · Jun 3, 2025 · Updated last year
baaivision / EVA
EVA Series: Visual Representation Fantasies from BAAI
☆2,686 · Aug 1, 2024 · Updated last year
OpenGVLab / DCNv4
[CVPR 2024] Deformable Convolution v4
☆743 · May 17, 2024 · Updated 2 years ago
IDEA-Research / DINO
[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"
☆2,825 · Jul 31, 2024 · Updated last year
Sense-X / Co-DETR
[ICCV 2023] DETRs with Collaborative Hybrid Assignments Training
☆1,355 · Dec 29, 2024 · Updated last year
IDEA-Research / MaskDINO
[CVPR 2023] Official implementation of the paper "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segme…
☆1,542 · Dec 20, 2023 · Updated 2 years ago
IDEA-Research / detrex
detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.
☆2,303 · Sep 11, 2025 · Updated 10 months ago
baaivision / Painter
Painter & SegGPT Series: Vision Foundation Models from BAAI
☆2,593 · Dec 6, 2024 · Updated last year
IDEA-Research / Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and …
☆17,676 · Sep 5, 2024 · Updated last year
IDEA-Research / GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
☆10,415 · Aug 12, 2024 · Updated last year
open-mmlab / mmdetection
OpenMMLab Detection Toolbox and Benchmark
☆32,833 · Aug 21, 2024 · Updated last year
open-mmlab / mmsegmentation
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
☆9,878 · Aug 13, 2024 · Updated last year
UX-Decoder / Segment-Everything-Everywhere-All-At-Once
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
☆4,795 · Aug 19, 2024 · Updated last year
facebookresearch / segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoi…
☆54,561 · Sep 18, 2024 · Updated last year
facebookresearch / Mask2Former
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
☆3,412 · Jul 29, 2024 · Updated last year
SHI-Labs / OneFormer
[CVPR 2023] OneFormer: One Transformer to Rule Universal Image Segmentation
☆1,732 · Oct 3, 2024 · Updated last year
UX-Decoder / Semantic-SAM
[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"
☆2,852 · Jul 10, 2025 · Updated last year
microsoft / Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
☆16,003 · Jul 24, 2024 · Updated last year
facebookresearch / dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
☆13,124 · Jun 3, 2026 · Updated last month
huggingface / pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --…
☆36,993 · Updated this week
microsoft / GLIP
Grounded Language-Image Pre-training
☆2,605 · Jan 24, 2024 · Updated 2 years ago
OpenGVLab / STM-Evaluation
☆70 · Jun 9, 2026 · Updated last month
OpenGVLab / InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance
☆10,098 · Sep 22, 2025 · Updated 9 months ago
fundamentalvision / BEVFormer
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object …
☆4,546 · Aug 15, 2024 · Updated last year
CVPR2023-3D-Occupancy-Prediction / CVPR2023-3D-Occupancy-Prediction
CVPR2023-Occupancy-Prediction-Challenge
☆878 · Jul 31, 2023 · Updated 2 years ago
OpenGVLab / M3I-Pretraining
[CVPR 2023] implementation of Towards All-in-one Pre-training via Maximizing Multi-modal Mutual Information.
☆91 · Jun 1, 2023 · Updated 3 years ago
microsoft / FocalNet
[NeurIPS 2022] Official code for "Focal Modulation Networks"
☆749 · Nov 7, 2023 · Updated 2 years ago
fundamentalvision / Uni-Perceiver
☆291 · Aug 14, 2025 · Updated 11 months ago
facebookresearch / detr
End-to-End Object Detection with Transformers
☆15,349 · Mar 12, 2024 · Updated 2 years ago
microsoft / X-Decoder
[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language
☆1,346 · Oct 5, 2023 · Updated 2 years ago
SysCV / sam-hq
Segment Anything in High Quality [NeurIPS 2023]
☆4,244 · Sep 12, 2025 · Updated 10 months ago
fundamentalvision / Deformable-DETR
Deformable DETR: Deformable Transformers for End-to-End Object Detection.
☆4,001 · May 16, 2024 · Updated 2 years ago
IDEA-Research / OpenSeeD
[ICCV 2023] Official implementation of the paper "A Simple Framework for Open-Vocabulary Segmentation and Detection"
☆762 · Jan 22, 2024 · Updated 2 years ago
OpenDriveLab / UniAD
[CVPR 2023 Best Paper Award] Planning-oriented Autonomous Driving
☆4,692 · Oct 29, 2025 · Updated 8 months ago
facebookresearch / ConvNeXt-V2
Code release for ConvNeXt V2 model
☆2,065 · Aug 14, 2024 · Updated last year
ShoufaChen / DiffusionDet
[ICCV 2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)
☆2,259 · Dec 22, 2022 · Updated 3 years ago
facebookresearch / MaskFormer
Per-Pixel Classification is Not All You Need for Semantic Segmentation (NeurIPS 2021, spotlight)
☆1,462 · Mar 11, 2022 · Updated 4 years ago
OpenGVLab / VisionLLM
VisionLLM Series
☆1,152 · Feb 27, 2025 · Updated last year
facebookresearch / mae
PyTorch implementation of MAE https://arxiv.org/abs/2111.06377
☆8,366 · Jul 23, 2024 · Updated last year