The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
☆36,766May 8, 2026Updated this week
Alternatives and similar repositories for pytorch-image-models
Users that are interested in pytorch-image-models are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Py…☆25,159May 1, 2026Updated last week
- OpenMMLab Detection Toolbox and Benchmark☆32,679Aug 21, 2024Updated last year
- This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".☆15,901Jul 24, 2024Updated last year
- Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125☆15,287Jun 25, 2025Updated 10 months ago
- A PyTorch implementation of EfficientNet☆8,216Apr 8, 2022Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.☆34,456Apr 7, 2026Updated last month
- ☆12,513Mar 3, 2026Updated 2 months ago
- End-to-End Object Detection with Transformers☆15,267Mar 12, 2024Updated 2 years ago
- Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.☆11,537Updated this week
- Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.☆31,128Updated this week
- Official DeiT repository☆4,342Mar 15, 2024Updated 2 years ago
- Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, I…☆12,803Apr 25, 2026Updated 2 weeks ago
- Code release for ConvNeXt model☆6,366Jan 8, 2023Updated 3 years ago
- CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image☆33,477Mar 25, 2026Updated last month
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Pretrained ConvNets for pytorch: NASNet, ResNeXt, ResNet, InceptionV4, InceptionResnetV2, Xception, DPN, etc.☆9,110Apr 22, 2022Updated 4 years ago
- 🐍 Geometric Computer Vision Library for Spatial AI☆11,202Updated this week
- PyTorch implementation of MAE https//arxiv.org/abs/2111.06377☆8,308Jul 23, 2024Updated last year
- Datasets, Transforms and Models specific to Computer Vision☆17,667Updated this week
- 🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal model…☆160,559Updated this week
- OpenMMLab Semantic Segmentation Toolbox and Benchmark.☆9,786Aug 13, 2024Updated last year
- The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoi…☆54,121Sep 18, 2024Updated last year
- An open source implementation of CLIP.☆13,800Updated this week
- PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO☆7,551Jul 3, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- CVPR 2026 论文和开源项目合集☆22,539Mar 8, 2026Updated 2 months ago
- Image augmentation for machine learning experiments.☆14,735Jul 30, 2024Updated last year
- The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.☆6,320Aug 17, 2025Updated 8 months ago
- Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)☆9,482Apr 19, 2026Updated 3 weeks ago
- A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch☆8,961Updated this week
- YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite☆57,372May 6, 2026Updated last week
- 🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.☆33,599Updated this week
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities☆22,123Jan 23, 2026Updated 3 months ago
- 🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.…☆12,178Mar 16, 2026Updated last month
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- RepVGG: Making VGG-style ConvNets Great Again☆3,474Feb 10, 2023Updated 3 years ago
- Visualizer for neural network, deep learning and machine learning models☆32,855Updated this week
- PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722☆5,135Feb 3, 2026Updated 3 months ago
- OpenMMLab Computer Vision Foundation☆6,441Jan 29, 2026Updated 3 months ago
- YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documenta…☆10,451Jun 8, 2025Updated 11 months ago
- A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.☆23,879Sep 1, 2025Updated 8 months ago
- Object detection, 3D detection, and pose estimation using center point detection:☆7,568Mar 2, 2023Updated 3 years ago