The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
☆36,504Mar 13, 2026Updated last week
Alternatives and similar repositories for pytorch-image-models
Users that are interested in pytorch-image-models are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- OpenMMLab Detection Toolbox and Benchmark☆32,507Aug 21, 2024Updated last year
- This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".☆15,767Jul 24, 2024Updated last year
- Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125☆15,286Jun 25, 2025Updated 8 months ago
- A PyTorch implementation of EfficientNet☆8,219Apr 8, 2022Updated 3 years ago
- Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.☆34,223Mar 16, 2026Updated last week
- ☆12,365Mar 3, 2026Updated 2 weeks ago
- End-to-End Object Detection with Transformers☆15,166Mar 12, 2024Updated 2 years ago
- Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.☆11,403Mar 16, 2026Updated last week
- Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.☆30,952Updated this week
- Official DeiT repository☆4,327Mar 15, 2024Updated 2 years ago
- Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, I…☆12,701Apr 7, 2025Updated 11 months ago
- Code release for ConvNeXt model☆6,319Jan 8, 2023Updated 3 years ago
- CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image☆32,861Feb 18, 2026Updated last month
- Pretrained ConvNets for pytorch: NASNet, ResNeXt, ResNet, InceptionV4, InceptionResnetV2, Xception, DPN, etc.☆9,112Apr 22, 2022Updated 3 years ago
- 🐍 Geometric Computer Vision Library for Spatial AI☆11,121Updated this week
- PyTorch implementation of MAE https//arxiv.org/abs/2111.06377☆8,243Jul 23, 2024Updated last year
- Datasets, Transforms and Models specific to Computer Vision☆17,566Updated this week
- 🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal model…☆158,060Updated this week
- OpenMMLab Semantic Segmentation Toolbox and Benchmark.☆9,672Aug 13, 2024Updated last year
- The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoi…☆53,684Sep 18, 2024Updated last year
- An open source implementation of CLIP.☆13,528Mar 12, 2026Updated last week
- PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO☆7,485Jul 3, 2024Updated last year
- CVPR 2026 论文和开源项目合集☆22,164Mar 8, 2026Updated 2 weeks ago
- Image augmentation for machine learning experiments.☆14,733Jul 30, 2024Updated last year
- The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.☆6,315Aug 17, 2025Updated 7 months ago
- Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)☆9,430Feb 20, 2026Updated last month
- A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch☆8,936Updated this week
- YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite☆57,054Updated this week
- 🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.☆33,085Updated this week
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities☆22,046Jan 23, 2026Updated 2 months ago
- 🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.…☆12,170Mar 16, 2026Updated last week
- RepVGG: Making VGG-style ConvNets Great Again☆3,464Feb 10, 2023Updated 3 years ago
- Visualizer for neural network, deep learning and machine learning models☆32,592Mar 16, 2026Updated last week
- PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722☆5,120Feb 3, 2026Updated last month
- OpenMMLab Computer Vision Foundation☆6,415Jan 29, 2026Updated last month
- YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documenta…☆10,376Jun 8, 2025Updated 9 months ago
- A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.☆23,808Sep 1, 2025Updated 6 months ago
- Object detection, 3D detection, and pose estimation using center point detection:☆7,546Mar 2, 2023Updated 3 years ago
- Pretrained EfficientNet, EfficientNet-Lite, MixNet, MobileNetV3 / V2, MNASNet A1 and B1, FBNet, Single-Path NAS☆1,584Jun 13, 2024Updated last year