The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
☆36,397Feb 23, 2026Updated last week
Alternatives and similar repositories for pytorch-image-models
Users that are interested in pytorch-image-models are comparing it to the libraries listed below
Sorting:
- OpenMMLab Detection Toolbox and Benchmark☆32,418Aug 21, 2024Updated last year
- This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".☆15,716Jul 24, 2024Updated last year
- Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125☆15,268Jun 25, 2025Updated 8 months ago
- Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.☆34,126Nov 17, 2025Updated 3 months ago
- A PyTorch implementation of EfficientNet☆8,221Apr 8, 2022Updated 3 years ago
- End-to-End Object Detection with Transformers☆15,124Mar 12, 2024Updated last year
- Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.☆30,884Updated this week
- ☆12,318Jan 30, 2026Updated last month
- Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.☆11,347Dec 23, 2025Updated 2 months ago
- Official DeiT repository☆4,325Mar 15, 2024Updated last year
- CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image☆32,642Feb 18, 2026Updated last week
- Code release for ConvNeXt model☆6,300Jan 8, 2023Updated 3 years ago
- 🐍 Geometric Computer Vision Library for Spatial AI☆11,093Feb 23, 2026Updated last week
- Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, I…☆12,643Apr 7, 2025Updated 10 months ago
- Pretrained ConvNets for pytorch: NASNet, ResNeXt, ResNet, InceptionV4, InceptionResnetV2, Xception, DPN, etc.☆9,114Apr 22, 2022Updated 3 years ago
- Datasets, Transforms and Models specific to Computer Vision☆17,527Updated this week
- An open source implementation of CLIP.☆13,430Updated this week
- 🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal model…☆157,071Updated this week
- PyTorch implementation of MAE https//arxiv.org/abs/2111.06377☆8,230Jul 23, 2024Updated last year
- The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoi…☆53,497Sep 18, 2024Updated last year
- OpenMMLab Semantic Segmentation Toolbox and Benchmark.☆9,626Aug 13, 2024Updated last year
- PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO☆7,459Jul 3, 2024Updated last year
- Image augmentation for machine learning experiments.☆14,731Jul 30, 2024Updated last year
- Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)☆9,401Feb 20, 2026Updated last week
- A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch☆8,926Feb 24, 2026Updated last week
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities☆22,033Jan 23, 2026Updated last month
- 🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.☆32,873Updated this week
- YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite☆56,881Feb 20, 2026Updated last week
- The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.☆6,309Aug 17, 2025Updated 6 months ago
- CVPR 2026 论文和开源项目合集☆21,890Updated this week
- Visualizer for neural network, deep learning and machine learning models☆32,465Feb 24, 2026Updated last week
- OpenMMLab Computer Vision Foundation☆6,410Jan 29, 2026Updated last month
- A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.☆23,765Sep 1, 2025Updated 6 months ago
- Google Research☆37,367Updated this week
- YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documenta…☆10,341Jun 8, 2025Updated 8 months ago
- Object detection, 3D detection, and pose estimation using center point detection:☆7,544Mar 2, 2023Updated 3 years ago
- RepVGG: Making VGG-style ConvNets Great Again☆3,458Feb 10, 2023Updated 3 years ago
- PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722☆5,120Feb 3, 2026Updated 3 weeks ago
- PyTorch code and models for the DINOv2 self-supervised learning method.☆12,427Feb 24, 2026Updated last week