huggingface / pytorch-image-modelsLinks
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
☆35,184Updated last week
Alternatives and similar repositories for pytorch-image-models
Users that are interested in pytorch-image-models are comparing it to the libraries listed below
Sorting:
- ☆11,743Updated 6 months ago
- This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".☆15,155Updated last year
- Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Py…☆23,866Updated 3 weeks ago
- End-to-End Object Detection with Transformers☆14,670Updated last year
- Datasets, Transforms and Models specific to Computer Vision☆17,122Updated this week
- Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125☆15,137Updated 2 months ago
- A PyTorch implementation of EfficientNet☆8,156Updated 3 years ago
- Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.☆10,856Updated last week
- A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch☆8,795Updated last week
- Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.☆32,701Updated 2 weeks ago
- Code release for ConvNeXt model☆6,119Updated 2 years ago
- Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, I…☆12,145Updated 5 months ago
- OpenMMLab Computer Vision Foundation☆6,242Updated 4 months ago
- Official DeiT repository☆4,259Updated last year
- PyTorch implementation of MAE https//arxiv.org/abs/2111.06377☆7,979Updated last year
- PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO☆7,148Updated last year
- 🐍 Geometric Computer Vision Library for Spatial AI☆10,732Updated this week
- Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)☆3,527Updated 8 months ago
- PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722☆5,055Updated 2 months ago
- Pretrained ConvNets for pytorch: NASNet, ResNeXt, ResNet, InceptionV4, InceptionResnetV2, Xception, DPN, etc.☆9,098Updated 3 years ago
- OpenMMLab Semantic Segmentation Toolbox and Benchmark.☆9,215Updated last year
- PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.☆7,097Updated 9 months ago
- Efficient AI Backbones including GhostNet, TNT and MLP, developed by Huawei Noah's Ark Lab.☆4,300Updated 5 months ago
- CVPR 2025 论文和 开源项目合集☆20,839Updated 2 months ago
- Count the MACs / FLOPs of your PyTorch model.☆5,046Updated last year
- OpenMMLab Detection Toolbox and Benchmark☆31,651Updated last year
- 🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (i…☆9,116Updated this week
- The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.☆6,222Updated 3 weeks ago
- Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.☆30,113Updated this week
- A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.☆23,361Updated last week