pytorch / serveLinks

Serve, optimize and scale PyTorch models in production

☆4,353

Alternatives and similar repositories for serve

Users that are interested in serve are comparing it to the libraries listed below

Sorting:

triton-inference-server / server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
☆10,060Updated this week
facebookresearch / fairscale
PyTorch extensions for high performance and large scale training.
☆3,387Updated 7 months ago
pytorch / TensorRT
PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT
☆2,888Updated last week
pytorch / xla
Enabling PyTorch on XLA Devices (e.g. Google TPU)
☆2,704Updated last week
determined-ai / determined
Determined is an open-source machine learning platform that simplifies distributed training, hyperparameter tuning, experiment tracking, …
☆3,198Updated 8 months ago
ELS-RD / transformer-deploy
Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀
☆1,688Updated last year
pytorch / ignite
High-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently.
☆4,714Updated last week
Lightning-AI / torchmetrics
Machine learning metrics for distributed, scalable PyTorch applications.
☆2,364Updated 2 weeks ago
bentoml / BentoML
The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!
☆8,249Updated last week
huggingface / optimum
🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization…
☆3,188Updated last week
kserve / kserve
Standardized Distributed Generative and Predictive AI Inference Platform for Scalable, Multi-Framework Deployment on Kubernetes
☆4,811Updated this week
SeldonIO / seldon-core
An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models
☆4,675Updated last week
clearml / clearml
ClearML - Auto-Magical CI/CD to streamline your AI workload. Experiment Management, Data Management, Pipeline, Orchestration, Scheduling …
☆6,375Updated this week
huggingface / accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (i…
☆9,307Updated this week
microsoft / hummingbird
Hummingbird compiles trained ML models into tensor computation for faster inference.
☆3,499Updated 4 months ago
onnx / tutorials
Tutorials for creating and using ONNX models
☆3,628Updated last year
uber / petastorm
Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet f…
☆1,868Updated 3 weeks ago
Lightning-Universe / lightning-bolts
Toolbox of models, callbacks, and datasets for AI/ML researchers.
☆1,748Updated 3 weeks ago
ShannonAI / service-streamer
Boosting your Web Services of Deep Learning Applications.
☆1,244Updated 4 years ago
meta-pytorch / captum
Model interpretability and understanding for PyTorch
☆5,468Updated last week
polyaxon / polyaxon
MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle
☆3,684Updated 2 weeks ago
webdataset / webdataset
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
☆2,885Updated 5 months ago
onnx / tensorflow-onnx
Convert TensorFlow, Keras, Tensorflow.js and Tflite models to ONNX
☆2,495Updated 2 months ago
triton-inference-server / pytriton
PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments.
☆827Updated 3 months ago
rapidsai / cuml
cuML - RAPIDS Machine Learning Library
☆5,020Updated this week
NVIDIA / apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
☆8,854Updated last week
NVIDIA / DALI
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep lear…
☆5,562Updated this week
NVIDIA / FasterTransformer
Transformer related optimization, including BERT, GPT
☆6,354Updated last year
NVIDIA-AI-IOT / torch2trt
An easy to use PyTorch to TensorRT converter
☆4,833Updated last year
catalyst-team / catalyst
Accelerated deep learning R&D
☆3,364Updated 5 months ago