1duo / awesome-ai-infrastructures
Infrastructures™ for Machine Learning Training/Inference in Production.
☆427 Updated 6 years ago
Alternatives and similar repositories for awesome-ai-infrastructures
Users who are interested in awesome-ai-infrastructures are comparing it to the libraries listed below.
- A curated list of awesome Distributed Deep Learning resources. ☆430 Updated last year
- Systems for ML/AI & ML/AI for Systems paper reading list: A curated reading list of computer science research for work at the intersectio… ☆280 Updated 4 months ago
- PyTorch elastic training ☆730 Updated 3 years ago
- MLModelCI is a complete MLOps platform for managing, converting, profiling, and deploying MLaaS (Machine Learning-as-a-Service), bridging… ☆195 Updated 2 years ago
- Deep Learning introduction and its application in various fields ☆173 Updated 4 years ago
- Resource-adaptive cluster scheduler for deep learning training. ☆447 Updated 2 years ago
- CDF SIG MLOps ☆628 Updated 10 months ago
- For recording and retrieving metadata associated with ML developer and data scientist workflows. ☆661 Updated 6 months ago
- Dive into Deep Learning Compiler ☆648 Updated 3 years ago
- Distributed Machine Learning Patterns from Manning Publications by Yuan Tang https://bit.ly/2RKv8Zo ☆465 Updated 2 weeks ago
- Unified specification for defining and executing ML workflows, making reproducibility, consistency, and governance easier across the ML p… ☆94 Updated last year
- Triton Model Analyzer is a CLI tool to help with better understanding of the compute and memory requirements of the Triton Inference Serv… ☆491 Updated last month
- CMU Lecture: Machine Learning In Production / AI Engineering / Software Engineering for AI-Enabled Systems (SE4AI) ☆434 Updated 2 years ago
- Compare MLOps Platforms. Breakdowns of SageMaker, VertexAI, AzureML, Dataiku, Databricks, h2o, kubeflow, mlflow... ☆392 Updated 2 years ago
- A curated list of awesome edge machine learning resources, including research papers, inference engines, challenges, books, meetups and o… ☆267 Updated 2 years ago
- Simple Distributed Deep Learning on TensorFlow ☆134 Updated 3 months ago
- CS294; AI For Systems and Systems For AI ☆225 Updated 6 years ago
- A GPU performance profiling tool for PyTorch models ☆506 Updated 4 years ago
- MLOps tutorial using Python, Docker and Kubernetes. ☆400 Updated 11 months ago
- A GPipe implementation in PyTorch ☆855 Updated last year
- Awesome machine learning model compression research papers, quantization, tools, and learning material. ☆539 Updated last year
- Building Machine Learning Infrastructure! ☆45 Updated 6 years ago
- NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale da… ☆1,100 Updated 3 weeks ago
- ☆393 Updated 2 years ago
- A uniform interface to run deep learning models from multiple frameworks ☆940 Updated last year
- Triton backend that enables pre-process, post-processing and other logic to be implemented in Python. ☆644 Updated 3 weeks ago
- A Tool for Automatic Parallelization of Deep Learning Training in Distributed Multi-GPU Environments. ☆132 Updated 3 years ago
- Lightweight and Parallel Deep Learning Framework ☆264 Updated 2 years ago
- Train and run PyTorch models on Apache Spark. ☆341 Updated 2 years ago
- HugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training ☆1,035 Updated 3 weeks ago