1duo / awesome-ai-infrastructuresLinks
Infrastructures™ for Machine Learning Training/Inference in Production.
☆439Updated 6 years ago
Alternatives and similar repositories for awesome-ai-infrastructures
Users that are interested in awesome-ai-infrastructures are comparing it to the libraries listed below
Sorting:
- A curated list of awesome Distributed Deep Learning resources.☆438Updated last year
- Systems for ML/AI & ML/AI for Systems paper reading list: A curated reading list of computer science research for work at the intersectio…☆282Updated 7 months ago
- MLModelCI is a complete MLOps platform for managing, converting, profiling, and deploying MLaaS (Machine Learning-as-a-Service), bridging…☆197Updated 2 years ago
- PyTorch elastic training☆728Updated 3 years ago
- Distributed Machine Learning Patterns from Manning Publications by Yuan Tang https://bit.ly/2RKv8Zo☆488Updated 3 weeks ago
- Resource-adaptive cluster scheduler for deep learning training.☆451Updated 2 years ago
- Dive into Deep Learning Compiler☆645Updated 3 years ago
- Deep Learning introduction and its application in various fields☆175Updated 5 years ago
- A curated list of awesome edge machine learning resources, including research papers, inference engines, challenges, books, meetups and o…☆270Updated 2 years ago
- CDF SIG MLOps☆632Updated last year
- For recording and retrieving metadata associated with ML developer and data scientist workflows.☆670Updated last week
- ☆391Updated 3 years ago
- Compare MLOps Platforms. Breakdowns of SageMaker, VertexAI, AzureML, Dataiku, Databricks, h2o, kubeflow, mlflow...☆394Updated 3 years ago
- CS294; AI For Systems and Systems For AI☆227Updated 6 years ago
- A GPipe implementation in PyTorch☆862Updated last year
- Triton Model Analyzer is a CLI tool to help with better understanding of the compute and memory requirements of the Triton Inference Serv…☆503Updated 2 weeks ago
- Simple Distributed Deep Learning on TensorFlow☆134Updated 7 months ago
- MLCube® is a project that reduces friction for machine learning by ensuring that models are easily portable and reproducible.☆158Updated 2 months ago
- This is suite of the hands-on training materials that shows how to scale CV, NLP, time-series forecasting workloads with Ray.☆451Updated last year
- HugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training☆1,041Updated 4 months ago
- A GPU performance profiling tool for PyTorch models☆509Updated 4 years ago
- A uniform interface to run deep learning models from multiple frameworks☆941Updated 2 years ago
- Unified specification for defining and executing ML workflows, making reproducibility, consistency, and governance easier across the ML p…☆94Updated last year
- PyTorch on Kubernetes☆309Updated 4 years ago
- NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale da…☆1,135Updated 3 months ago
- Curating a list of AutoML-related research, tools, projects and other resources☆910Updated 5 months ago
- CMU Lecture: Machine Learning In Production / AI Engineering / Software Engineering for AI-Enabled Systems (SE4AI)☆441Updated 2 years ago
- Bagua Speeds up PyTorch☆884Updated last year
- Lightweight and Parallel Deep Learning Framework☆263Updated 3 years ago
- MLOps tutorial using Python, Docker and Kubernetes.☆407Updated last year