1duo / awesome-ai-infrastructures
Infrastructures™ for Machine Learning Training/Inference in Production.
☆381 · Updated 5 years ago
Related projects:
- A curated list of awesome Distributed Deep Learning resources. ☆393 · Updated last month
- PyTorch elastic training ☆729 · Updated 2 years ago
- For recording and retrieving metadata associated with ML developer and data scientist workflows. ☆613 · Updated 3 weeks ago
- Systems for ML/AI & ML/AI for Systems paper reading list: A curated reading list of computer science research for work at the intersectio… ☆268 · Updated 5 years ago
- Compare MLOps Platforms. Breakdowns of SageMaker, VertexAI, AzureML, Dataiku, Databricks, h2o, kubeflow, mlflow... ☆379 · Updated last year
- A GPipe implementation in PyTorch ☆801 · Updated last month
- CDF SIG MLOps ☆598 · Updated 2 months ago
- MLModelCI is a complete MLOps platform for managing, converting, profiling, and deploying MLaaS (Machine Learning-as-a-Service), bridging… ☆188 · Updated last year
- Lightweight and Parallel Deep Learning Framework ☆261 · Updated last year
- Coarse-grained lineage and tracing for machine learning pipelines. ☆465 · Updated last year
- Resource-adaptive cluster scheduler for deep learning training. ☆422 · Updated last year
- Kubeflow’s superfood for Data Scientists ☆628 · Updated last year
- A Tool for Automatic Parallelization of Deep Learning Training in Distributed Multi-GPU Environments. ☆130 · Updated 2 years ago
- 🎲 A curated list of MLOps projects, tools and resources ☆185 · Updated 4 months ago
- TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup… ☆324 · Updated last week
- Simple Distributed Deep Learning on TensorFlow ☆134 · Updated last year
- Dive into Deep Learning Compiler ☆640 · Updated 2 years ago
- A curated list of awesome edge machine learning resources, including research papers, inference engines, challenges, books, meetups and o… ☆247 · Updated last year
- Unified specification for defining and executing ML workflows, making reproducibility, consistency, and governance easier across the ML p… ☆87 · Updated 5 months ago
- CMU Lecture: Machine Learning In Production / AI Engineering / Software Engineering for AI-Enabled Systems (SE4AI) ☆375 · Updated last year
- ☆375 · Updated last year
- Deep Learning introduction and its application in various fields ☆172 · Updated 3 years ago
- The Fuzzy Labs guide to the universe of open source MLOps ☆441 · Updated 2 months ago
- Distributed Machine Learning Patterns from Manning Publications by Yuan Tang https://bit.ly/2RKv8Zo ☆377 · Updated last month
- Library for exploring and validating machine learning data ☆758 · Updated last week
- MLOps tutorial using Python, Docker and Kubernetes. ☆360 · Updated last year
- Triton Model Analyzer is a CLI tool to help with better understanding of the compute and memory requirements of the Triton Inference Serv… ☆419 · Updated last week
- Library for faster pinned CPU <-> GPU transfer in PyTorch ☆682 · Updated 4 years ago
- PyTorch on Kubernetes ☆305 · Updated 2 years ago
- A scalable & efficient active learning/data selection system for everyone. ☆212 · Updated 2 months ago