1duo / awesome-ai-infrastructures
Infrastructures™ for Machine Learning Training/Inference in Production.
☆385Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for awesome-ai-infrastructures
- A curated list of awesome Distributed Deep Learning resources.☆405Updated 3 months ago
- PyTorch elastic training☆730Updated 2 years ago
- Systems for ML/AI & ML/AI for Systems paper reading list: A curated reading list of computer science research for work at the intersectio…☆269Updated 5 years ago
- A curated list of awesome edge machine learning resources, including research papers, inference engines, challenges, books, meetups and o…☆252Updated last year
- Awesome machine learning model compression research papers, quantization, tools, and learning material.☆491Updated 2 months ago
- Compare MLOps Platforms. Breakdowns of SageMaker, VertexAI, AzureML, Dataiku, Databricks, h2o, kubeflow, mlflow...☆383Updated 2 years ago
- CMU Lecture: Machine Learning In Production / AI Engineering / Software Engineering for AI-Enabled Systems (SE4AI)☆384Updated last year
- MLModelCI is a complete MLOps platform for managing, converting, profiling, and deploying MLaaS (Machine Learning-as-a-Service), bridging…☆191Updated last year
- A Tool for Automatic Parallelization of Deep Learning Training in Distributed Multi-GPU Environments.☆130Updated 2 years ago
- For recording and retrieving metadata associated with ML developer and data scientist workflows.☆626Updated 3 weeks ago
- Deep Learning introduction and its application in various fields☆173Updated 3 years ago
- A curated list of articles that cover the software engineering best practices for building machine learning applications.☆1,242Updated 7 months ago
- Dive into Deep Learning Compiler☆643Updated 2 years ago
- PyTorch on Kubernetes☆307Updated 2 years ago
- A GPipe implementation in PyTorch☆818Updated 3 months ago
- MLOps tutorial using Python, Docker and Kubernetes.☆368Updated last month
- Library for faster pinned CPU <-> GPU transfer in Pytorch☆683Updated 4 years ago
- CDF SIG MLOps☆604Updated this week
- Fabric for Deep Learning (FfDL, pronounced fiddle) is a Deep Learning Platform offering TensorFlow, Caffe, PyTorch etc. as a Service on K…☆692Updated last year
- Kubeflow’s superfood for Data Scientists☆632Updated last year
- Lightweight and Parallel Deep Learning Framework☆263Updated last year
- 🍫 Example code for a basic ML Platform based on Pulumi, FastAPI, DVC, MLFlow and more☆434Updated 3 years ago
- Simple Distributed Deep Learning on TensorFlow☆134Updated 2 years ago
- Triton Model Analyzer is a CLI tool to help with better understanding of the compute and memory requirements of the Triton Inference Serv…☆433Updated last week
- Resource-adaptive cluster scheduler for deep learning training.☆426Updated last year
- TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup…☆332Updated this week
- A deep ranking personalization framework☆132Updated last year
- A GPU performance profiling tool for PyTorch models☆495Updated 3 years ago
- Embedded and mobile deep learning research resources☆741Updated last year
- Open Source ML Model Versioning, Metadata, and Experiment Management☆1,702Updated 3 months ago