This Guidance demonstrates how to deploy a machine learning inference architecture on Amazon Elastic Kubernetes Service (Amazon EKS). It addresses the basic implementation requirements as well as ways you can pack thousands of unique PyTorch deep learning (DL) models into a scalable architecture and evaluate performance
☆45May 29, 2025Updated last year
Alternatives and similar repositories for guidance-for-machine-learning-inference-on-aws
Users that are interested in guidance-for-machine-learning-inference-on-aws are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Deploy and scale distributed python applications on Amazon EKS using Ray☆20Apr 13, 2026Updated 2 months ago
- AWS DevOps for Docker - a sample project to help you build Docker containers and run them on AWS. In addition to running locally, this p…☆41May 27, 2021Updated 5 years ago
- Openfold inference architecture for Amazon EKS☆11Oct 1, 2024Updated last year
- Create an Amazon EKS cluster and run a distributed training example☆29Aug 19, 2024Updated last year
- Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.☆438Updated this week
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆60Feb 5, 2026Updated 4 months ago
- A do-framework project to simplify deployment of Kubeflow on Amazon EKS☆22Feb 18, 2025Updated last year
- ☆13May 30, 2025Updated last year
- A CLI tool that helps manage training jobs on the SageMaker HyperPod clusters orchestrated by Amazon EKS☆39Jun 11, 2026Updated last week
- Terraform module to configure Datadog AWS integration☆37Oct 1, 2025Updated 8 months ago
- ☆18Nov 13, 2023Updated 2 years ago
- Migrating to Karpenter☆34Mar 25, 2024Updated 2 years ago
- ☆74Jun 26, 2024Updated last year
- Cluster doctor skills☆14May 23, 2026Updated 3 weeks ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This repository contains sample IaC templates to demonstrate how to leverage Codebuild provisioning with AWS Proton.☆30Feb 12, 2026Updated 4 months ago
- Build and run container environment for LFRic☆11Jan 8, 2024Updated 2 years ago
- This repository aims to showcase how to finetune a FM model in Amazon EKS cluster using, JupyterHub to provision notebooks and craft both…☆53Jun 17, 2025Updated last year
- ☆34Dec 20, 2024Updated last year
- Tutorials and labs focused on educating users☆11Sep 6, 2023Updated 2 years ago
- ☆12Nov 28, 2024Updated last year
- ☆11Jun 8, 2026Updated last week
- "Docs 2.1" docs-as-code boilerplate☆18Mar 9, 2023Updated 3 years ago
- A really simple Spring Boot app, for demos. Displays information about cheese, for some reason.☆10Updated this week
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Quickly open links from from kubernetes resources using jsonpath templates.☆14Feb 26, 2026Updated 3 months ago
- File Level Backup of apps running on Openshift / Kubernetes☆15Apr 25, 2019Updated 7 years ago
- This GenAI solution enables users to extract insights from diverse data formats (video, audio, PDFs, text) through a unified interface. U…☆21Feb 12, 2026Updated 4 months ago
- OpenShift Pipelines workshop☆26Jan 4, 2023Updated 3 years ago
- A guidance that provides declarative data processing capability, and workflow orchestration automation to help your business users (such …☆30Aug 29, 2025Updated 9 months ago
- Basic Operator Building Tutorial☆11Jun 17, 2020Updated 6 years ago
- ☆35Apr 9, 2023Updated 3 years ago
- MLOps on AWS using Amazon SageMaker Pipelines☆33Apr 13, 2023Updated 3 years ago
- ☆10Nov 2, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Terraform module to provision AWS Guard Duty☆31May 3, 2026Updated last month
- Galaxy on AWS Guidance provides all the infrastructure components required to run Galaxy in the cloud and are preconfigured with industry…☆19Feb 10, 2025Updated last year
- Template scripts to setup Docker Images compatible with running on MNP Batch☆15Apr 9, 2019Updated 7 years ago
- Development repository for the yum-epel cookbook☆24Jun 10, 2026Updated last week
- Azure Authentication Plugin for Vault☆18May 18, 2026Updated last month
- ☆43Nov 28, 2025Updated 6 months ago
- TwinGraph is a Python framework for distributed container orchestration using Kubernetes clusters, Docker Compose/Swarm or cloud resource…☆35Aug 9, 2024Updated last year