This Guidance demonstrates how to deploy a machine learning inference architecture on Amazon Elastic Kubernetes Service (Amazon EKS). It addresses the basic implementation requirements as well as ways you can pack thousands of unique PyTorch deep learning (DL) models into a scalable architecture and evaluate performance
☆46May 29, 2025Updated last year
Alternatives and similar repositories for guidance-for-machine-learning-inference-on-aws
Users that are interested in guidance-for-machine-learning-inference-on-aws are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Deploy and scale distributed python applications on Amazon EKS using Ray☆20Apr 13, 2026Updated last month
- Create and manage Amazon SageMaker HyperPod clusters, run distributed model training☆24May 21, 2026Updated last week
- AWS DevOps for Docker - a sample project to help you build Docker containers and run them on AWS. In addition to running locally, this p…☆41May 27, 2021Updated 5 years ago
- Openfold inference architecture for Amazon EKS☆11Oct 1, 2024Updated last year
- Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.☆427May 20, 2026Updated last week
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆13May 30, 2025Updated 11 months ago
- A CLI tool that helps manage training jobs on the SageMaker HyperPod clusters orchestrated by Amazon EKS☆37May 11, 2026Updated 2 weeks ago
- demo of keyless signing with the sigstore kubernetes policy controller☆11Sep 7, 2022Updated 3 years ago
- ☆74Jun 26, 2024Updated last year
- Cluster doctor skills☆14May 23, 2026Updated last week
- This Guidance demonstrates how to build an internal Software-as-a-Service (SaaS) platform that provides access to foundation models, like…☆94Oct 20, 2024Updated last year
- This Guidance helps software companies set up a automated system to detect error logs, generate bug fixes, and create pull requests. Help…☆29Oct 20, 2024Updated last year
- This repository contains sample IaC templates to demonstrate how to leverage Codebuild provisioning with AWS Proton.☆30Feb 12, 2026Updated 3 months ago
- Build and run container environment for LFRic☆11Jan 8, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- This Guidance demonstrates how to streamline access to numerous large language models (LLMs) through a unified, industry-standard API gat…☆221Apr 24, 2026Updated last month
- This repository aims to showcase how to finetune a FM model in Amazon EKS cluster using, JupyterHub to provision notebooks and craft both…☆53Jun 17, 2025Updated 11 months ago
- ☆34Dec 20, 2024Updated last year
- ☆12Nov 28, 2024Updated last year
- ☆13Apr 6, 2023Updated 3 years ago
- ☆11Nov 26, 2024Updated last year
- A really simple Spring Boot app, for demos. Displays information about cheese, for some reason.☆10Apr 30, 2026Updated last month
- File Level Backup of apps running on Openshift / Kubernetes☆15Apr 25, 2019Updated 7 years ago
- This GenAI solution enables users to extract insights from diverse data formats (video, audio, PDFs, text) through a unified interface. U…☆20Feb 12, 2026Updated 3 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- OpenShift Pipelines workshop☆26Jan 4, 2023Updated 3 years ago
- A guidance that provides declarative data processing capability, and workflow orchestration automation to help your business users (such …☆30Aug 29, 2025Updated 9 months ago
- Basic Operator Building Tutorial☆11Jun 17, 2020Updated 5 years ago
- MLOps on AWS using Amazon SageMaker Pipelines☆33Apr 13, 2023Updated 3 years ago
- Workshop on building an operator.☆14Jul 27, 2020Updated 5 years ago
- ☆10Nov 2, 2023Updated 2 years ago
- Terraform module to provision AWS Guard Duty☆31May 3, 2026Updated 3 weeks ago
- ☆15Jun 6, 2025Updated 11 months ago
- Galaxy on AWS Guidance provides all the infrastructure components required to run Galaxy in the cloud and are preconfigured with industry…☆19Feb 10, 2025Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Template scripts to setup Docker Images compatible with running on MNP Batch☆15Apr 9, 2019Updated 7 years ago
- Development repository for the yum-epel cookbook☆24May 5, 2026Updated 3 weeks ago
- Azure Authentication Plugin for Vault☆18May 18, 2026Updated last week
- ☆43Nov 28, 2025Updated 6 months ago
- TwinGraph is a Python framework for distributed container orchestration using Kubernetes clusters, Docker Compose/Swarm or cloud resource…☆34Aug 9, 2024Updated last year
- ☆10May 14, 2026Updated 2 weeks ago
- Data validation rules engine app to easily codify corporate data standards☆20Feb 18, 2026Updated 3 months ago