This Guidance demonstrates how to deploy a machine learning inference architecture on Amazon Elastic Kubernetes Service (Amazon EKS). It addresses the basic implementation requirements as well as ways you can pack thousands of unique PyTorch deep learning (DL) models into a scalable architecture and evaluate performance
☆46May 29, 2025Updated 10 months ago
Alternatives and similar repositories for guidance-for-machine-learning-inference-on-aws
Users that are interested in guidance-for-machine-learning-inference-on-aws are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Deploy and scale distributed python applications on Amazon EKS using Ray☆19Updated this week
- Create and manage Amazon SageMaker HyperPod clusters, run distributed model training☆24Jan 29, 2026Updated 2 months ago
- AWS DevOps for Docker - a sample project to help you build Docker containers and run them on AWS. In addition to running locally, this p…☆41May 27, 2021Updated 4 years ago
- Comprehensive, scalable ML inference architecture using Amazon EKS, leveraging Graviton processors for cost-effective CPU-based inference…☆21Mar 12, 2026Updated last month
- Openfold inference architecture for Amazon EKS☆11Oct 1, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Create an Amazon EKS cluster and run a distributed training example☆29Aug 19, 2024Updated last year
- Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.☆407Updated this week
- ☆60Feb 5, 2026Updated 2 months ago
- A do-framework project to simplify deployment of Kubeflow on Amazon EKS☆22Feb 18, 2025Updated last year
- ☆12May 30, 2025Updated 10 months ago
- Create, List, Update, Delete Amazon EKS clusters. Deploy and manage software on EKS. Run distributed model training and inference example…☆66Mar 30, 2026Updated 2 weeks ago
- A CLI tool that helps manage training jobs on the SageMaker HyperPod clusters orchestrated by Amazon EKS☆35Apr 9, 2026Updated last week
- Terraform module to configure Datadog AWS integration☆37Oct 1, 2025Updated 6 months ago
- demo of keyless signing with the sigstore kubernetes policy controller☆11Sep 7, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆18Nov 13, 2023Updated 2 years ago
- Migrating to Karpenter☆34Mar 25, 2024Updated 2 years ago
- ☆74Jun 26, 2024Updated last year
- This Guidance demonstrates how to build an internal Software-as-a-Service (SaaS) platform that provides access to foundation models, like…☆92Oct 20, 2024Updated last year
- This Guidance helps software companies set up a automated system to detect error logs, generate bug fixes, and create pull requests. Help…☆29Oct 20, 2024Updated last year
- This Guidance demonstrates how to streamline access to numerous large language models (LLMs) through a unified, industry-standard API gat…☆214Apr 3, 2026Updated 2 weeks ago
- This repository aims to showcase how to finetune a FM model in Amazon EKS cluster using, JupyterHub to provision notebooks and craft both…☆52Jun 17, 2025Updated 10 months ago
- Tutorials and labs focused on educating users☆11Sep 6, 2023Updated 2 years ago
- ☆13Apr 6, 2023Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆12Nov 28, 2024Updated last year
- GitHub Actions, Flux, and Amazon EKS ✨☆24Jul 8, 2022Updated 3 years ago
- ☆11Nov 26, 2024Updated last year
- A really simple Spring Boot app, for demos. Displays information about cheese, for some reason.☆10Mar 31, 2026Updated 2 weeks ago
- File Level Backup of apps running on Openshift / Kubernetes☆15Apr 25, 2019Updated 6 years ago
- This GenAI solution enables users to extract insights from diverse data formats (video, audio, PDFs, text) through a unified interface. U…☆17Feb 12, 2026Updated 2 months ago
- OpenShift Pipelines workshop☆26Jan 4, 2023Updated 3 years ago
- A guidance that provides declarative data processing capability, and workflow orchestration automation to help your business users (such …☆30Aug 29, 2025Updated 7 months ago
- ☆35Apr 9, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- MLOps on AWS using Amazon SageMaker Pipelines☆33Apr 13, 2023Updated 3 years ago
- Workshop on building an operator.☆14Jul 27, 2020Updated 5 years ago
- Repository with the code for running deep learning inference benchmarks on different AWS instances and service types.☆14Nov 19, 2024Updated last year
- Galaxy on AWS Guidance provides all the infrastructure components required to run Galaxy in the cloud and are preconfigured with industry…☆18Feb 10, 2025Updated last year
- ☆15Jun 6, 2025Updated 10 months ago
- Azure Authentication Plugin for Vault☆18Apr 1, 2026Updated 2 weeks ago
- ☆41Nov 28, 2025Updated 4 months ago