aws-samples / aws-do-hyperpodLinks
Create and manage Amazon SageMaker HyperPod clusters, run distributed model training
☆21Updated 2 weeks ago
Alternatives and similar repositories for aws-do-hyperpod
Users that are interested in aws-do-hyperpod are comparing it to the libraries listed below
Sorting:
- ☆47Updated last month
- Create, List, Update, Delete Amazon EKS clusters. Deploy and manage software on EKS. Run distributed model training and inference example…☆59Updated last week
- This Guidance demonstrates how to deploy a machine learning inference architecture on Amazon Elastic Kubernetes Service (Amazon EKS). It …☆43Updated last week
- Run FMBench simultaneously across multiple Amazon EC2 machines to benchmark an FM across multiple serving stacks simultaneously☆14Updated last month
- ☆22Updated 3 weeks ago
- ☆24Updated this week
- EFA/NCCL base AMI build Packer and CodeBuild/Pipeline files. Also base Docker build files to enable EFA/NCCL in containers☆43Updated last year
- ☆10Updated last week
- ☆12Updated 7 months ago
- Create an Amazon EKS cluster and run a distributed training example☆28Updated 9 months ago
- ☆26Updated last week
- This Guidance shows how to build an Amazon Elastic Compute Cloud (Amazon EC2) Spot placement score tracker to monitor unused Amazon EC2 S…☆40Updated 7 months ago
- The repository includes integrations with Amazon Bedrock and its included LLM, such as Amazon Titan and vector and graph database for a R…☆8Updated 6 months ago
- This repository aims to showcase how to finetune a FM model in Amazon EKS cluster using, JupyterHub to provision notebooks and craft both…☆45Updated 3 weeks ago
- ☆45Updated 3 months ago
- MLOps Pipeline Using SageMaker & CDK, where models are from SageMaker built-in algorithms.☆27Updated 2 months ago
- FM-Leaderboard-er allows you to create leaderboard to find the best LLM/prompt for your own business use case based on your data, task, p…☆18Updated 7 months ago
- ACK service controller for Amazon SageMaker☆47Updated last week
- ☆28Updated last year
- ☆102Updated this week
- CDK construct for installing and configuring Karpenter on EKS clusters☆43Updated this week
- CloudFormation to setup Kubeflow and Sagemaker Operators on EKS☆25Updated 2 years ago
- ☆24Updated last year
- ☆28Updated last year
- Elevate Your DevOps Pipeline with Generative AI☆22Updated 3 months ago
- ☆11Updated last year
- Kubeflow workshop on EKS. Mainly focus on AWS integration examples. Please go check kubeflow website http://kubeflow.org for other exampl…☆98Updated 4 years ago
- AIOps modules is a collection of reusable Infrastructure as Code (IaC) modules for Machine Learning (ML), Foundation Models (FM), Large L…☆76Updated last week
- Discover how to build agents that can perform actions on websites by combining Amazon Nova Act with Model Context Protocol (MCP).☆43Updated 2 weeks ago
- ☆37Updated 4 months ago