aws-samples / aws-do-hyperpod
Create and manage Amazon SageMaker HyperPod clusters, run distributed model training
☆16Updated this week
Alternatives and similar repositories for aws-do-hyperpod:
Users that are interested in aws-do-hyperpod are comparing it to the libraries listed below
- Create, List, Update, Delete Amazon EKS clusters. Deploy and manage software on EKS. Run distributed model training and inference example…☆52Updated this week
- This Guidance demonstrates how to deploy a machine learning inference architecture on Amazon Elastic Kubernetes Service (Amazon EKS). It …☆41Updated this week
- ☆38Updated this week
- ☆20Updated last week
- CDK construct for installing and configuring Karpenter on EKS clusters☆41Updated 2 weeks ago
- EFA/NCCL base AMI build Packer and CodeBuild/Pipeline files. Also base Docker build files to enable EFA/NCCL in containers☆41Updated last year
- ☆28Updated 10 months ago
- AIOps modules is a collection of reusable Infrastructure as Code (IaC) modules for Machine Learning (ML), Foundation Models (FM), Large L…☆65Updated this week
- This repository aims to showcase how to finetune a FM model in Amazon EKS cluster using, JupyterHub to provision notebooks and craft both…☆43Updated 7 months ago
- ☆18Updated last month
- Create an Amazon EKS cluster and run a distributed training example☆28Updated 5 months ago
- This Guidance shows how to build an Amazon Elastic Compute Cloud (Amazon EC2) Spot placement score tracker to monitor unused Amazon EC2 S…☆38Updated 3 months ago
- The repository includes integrations with Amazon Bedrock and its included LLM, such as Amazon Titan and vector and graph database for a R…☆8Updated 2 months ago
- ☆26Updated last year
- ACK service controller for Amazon SageMaker☆41Updated last week
- ☆34Updated last week
- ☆17Updated 2 weeks ago
- Seed-Farmer is an orchestration tool that works with AWS CodeSeeder and acts as an orchestration tool modeled after GitOps deployments. I…☆53Updated this week
- ☆43Updated 8 months ago
- ☆14Updated last year
- Research and Engineering Studio (RES) is an AWS supported open source product that enables IT administrators to provide an easy-to-use we…☆88Updated last month
- Content repository for Community.aws☆48Updated 2 months ago
- ☆30Updated last week
- This is a Red Hat Enterprise Linux specific forked version of the official awslabs amazon-eks-ami repository.☆15Updated 2 weeks ago
- FM-Leaderboard-er allows you to create leaderboard to find the best LLM/prompt for your own business use case based on your data, task, p…☆18Updated 2 months ago
- ☆28Updated 9 months ago
- ☆15Updated last year
- Deploying EFA in EKS utilizing GPUDirectRDMA where supported☆37Updated 3 months ago
- The Alarm Context Tool (ACT) enhances AWS CloudWatch Alarms by providing additional context to aid in troubleshooting and analysis.☆33Updated 7 months ago
- A Data Platform built for AWS, powered by Kubernetes.☆127Updated last year