Create and manage Amazon SageMaker HyperPod clusters, run distributed model training
☆24Jan 29, 2026Updated last month
Alternatives and similar repositories for aws-do-hyperpod
Users that are interested in aws-do-hyperpod are comparing it to the libraries listed below
Sorting:
- A CLI tool that helps manage training jobs on the SageMaker HyperPod clusters orchestrated by Amazon EKS☆33Updated this week
- Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.☆403Mar 13, 2026Updated last week
- Create an Amazon EKS cluster and run a distributed training example☆29Aug 19, 2024Updated last year
- Deploy and scale distributed python applications on Amazon EKS using Ray☆19Updated this week
- This Guidance demonstrates how to deploy a machine learning inference architecture on Amazon Elastic Kubernetes Service (Amazon EKS). It …☆46May 29, 2025Updated 9 months ago
- Do Framework Definition☆17Sep 13, 2024Updated last year
- AWS DevOps for Docker - a sample project to help you build Docker containers and run them on AWS. In addition to running locally, this p…☆41May 27, 2021Updated 4 years ago
- Openfold inference architecture for Amazon EKS☆11Oct 1, 2024Updated last year
- Create, List, Update, Delete Amazon EKS clusters. Deploy and manage software on EKS. Run distributed model training and inference example…☆64Mar 14, 2026Updated last week
- A do-framework project to simplify deployment of Kubeflow on Amazon EKS☆22Feb 18, 2025Updated last year
- Tutorials and labs focused on educating users☆11Sep 6, 2023Updated 2 years ago
- ☆13Apr 6, 2023Updated 2 years ago
- Comprehensive, scalable ML inference architecture using Amazon EKS, leveraging Graviton processors for cost-effective CPU-based inference…☆21Mar 12, 2026Updated last week
- TP-PARSEC: A Task Parallel PARSEC Benchmark Suite☆11Nov 1, 2020Updated 5 years ago
- Basic Operator Building Tutorial☆11Jun 17, 2020Updated 5 years ago
- Workshop on building an operator.☆14Jul 27, 2020Updated 5 years ago
- ☆58Feb 5, 2026Updated last month
- AWS KMS External Keystore (XKS) Proxy API specification☆26Mar 10, 2026Updated last week
- TwinGraph is a Python framework for distributed container orchestration using Kubernetes clusters, Docker Compose/Swarm or cloud resource…☆34Aug 9, 2024Updated last year
- ☆15Sep 28, 2020Updated 5 years ago
- Run MPI programs over tmux☆15Mar 14, 2024Updated 2 years ago
- Custom::LexBot | AWS CloudFormation Custom Lambda Resource | Lex Bot☆10Jan 13, 2021Updated 5 years ago
- ☆13May 7, 2021Updated 4 years ago
- Making the transition from Scratch to Python☆11Apr 11, 2017Updated 8 years ago
- Experiment management with Hydra and MLflow☆13Nov 20, 2020Updated 5 years ago
- This open-source project delivers a complete pipeline for converting multi-page documents (PDFs/images) into structured JSON using Vision…☆15Aug 4, 2025Updated 7 months ago
- "유닉스 리눅스 셸 스크립트 예제 사전: Unix & Linux Shell Script Exercise Dictionary" - 한빛미디어☆10Jan 17, 2017Updated 9 years ago
- Heterogeneous Active Messages C++ library☆21Nov 8, 2019Updated 6 years ago
- ☆11Mar 16, 2021Updated 5 years ago
- These scripts clean the unused EBS volumes, AMIs and snapshots on Amazon Web Services.☆11Jul 24, 2015Updated 10 years ago
- Transfer code to LaTeX with your colorscheme in Vim☆19Jun 2, 2019Updated 6 years ago
- ☆86Mar 5, 2026Updated 2 weeks ago
- Typescript overlay to easily interact with DynamoDB. Fluid syntax library offering pagination, auto-escape of reserved words and many mor…☆11Jul 15, 2024Updated last year
- A vision-language model with an improved cross-attention mechanism for scalable streaming inference☆28Mar 9, 2026Updated last week
- ☆12May 30, 2025Updated 9 months ago
- An example project showing how to enable tiered compilation on a Java AWS Lambda function.☆21Mar 16, 2022Updated 4 years ago
- Utility which provides a UI to do prompt engineering within SageMaker Studio.☆14Jul 5, 2023Updated 2 years ago
- CSI Driver of Amazon FSx for Lustre https://aws.amazon.com/fsx/lustre/☆141Mar 12, 2026Updated last week
- AWS Neuron Deep Learning Containers (DLCs) are a set of Docker images for training and serving models on AWS Trainium and Inferentia inst…☆21Feb 27, 2026Updated 3 weeks ago