A CLI tool that helps manage training jobs on SageMaker HyperPod clusters orchestrated by Amazon EKS
☆33Updated this week
Alternatives and similar repositories for sagemaker-hyperpod-cli
Users who are interested in sagemaker-hyperpod-cli are comparing it to the libraries listed below
- Create and manage Amazon SageMaker HyperPod clusters, run distributed model training☆24Jan 29, 2026Updated last month
- Deploy and scale distributed python applications on Amazon EKS using Ray☆19Feb 2, 2026Updated 3 weeks ago
- Create an Amazon EKS cluster and run a distributed training example☆29Aug 19, 2024Updated last year
- Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.☆394Feb 21, 2026Updated last week
- ☆83Feb 17, 2026Updated last week
- Create, List, Update, Delete Amazon EKS clusters. Deploy and manage software on EKS. Run distributed model training and inference example…☆64Feb 20, 2026Updated last week
- ☆12Dec 5, 2019Updated 6 years ago
- CNOE CLI☆10May 9, 2024Updated last year
- Easily stand up Keycloak and SPIRE for testing AI Agents☆29Sep 18, 2025Updated 5 months ago
- Repository for the assignments in the Massive Open Online Course "LAFF-On Programming for Correctness"☆12Apr 24, 2018Updated 7 years ago
- Custom docker container for Catboost on Amazon SageMaker☆13Sep 22, 2022Updated 3 years ago
- ☆11Feb 20, 2026Updated last week
- MIC-CIS entry in PharmaCoNER, Bacteria Biotope (BB 2019) & SeeDev 2019 Shared Tasks in EMNLP '19☆11Feb 22, 2020Updated 6 years ago
- Encode and decode pairs of surrogate characters in Python 3☆10Mar 9, 2022Updated 3 years ago
- Research Artifact For Our Submission To VLDB☆10Oct 27, 2021Updated 4 years ago
- ☆15Jul 17, 2024Updated last year
- ☆13Feb 20, 2025Updated last year
- Named entity recognition☆12Dec 21, 2020Updated 5 years ago
- AWS Neuron Deep Learning Containers (DLCs) are a set of Docker images for training and serving models on AWS Trainium and Inferentia inst…☆21Updated this week
- Collection of different types of transformers for learning purposes☆12Jan 30, 2020Updated 6 years ago
- Official repository for ALT (ALignment with Textual feedback).☆10Jul 25, 2024Updated last year
- Openfold inference architecture for Amazon EKS☆11Oct 1, 2024Updated last year
- A Kubernetes controller designed to oversee Persistent Volume Claims (PVCs) associated with local storage on worker nodes. Its purpose is…☆14Nov 10, 2025Updated 3 months ago
- ☆57Feb 5, 2026Updated 3 weeks ago
- ☆12May 30, 2025Updated 9 months ago
- Proceedings of the annual intercalary robot dance party in celebration of workshop on symposium about 2^6th birthdays; in particular, tha…☆13Mar 20, 2021Updated 4 years ago
- ☆12Nov 28, 2024Updated last year
- A Kubernetes Controller that will ensure that the EC2 Source Destination Check (source-dest-check attribute) is disabled on nodes within …☆18Jul 28, 2020Updated 5 years ago
- 📚 Learn ML with clean code, simplified math and illustrative visuals. As you learn, work on interesting projects and share them on https…☆12Apr 6, 2020Updated 5 years ago
- ☆10Feb 11, 2025Updated last year
- Code repository of the NAACL'21 paper "CoRT: Complementary Rankings from Transformers"☆12Jul 7, 2021Updated 4 years ago
- ☆14Dec 20, 2025Updated 2 months ago
- ☆13Dec 12, 2025Updated 2 months ago
- EKS Managed Node Groups with Placement Group☆12Jul 15, 2024Updated last year
- ☆15Jul 25, 2025Updated 7 months ago
- Sketch colorization.☆14Dec 9, 2018Updated 7 years ago
- ☆14Mar 13, 2021Updated 4 years ago
- ☆14Oct 28, 2025Updated 4 months ago
- Implementation of the Gaussian Process Latent Variable Model.☆14Oct 28, 2022Updated 3 years ago