aws-samples / aws-efa-eks
Deploying EFA in EKS utilizing GPUDirectRDMA where supported
☆37Updated 5 months ago
Alternatives and similar repositories for aws-efa-eks:
Users that are interested in aws-efa-eks are comparing it to the libraries listed below
- CSI Driver of Amazon FSx for Lustre https://aws.amazon.com/fsx/lustre/☆130Updated 2 months ago
- Deep learning benchmark utility and optimization tips on EKS.☆48Updated 5 years ago
- EFA/NCCL base AMI build Packer and CodeBuild/Pipeline files. Also base Docker build files to enable EFA/NCCL in containers☆42Updated last year
- Create, List, Update, Delete Amazon EKS clusters. Deploy and manage software on EKS. Run distributed model training and inference example…☆54Updated this week
- This Guidance shows how to build an Amazon Elastic Compute Cloud (Amazon EC2) Spot placement score tracker to monitor unused Amazon EC2 S…☆39Updated 5 months ago
- ACK service controller for Amazon SageMaker☆43Updated last week
- ☆35Updated 3 years ago
- ☆60Updated last week
- Running High Performance Computing (HPA) applications on EKS using Elastic Fabric Adapter (EFA).☆8Updated 3 years ago
- This repository contains tooling used to build the EKS Distro, and all the projects contained in https://github.com/aws/eks-distro.☆80Updated this week
- Testing framework for AWS Controllers for Kubernetes (ACK)☆20Updated last week
- CDK construct for installing and configuring Karpenter on EKS clusters☆42Updated 3 weeks ago
- ACK service controller for Amazon Elastic Kubernetes Service (EKS)☆35Updated last month
- This is a plugin which lets EC2 developers use libfabric as network provider while running NCCL applications.☆167Updated last week
- AWS virtual gpu device plugin provides capability to use smaller virtual gpus for your machine learning inference workloads☆204Updated last year
- K8s controller implementing Multi-Cluster Services API based on AWS Cloud Map.☆92Updated 3 months ago
- ☆43Updated last month
- Controller for managing Trunk & Branch Network Interfaces on EKS Cluster using Security Group For Pod feature and IPv4 Addresses for Wind…☆89Updated this week
- ☆73Updated this week
- Create and manage Amazon SageMaker HyperPod clusters, run distributed model training☆18Updated last week
- A tool to extend the capabilities of an EKS cluster☆67Updated 4 years ago
- Common ACK runtime and type system☆39Updated last month
- ☆56Updated 8 months ago
- AWS Libfabric☆38Updated last week
- AWS Distro for OpenTelemetry (ADOT) Helm Charts☆45Updated 4 months ago
- This repository aims to showcase how to finetune a FM model in Amazon EKS cluster using, JupyterHub to provision notebooks and craft both…☆44Updated 8 months ago
- This Guidance demonstrates how to deploy a machine learning inference architecture on Amazon Elastic Kubernetes Service (Amazon EKS). It …☆42Updated last month
- Amazon SageMaker operator for Kubernetes☆149Updated last year
- VPC CNI plugins for Amazon ECS and EKS.☆68Updated last year
- ☆25Updated last year