aws-samples / aws-efa-eksLinks
Deploying EFA in EKS utilizing GPUDirectRDMA where supported
☆36Updated last year
Alternatives and similar repositories for aws-efa-eks
Users that are interested in aws-efa-eks are comparing it to the libraries listed below
Sorting:
- CSI Driver of Amazon FSx for Lustre https://aws.amazon.com/fsx/lustre/☆142Updated last month
- Deep learning benchmark utility and optimization tips on EKS.☆47Updated 6 years ago
- EFA/NCCL base AMI build Packer and CodeBuild/Pipeline files. Also base Docker build files to enable EFA/NCCL in containers☆43Updated 2 years ago
- This repository contains tooling used to build the EKS Distro, and all the projects contained in https://github.com/aws/eks-distro.☆83Updated last week
- Create and manage Amazon SageMaker HyperPod clusters, run distributed model training☆24Updated last week
- VPC CNI plugins for Amazon ECS and EKS.☆70Updated 3 months ago
- Create, List, Update, Delete Amazon EKS clusters. Deploy and manage software on EKS. Run distributed model training and inference example…☆64Updated last week
- ☆61Updated 3 weeks ago
- ☆63Updated last year
- ☆67Updated last week
- K8s controller implementing Multi-Cluster Services API based on AWS Cloud Map.☆97Updated 2 months ago
- Tools for testing Kubernetes on AWS☆181Updated this week
- AWS virtual gpu device plugin provides capability to use smaller virtual gpus for your machine learning inference workloads☆204Updated 2 years ago
- Amazon SageMaker operator for Kubernetes☆149Updated 2 years ago
- This Guidance demonstrates how to deploy a machine learning inference architecture on Amazon Elastic Kubernetes Service (Amazon EKS). It …☆46Updated 8 months ago
- GenAI inference performance benchmarking tool☆142Updated last week
- ACK service controller for Amazon Elastic Kubernetes Service (EKS)☆42Updated last month
- AI on EKS - Tested AI/ML for Amazon Elastic Kubernetes Service☆156Updated this week
- Controller for managing Trunk & Branch Network Interfaces on EKS Cluster using Security Group For Pod feature and IPv4 Addresses for Wind…☆102Updated this week
- Code generator for AWS Controllers for Kubernetes☆91Updated 3 weeks ago
- NVSentinel is a cross-platform fault remediation service designed to rapidly remediate runtime node-level issues in GPU-accelerated compu…☆177Updated this week
- Network Policy Agent is a daemonset that is responsible for enforcing configured network policies on the cluster.☆65Updated this week
- Kubernetes controllers for zone (AZ) aware rollouts and disruptions.☆71Updated 2 years ago
- ☆49Updated this week
- A Topology-Aware Custom Scheduler For Kubernetes☆66Updated 2 years ago
- This repository contains Prow Job configuration for the EKS Distro installation of Prow, which is available at https://prow.eks.amazonaws…☆27Updated last month
- Example DRA driver that developers can fork and modify to get them started writing their own.☆117Updated this week
- ☆34Updated 4 years ago
- Helm charts for llm-d☆52Updated 6 months ago
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.☆146Updated this week