aws-samples / aws-eks-deep-learning-benchmark
Deep learning benchmark utility and optimization tips on EKS.
☆48Updated 5 years ago
Related projects: ⓘ
- Amazon SageMaker operator for Kubernetes☆149Updated last year
- Deploying EFA in EKS utilizing GPUDirectRDMA where supported☆35Updated 3 months ago
- CSI Driver of Amazon FSx for Lustre https://aws.amazon.com/fsx/lustre/☆124Updated 3 weeks ago
- Train and Deploy Machine Learning Models on Kubernetes using Amazon EKS☆163Updated 5 years ago
- Kubeflow workshop on EKS. Mainly focus on AWS integration examples. Please go check kubeflow website http://kubeflow.org for other exampl…☆97Updated 3 years ago
- ACK service controller for Amazon SageMaker☆39Updated last week
- Distributed training using Kubeflow on Amazon EKS☆79Updated last week
- Running High Performance Computing (HPA) applications on EKS using Elastic Fabric Adapter (EFA).☆8Updated 3 years ago
- This is the documentation for AWS Deep Learning AMIs: your one-stop shop for deep learning in the cloud☆47Updated last year
- EFA/NCCL base AMI build Packer and CodeBuild/Pipeline files. Also base Docker build files to enable EFA/NCCL in containers☆40Updated last year
- This example shows how to produce multimodal videos with audio using the Kinetics dataset on AWS Trainium and EC2 GPU orchestrated by EKS…☆4Updated last month
- Dynamic training with Apache MXNet reduces cost and time for training deep neural networks by leveraging AWS cloud elasticity and scale. …☆56Updated last year
- ☆46Updated last week
- A Data Platform built for AWS, powered by Kubernetes.☆127Updated last year
- Repository for benchmarking☆77Updated 3 months ago
- Amazon EKS cluster consumption made easier☆31Updated 2 years ago
- A tool to extend the capabilities of an EKS cluster☆67Updated 4 years ago
- Toolkit for allowing inference and serving with MXNet in SageMaker. Dockerfiles used for building SageMaker MXNet Containers are at https…☆28Updated last year
- This Guidance shows how to build an Amazon Elastic Compute Cloud (Amazon EC2) Spot placement score tracker to monitor unused Amazon EC2 S…☆37Updated 5 months ago
- Re:Invent Inf1 Instance Lab☆22Updated 4 years ago
- KubeFlow on AWS☆164Updated last month
- Incubating project for xgboost operator☆76Updated 2 years ago
- The Chef cookbook used to build and bootstrap AWS ParallelCluster☆107Updated this week
- ☆34Updated 6 years ago
- ☆60Updated last year
- AWS virtual gpu device plugin provides capability to use smaller virtual gpus for your machine learning inference workloads☆200Updated 9 months ago
- ☆40Updated this week
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆91Updated last year
- Argoflow-AWS has been superseded by deployKF☆44Updated last year
- kfctl is a CLI for deploying and managing Kubeflow☆181Updated last year