aws-samples / aws-eks-deep-learning-benchmark
Deep learning benchmark utility and optimization tips on EKS.
☆48Updated 5 years ago
Alternatives and similar repositories for aws-eks-deep-learning-benchmark
Users that are interested in aws-eks-deep-learning-benchmark are comparing it to the libraries listed below
Sorting:
- Amazon SageMaker operator for Kubernetes☆149Updated last year
- CSI Driver of Amazon FSx for Lustre https://aws.amazon.com/fsx/lustre/☆135Updated last week
- Train and Deploy Machine Learning Models on Kubernetes using Amazon EKS☆167Updated 5 years ago
- Deploying EFA in EKS utilizing GPUDirectRDMA where supported☆37Updated 7 months ago
- Kubeflow workshop on EKS. Mainly focus on AWS integration examples. Please go check kubeflow website http://kubeflow.org for other exampl…☆98Updated 4 years ago
- ACK service controller for Amazon SageMaker☆46Updated 2 weeks ago
- Volume Controller for Kubernetes☆67Updated 2 years ago
- This is the documentation for AWS Deep Learning AMIs: your one-stop shop for deep learning in the cloud☆46Updated last year
- AWS virtual gpu device plugin provides capability to use smaller virtual gpus for your machine learning inference workloads☆204Updated last year
- Running High Performance Computing (HPA) applications on EKS using Elastic Fabric Adapter (EFA).☆8Updated 4 years ago
- Repository for benchmarking☆78Updated 10 months ago
- Dynamic training with Apache MXNet reduces cost and time for training deep neural networks by leveraging AWS cloud elasticity and scale. …☆56Updated 2 years ago
- Airflow on Kubernetes Operator☆89Updated 2 years ago
- Create, List, Update, Delete Amazon EKS clusters. Deploy and manage software on EKS. Run distributed model training and inference example…☆58Updated 2 weeks ago
- AWS AppMesh sidecar injector for EKS.☆56Updated 4 years ago
- This repository contains tooling used to build the EKS Distro, and all the projects contained in https://github.com/aws/eks-distro.☆81Updated this week
- Incubating project for xgboost operator☆77Updated 3 years ago
- End-to-end solution for cold-start recommendations using vLLM, DeepSeek Llama (8B & 70B), and FAISS on AWS Trainium (Trn1) with the Neuro…☆7Updated last week
- ☆60Updated 2 years ago
- Test infrastructure and tooling for Kubeflow.☆62Updated 3 months ago
- A tool to extend the capabilities of an EKS cluster☆67Updated 4 years ago
- EFA/NCCL base AMI build Packer and CodeBuild/Pipeline files. Also base Docker build files to enable EFA/NCCL in containers☆43Updated last year
- Distributed training using Kubeflow on Amazon EKS☆87Updated this week
- Seldon Core Operator for Kubernetes☆12Updated 5 years ago
- Setup instructions to deploy Google Knative on top of AWS Fargate☆46Updated 4 years ago
- 👩🔬[Experimental] Easily train and serve ML models on Kubernetes, directly from your python code.☆31Updated 6 years ago
- Argoflow-AWS has been superseded by deployKF☆44Updated last year
- Networking Plugins repository for ECS Task Networking☆97Updated 7 months ago
- The Chef cookbook used to build and bootstrap AWS ParallelCluster☆111Updated last week
- *DEPRECATED* Use this instead: https://github.com/aws/eks-charts☆42Updated 5 years ago