aws-samples / eks_gpu_and_trainuim_perceiver_io_training
This example shows how to produce multimodal videos with audio using the Kinetics dataset on AWS Trainium and EC2 GPU instances, orchestrated by EKS and launched by Karpenter.
☆4 · Updated 3 months ago
Related projects
Alternatives and complementary repositories for eks_gpu_and_trainuim_perceiver_io_training
- Amazon SageMaker operator for Kubernetes ☆150 · Updated last year
- Kubeflow workshop on EKS. Mainly focuses on AWS integration examples. Please check the Kubeflow website http://kubeflow.org for other exampl… ☆97 · Updated 3 years ago
- ACK service controller for Amazon SageMaker ☆41 · Updated last month
- Deep learning benchmark utility and optimization tips on EKS. ☆48 · Updated 5 years ago
- Argoflow-AWS has been superseded by deployKF ☆44 · Updated last year
- The AWS virtual GPU device plugin provides the capability to use smaller virtual GPUs for your machine learning inference workloads ☆202 · Updated 11 months ago
- Train and Deploy Machine Learning Models on Kubernetes using Amazon EKS ☆163 · Updated 5 years ago
- KubeFlow on AWS ☆170 · Updated 2 weeks ago
- A high performance data access library for machine learning tasks ☆74 · Updated 11 months ago
- Distributed training using Kubeflow on Amazon EKS ☆82 · Updated 3 weeks ago
- Create, list, update, and delete Amazon EKS clusters. Deploy and manage software on EKS. Run distributed model training and inference example… ☆49 · Updated last week
- A Data Platform built for AWS, powered by Kubernetes. ☆127 · Updated last year
- ☆63 · Updated 4 months ago
- 🚀 Deploy Kubeflow on AWS EKS with Terraform 🤖 ☆64 · Updated last year
- CSI driver for Amazon FSx for Lustre: https://aws.amazon.com/fsx/lustre/ ☆128 · Updated last week
- Performance optimization for Spark running on Kubernetes ☆85 · Updated 4 years ago
- EKS cluster with multiple Spot Node Groups (scale from 0 GPU Node Groups) ☆18 · Updated 5 years ago
- SageMaker specific extensions to TensorFlow. ☆54 · Updated 3 months ago
- WARNING: This package has been deprecated. Please use the SageMaker Training Toolkit for model training and the SageMaker Inference Toolk… ☆186 · Updated 4 years ago
- Deploying EFA in EKS utilizing GPUDirectRDMA where supported ☆37 · Updated 3 weeks ago
- CLI for building Docker images in SageMaker Studio using AWS CodeBuild. ☆57 · Updated 2 years ago
- Terraform module for creating GKE clusters to run Kubeflow ☆213 · Updated 3 years ago
- ☆41 · Updated 6 months ago
- kfctl is a CLI for deploying and managing Kubeflow ☆181 · Updated last year
- ☆19 · Updated 5 years ago
- This Guidance demonstrates how to deploy a machine learning inference architecture on Amazon Elastic Kubernetes Service (Amazon EKS). It … ☆39 · Updated this week
- Serverless application to monitor an AWS Batch architecture through dashboards. ☆59 · Updated 6 months ago
- Toolkit for running MXNet training scripts on SageMaker. Dockerfiles used for building SageMaker MXNet Containers are at https://github.c… ☆60 · Updated last year
- Distributed training with SageMaker's script mode using the Horovod distributed deep learning framework ☆32 · Updated 4 years ago