aws-samples / eks_gpu_and_trainuim_perceiver_io_training
This example shows how to produce multimodal videos with audio using the Kinetics dataset on AWS Trainium and EC2 GPU orchestrated by EKS and launched by Karpenter
☆4Updated 6 months ago
Alternatives and similar repositories for eks_gpu_and_trainuim_perceiver_io_training:
Users that are interested in eks_gpu_and_trainuim_perceiver_io_training are comparing it to the libraries listed below
- Amazon SageMaker operator for Kubernetes☆150Updated last year
- AWS virtual gpu device plugin provides capability to use smaller virtual gpus for your machine learning inference workloads☆203Updated last year
- Deep learning benchmark utility and optimization tips on EKS.☆48Updated 5 years ago
- ACK service controller for Amazon SageMaker☆41Updated last week
- Kubeflow workshop on EKS. Mainly focus on AWS integration examples. Please go check kubeflow website http://kubeflow.org for other exampl…☆97Updated 4 years ago
- CSI Driver of Amazon FSx for Lustre https://aws.amazon.com/fsx/lustre/☆132Updated this week
- Argoflow-AWS has been superseded by deployKF☆44Updated last year
- Train and Deploy Machine Learning Models on Kubernetes using Amazon EKS☆163Updated 5 years ago
- Create, List, Update, Delete Amazon EKS clusters. Deploy and manage software on EKS. Run distributed model training and inference example…☆52Updated this week
- Distributed training using Kubeflow on Amazon EKS☆83Updated this week
- Kustomize manifest to deploy kubeflow pipelines in AWS☆21Updated 3 years ago
- ☆43Updated 8 months ago
- SageMaker specific extensions to TensorFlow.☆54Updated 6 months ago
- A Data Platform built for AWS, powered by Kubernetes.☆127Updated last year
- EKS cluster with multiple Spot Node Groups (scale from 0 GPU Node Groups)☆18Updated 5 years ago
- kfctl is a CLI for deploying and managing Kubeflow☆183Updated last year
- A TensorFlow Serving solution for use in SageMaker. This repo is now deprecated.☆172Updated last year
- Fluent Bit plugin-based centralized log analysis across Amazon ECS & EKS clusters☆52Updated 4 years ago
- Deploying EFA in EKS utilizing GPUDirectRDMA where supported☆37Updated 3 months ago
- WARNING: This package has been deprecated. Please use the SageMaker Training Toolkit for model training and the SageMaker Inference Toolk…☆186Updated 4 years ago
- This Guidance demonstrates how to deploy a machine learning inference architecture on Amazon Elastic Kubernetes Service (Amazon EKS). It …☆41Updated this week
- A high performance data access library for machine learning tasks☆74Updated last year
- Repository for makeinga a GitHub Actions for deploying to Kubeflow.☆35Updated 2 years ago
- Amazon SageMaker Managed Spot Training Examples☆50Updated 7 months ago
- Build Train and Deploy your own custom container using AWS StepFunctions Data Science SDK☆23Updated 4 years ago
- ☆66Updated 7 months ago
- Amazon SageMaker Debugger provides functionality to save tensors during training of machine learning jobs and analyze those tensors☆161Updated 8 months ago
- ☆145Updated 2 years ago
- KubeFlow on AWS☆176Updated 3 weeks ago
- This is a sample solution for logging EC2 Spot Instance Interruptions, storing them in CloudWatch and S3, and visualizing them with a Clo…☆62Updated 4 months ago