aws-samples / scalable-hw-agnostic-inferenceLinks
A hardware-agnostic (NVIDIA's GPUs and AWS Inferentia accelerators) deployment of computer-vision models (e.g., YOLO, ViT), generate text and text-to-image (e.g., Llama3 and Stable Diffusion ) on EKS controlled by K8s ingress in routing-time and Karpenter in scheduling-time that is scaled by KEDA.
☆27Updated 7 months ago
Alternatives and similar repositories for scalable-hw-agnostic-inference
Users that are interested in scalable-hw-agnostic-inference are comparing it to the libraries listed below
Sorting:
- ☆58Updated this week
- This Guidance demonstrates how to deploy a machine learning inference architecture on Amazon Elastic Kubernetes Service (Amazon EKS). It …☆46Updated 8 months ago
- AIOps modules is a collection of reusable Infrastructure as Code (IaC) modules for Machine Learning (ML), Foundation Models (FM), Large L…☆101Updated last week
- Create and manage Amazon SageMaker HyperPod clusters, run distributed model training☆25Updated last week
- This Guidance shows how to build an Amazon Elastic Compute Cloud (Amazon EC2) Spot placement score tracker to monitor unused Amazon EC2 S…☆48Updated last year
- ☆26Updated 8 months ago
- ☆132Updated last week
- ☆210Updated 2 months ago
- Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.☆385Updated last week
- AI on EKS - Tested AI/ML for Amazon Elastic Kubernetes Service☆153Updated last week
- MLOps on Amazon EKS☆118Updated 2 weeks ago
- Foundation model benchmarking tool. Run any model on any AWS platform and benchmark for performance across instance type and serving stac…☆255Updated 9 months ago
- ☆76Updated this week
- Demonstrate game-server related deployment methods on EKS.☆39Updated 9 months ago
- This repository aims to showcase how to finetune a FM model in Amazon EKS cluster using, JupyterHub to provision notebooks and craft both…☆51Updated 7 months ago
- ☆62Updated 2 years ago
- This is a sample solution for logging EC2 Spot Instance Interruptions, storing them in CloudWatch and S3, and visualizing them with a Clo…☆73Updated last year
- ☆51Updated last week
- EFA/NCCL base AMI build Packer and CodeBuild/Pipeline files. Also base Docker build files to enable EFA/NCCL in containers☆43Updated 2 years ago
- Research and Engineering Studio (RES) is an AWS supported open source product that enables IT administrators to provide an easy-to-use we…☆108Updated 2 weeks ago
- Serverless application to monitor an AWS Batch architecture through dashboards.☆63Updated last month
- CDK AWS Observability Accelerator☆151Updated last month
- Cloud-native, AI-powered, document processing pipelines on AWS.☆186Updated last week
- aws-solutions-library-samples / guidance-for-a-multi-tenant-generative-ai-gateway-with-cost-and-usage-tracking-on-awsThis Guidance demonstrates how to build an internal Software-as-a-Service (SaaS) platform that provides access to foundation models, like…☆90Updated last year
- Sample microservice based application demonstrating observability capabilities on AWS☆260Updated this week
- ☆27Updated last week
- ☆34Updated 7 months ago
- ACK service controller for Amazon SageMaker☆52Updated 3 weeks ago
- ☆64Updated 3 weeks ago
- This repo provides an end to end SaaS reference architecture implementation using Amazon Elastic Container Service (ECS)☆127Updated last month