aws-samples / scalable-hw-agnostic-inference
A hardware-agnostic (NVIDIA GPUs and AWS Inferentia accelerators) deployment of computer-vision models (e.g., YOLO, ViT) and of text-generation and text-to-image models (e.g., Llama3 and Stable Diffusion) on EKS, controlled by a K8s ingress at routing time and by Karpenter at scheduling time, and scaled by KEDA.
☆26 Updated 3 months ago
Alternatives and similar repositories for scalable-hw-agnostic-inference
Users that are interested in scalable-hw-agnostic-inference are comparing it to the libraries listed below
- AIOps modules is a collection of reusable Infrastructure as Code (IaC) modules for Machine Learning (ML), Foundation Models (FM), Large L… ☆90 Updated 3 weeks ago
- ☆55 Updated last month
- This Guidance demonstrates how to deploy a machine learning inference architecture on Amazon Elastic Kubernetes Service (Amazon EKS). It … ☆47 Updated 4 months ago
- This repository aims to showcase how to fine-tune a foundation model (FM) in an Amazon EKS cluster, using JupyterHub to provision notebooks and craft both… ☆48 Updated 4 months ago
- Sample microservice based application demonstrating observability capabilities on AWS ☆257 Updated this week
- A simple utility to validate if a given AWS ECS task definition is compatible with Fargate. ☆13 Updated 2 years ago
- Cloud-native, AI-powered, document processing pipelines on AWS. ☆186 Updated 7 months ago
- ☆25 Updated 5 months ago
- aws-solutions-library-samples / guidance-for-a-multi-tenant-generative-ai-gateway-with-cost-and-usage-tracking-on-aws: This Guidance demonstrates how to build an internal Software-as-a-Service (SaaS) platform that provides access to foundation models, like… ☆85 Updated 11 months ago
- AI on EKS - Tested AI/ML for Amazon Elastic Kubernetes Service ☆130 Updated last week
- ☆120 Updated this week
- Foundation model benchmarking tool. Run any model on any AWS platform and benchmark for performance across instance type and serving stac… ☆254 Updated 6 months ago
- This Guidance shows how to build an Amazon Elastic Compute Cloud (Amazon EC2) Spot placement score tracker to monitor unused Amazon EC2 S… ☆42 Updated 11 months ago
- Demonstrate game-server related deployment methods on EKS. ☆39 Updated 5 months ago
- Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS. ☆353 Updated this week
- Build complex, serverless, and highly scalable generative AI applications with prompt chaining. ☆307 Updated 2 weeks ago
- ☆205 Updated 4 months ago
- CDK construct for installing and configuring Karpenter on EKS clusters ☆45 Updated 2 weeks ago
- CDK AWS Observability Accelerator ☆149 Updated 2 months ago
- ☆62 Updated last month
- MLOps on Amazon EKS ☆102 Updated 2 weeks ago
- ☆32 Updated 8 months ago
- This repository provides a reference architecture for building an end to end SaaS solution using Amazon Elastic Kubernetes Service (EKS) ☆323 Updated 3 weeks ago
- ☆23 Updated this week
- Workshop Studio ☆156 Updated last year