aws-samples / scalable-hw-agnostic-inferenceLinks
A hardware-agnostic (NVIDIA's GPUs and AWS Inferentia accelerators) deployment of computer-vision models (e.g., YOLO, ViT), generate text and text-to-image (e.g., Llama3 and Stable Diffusion ) on EKS controlled by K8s ingress in routing-time and Karpenter in scheduling-time that is scaled by KEDA.
☆26Updated 4 months ago
Alternatives and similar repositories for scalable-hw-agnostic-inference
Users that are interested in scalable-hw-agnostic-inference are comparing it to the libraries listed below
Sorting:
- This Guidance demonstrates how to deploy a machine learning inference architecture on Amazon Elastic Kubernetes Service (Amazon EKS). It …☆47Updated 6 months ago
- ☆57Updated this week
- AIOps modules is a collection of reusable Infrastructure as Code (IaC) modules for Machine Learning (ML), Foundation Models (FM), Large L…☆91Updated last week
- ☆126Updated this week
- Demonstrate game-server related deployment methods on EKS.☆39Updated 7 months ago
- Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.☆371Updated this week
- AI on EKS - Tested AI/ML for Amazon Elastic Kubernetes Service☆137Updated last week
- MLOps on Amazon EKS☆114Updated last week
- This repository provides a reference architecture for building an end to end SaaS solution using Amazon Elastic Kubernetes Service (EKS)☆324Updated 2 months ago
- Create and manage Amazon SageMaker HyperPod clusters, run distributed model training☆25Updated this week
- Research and Engineering Studio (RES) is an AWS supported open source product that enables IT administrators to provide an easy-to-use we…☆104Updated 2 months ago
- This Guidance shows how to build an Amazon Elastic Compute Cloud (Amazon EC2) Spot placement score tracker to monitor unused Amazon EC2 S…☆48Updated last year
- Reference architecture for deployment pipelines☆301Updated last week
- Foundation model benchmarking tool. Run any model on any AWS platform and benchmark for performance across instance type and serving stac…☆254Updated 7 months ago
- ☆19Updated 9 months ago
- ☆40Updated last year
- ☆63Updated last week
- Patterns repository for the Amazon EKS Bluepints for CDK☆173Updated 2 months ago
- ☆208Updated 2 weeks ago
- This repository aims to showcase how to finetune a FM model in Amazon EKS cluster using, JupyterHub to provision notebooks and craft both…☆51Updated 5 months ago
- CDK AWS Observability Accelerator☆150Updated this week
- ☆26Updated 6 months ago
- aws-solutions-library-samples / guidance-for-a-multi-tenant-generative-ai-gateway-with-cost-and-usage-tracking-on-awsThis Guidance demonstrates how to build an internal Software-as-a-Service (SaaS) platform that provides access to foundation models, like…☆89Updated last year
- eINS provides an additional layer of resilience for ECS external instances in deployment scenarios where connectivity to the on-region EC…☆10Updated 2 years ago
- Implementing a fast scaling and low cost Stable Diffusion inference solution with serverless and containers on AWS☆41Updated last year
- The Automated Data Analytics on AWS solution provides an end-to-end data platform for ingesting, transforming, managing and querying data…☆90Updated last year
- Build complex, serverless, and highly scalable generative AI applications with prompt chaining.☆312Updated 3 weeks ago
- ☆62Updated 2 years ago
- ☆56Updated last month
- ☆15Updated last year