aws-samples / scalable-hw-agnostic-inferenceLinks
A hardware-agnostic (NVIDIA's GPUs and AWS Inferentia accelerators) deployment of computer-vision models (e.g., YOLO, ViT), generate text and text-to-image (e.g., Llama3 and Stable Diffusion ) on EKS controlled by K8s ingress in routing-time and Karpenter in scheduling-time that is scaled by KEDA.
☆26Updated 2 months ago
Alternatives and similar repositories for scalable-hw-agnostic-inference
Users that are interested in scalable-hw-agnostic-inference are comparing it to the libraries listed below
Sorting:
- AIOps modules is a collection of reusable Infrastructure as Code (IaC) modules for Machine Learning (ML), Foundation Models (FM), Large L…☆87Updated last week
- ☆25Updated 3 months ago
- This Guidance demonstrates how to deploy a machine learning inference architecture on Amazon Elastic Kubernetes Service (Amazon EKS). It …☆46Updated 3 months ago
- AI on EKS - Tested AI/ML for Amazon Elastic Kubernetes Service☆118Updated this week
- This Guidance shows how to build an Amazon Elastic Compute Cloud (Amazon EC2) Spot placement score tracker to monitor unused Amazon EC2 S…☆41Updated 10 months ago
- This repository aims to showcase how to finetune a FM model in Amazon EKS cluster using, JupyterHub to provision notebooks and craft both…☆46Updated 2 months ago
- ☆62Updated last year
- Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.☆335Updated this week
- Foundation model benchmarking tool. Run any model on any AWS platform and benchmark for performance across instance type and serving stac…☆251Updated 4 months ago
- ☆114Updated this week
- ☆20Updated 4 months ago
- ☆52Updated this week
- ☆24Updated 3 months ago
- Cloud-native, AI-powered, document processing pipelines on AWS.☆184Updated 5 months ago
- Create, List, Update, Delete Amazon EKS clusters. Deploy and manage software on EKS. Run distributed model training and inference example…☆60Updated 3 weeks ago
- A simple utility to validate if a given AWS ECS task definition is compatible with Fargate.☆13Updated last year
- Demonstrate game-server related deployment methods on EKS.☆39Updated 4 months ago
- ☆201Updated 3 months ago
- aws-solutions-library-samples / guidance-for-a-multi-tenant-generative-ai-gateway-with-cost-and-usage-tracking-on-awsThis Guidance demonstrates how to build an internal Software-as-a-Service (SaaS) platform that provides access to foundation models, like…☆85Updated 10 months ago
- ☆62Updated 2 weeks ago
- The Automated Data Analytics on AWS solution provides an end-to-end data platform for ingesting, transforming, managing and querying data…☆89Updated 11 months ago
- ACK service controller for Amazon SageMaker☆48Updated 2 weeks ago
- ☆18Updated 6 months ago
- MLOps on Amazon EKS☆96Updated 3 weeks ago
- ☆32Updated 7 months ago
- Example code for AWS Neuron SDK developers building inference and training applications☆149Updated last week
- Infrastructure as code for GPU accelerated managed Kubernetes clusters.☆54Updated 4 months ago
- Some crazy experiments☆35Updated this week
- Create and manage Amazon SageMaker HyperPod clusters, run distributed model training☆25Updated 3 weeks ago
- Deploy generative AI agents in your contact center for voice and chat using Amazon Connect, Amazon Lex, and Amazon Bedrock Knowledge Base…☆60Updated 2 months ago