aws-samples / scalable-hw-agnostic-inference
A hardware-agnostic deployment (NVIDIA GPUs and AWS Inferentia accelerators) of computer-vision models (e.g., YOLO, ViT), text-generation models (e.g., Llama3), and text-to-image models (e.g., Stable Diffusion) on EKS, controlled by a K8s ingress at routing time and by Karpenter at scheduling time, and scaled by KEDA.
☆24Updated last month
Alternatives and similar repositories for scalable-hw-agnostic-inference
Users who are interested in scalable-hw-agnostic-inference are comparing it to the repositories listed below.
- AIOps modules is a collection of reusable Infrastructure as Code (IaC) modules for Machine Learning (ML), Foundation Models (FM), Large L…☆84Updated this week
- A simple utility to validate if a given AWS ECS task definition is compatible with Fargate.☆13Updated last year
- Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.☆333Updated this week
- Sample microservice-based application demonstrating observability capabilities on AWS☆251Updated this week
- ☆52Updated this week
- ☆62Updated last year
- Foundation model benchmarking tool. Run any model on any AWS platform and benchmark for performance across instance type and serving stac…☆247Updated 4 months ago
- Cloud-native, AI-powered, document processing pipelines on AWS.☆184Updated 4 months ago
- ☆112Updated this week
- CDK AWS Observability Accelerator☆150Updated 3 weeks ago
- This Guidance demonstrates how to deploy a machine learning inference architecture on Amazon Elastic Kubernetes Service (Amazon EKS). It …☆44Updated 2 months ago
- Build complex, serverless, and highly scalable generative AI applications with prompt chaining.☆296Updated last week
- This repository aims to showcase how to fine-tune a foundation model (FM) in an Amazon EKS cluster, using JupyterHub to provision notebooks and craft both…☆46Updated last month
- A demo chatbot application built with Amazon Bedrock Knowledge Bases, Agents, and other AWS serverless GenAI solutions.☆112Updated last month
- MLOps on Amazon EKS☆93Updated 3 weeks ago
- ☆33Updated 2 months ago
- Reference architecture for deployment pipelines☆299Updated 2 months ago
- Patterns repository for the Amazon EKS Blueprints for CDK☆174Updated last week
- ☆48Updated this week
- ☆67Updated last year
- aws-solutions-library-samples / guidance-for-a-multi-tenant-generative-ai-gateway-with-cost-and-usage-tracking-on-aws: This Guidance demonstrates how to build an internal Software-as-a-Service (SaaS) platform that provides access to foundation models, like…☆79Updated 9 months ago
- Mistral on AWS examples for Bedrock & SageMaker☆78Updated last week
- This repository provides a reference architecture for building an end-to-end SaaS solution using Amazon Elastic Kubernetes Service (EKS)☆313Updated 2 weeks ago
- Research and Engineering Studio (RES) is an AWS-supported open-source product that enables IT administrators to provide an easy-to-use we…☆99Updated 3 weeks ago
- ☆20Updated 3 weeks ago
- ☆86Updated 7 months ago
- This is the public roadmap for AWS Proton☆196Updated 4 years ago
- This repo provides an end-to-end SaaS reference architecture implementation using Amazon Elastic Container Service (ECS)☆105Updated this week
- Build a custom user interface for more tailored, controlled, and consolidated interactions with Amazon Q Business.☆50Updated 9 months ago
- Demonstrates game-server-related deployment methods on EKS.☆39Updated 3 months ago