aws-solutions-library-samples / guidance-for-scalable-model-inference-and-agentic-ai-on-amazon-eksView on GitHub
Comprehensive, scalable ML inference architecture using Amazon EKS, leveraging Graviton processors for cost-effective CPU-based inference and GPU instances for accelerated inference. Guidance provides a complete end-to-end platform for deploying LLMs with agentic AI capabilities, including RAG and MCP
☆21Mar 12, 2026Updated 2 weeks ago
Alternatives and similar repositories for guidance-for-scalable-model-inference-and-agentic-ai-on-amazon-eks
Users that are interested in guidance-for-scalable-model-inference-and-agentic-ai-on-amazon-eks are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository provides a deployable solution using Infrastructure-as-Code (IaC) templates with AWS CloudFormation to help you automate …☆11Mar 13, 2024Updated 2 years ago
- Spatial search using Elastic Search☆12Dec 27, 2014Updated 11 years ago
- Amazon ECS Auto Scaling for GPU-based Machine Learning Workloads☆18Jan 29, 2024Updated 2 years ago
- ☆16Jun 25, 2024Updated last year
- A comprehensive toolkit for deploying production-ready Generative AI infrastructure on Amazon EKS. Includes pre-configured components for…☆47Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- This Guidance demonstrates how to deploy a machine learning inference architecture on Amazon Elastic Kubernetes Service (Amazon EKS). It …☆46May 29, 2025Updated 10 months ago
- Implementing a fast scaling and low cost Stable Diffusion inference solution with serverless and containers on AWS☆41May 21, 2024Updated last year
- The all encompassing boilerplate to get you going quickly with CraftCMS/Bootstrap/Sass/Vue.js and more.☆14Feb 6, 2021Updated 5 years ago
- ☆28Dec 19, 2024Updated last year
- ☆10Mar 16, 2026Updated 2 weeks ago
- This Guidance demonstrates how organizations can implement secure enterprise authentication for Amazon Bedrock using industry-standard pr…☆195Updated this week
- Tutorials and labs focused on educating users☆11Sep 6, 2023Updated 2 years ago
- ☆13Apr 6, 2023Updated 2 years ago
- Content repository for Community.aws☆49Nov 27, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- This Guidance demonstrates how enterprises can unlock the value of their data through the powerful generative AI capabilities of Amazon Q…☆19Jun 24, 2025Updated 9 months ago
- Chainlit application built using AWS CDK, secured with Amazon Cognito, that allows you to interact with Anthropic's Claude language model…☆36Feb 11, 2026Updated last month
- "Docs 2.1" docs-as-code boilerplate☆17Mar 9, 2023Updated 3 years ago
- A really simple Spring Boot app, for demos. Displays information about cheese, for some reason.☆10Feb 20, 2026Updated last month
- ☆14May 19, 2023Updated 2 years ago
- This Guidance shows how to build an Amazon Elastic Compute Cloud (Amazon EC2) Spot placement score tracker to monitor unused Amazon EC2 S…☆48Feb 12, 2026Updated last month
- ☆11Nov 30, 2019Updated 6 years ago
- File Level Backup of apps running on Openshift / Kubernetes☆15Apr 25, 2019Updated 6 years ago
- This GenAI solution enables users to extract insights from diverse data formats (video, audio, PDFs, text) through a unified interface. U…☆17Feb 12, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆77Mar 10, 2026Updated 3 weeks ago
- Jupyter notebook containing code from text preprocessing blog post☆10Nov 29, 2016Updated 9 years ago
- This Guidance demonstrates how to streamline access to numerous large language models (LLMs) through a unified, industry-standard API gat…☆209Updated this week
- This Guidance demonstrates how to build an internal Software-as-a-Service (SaaS) platform that provides access to foundation models, like…☆90Oct 20, 2024Updated last year
- Basic Operator Building Tutorial☆11Jun 17, 2020Updated 5 years ago
- Workshop on building an operator.☆14Jul 27, 2020Updated 5 years ago
- An AWS Cloud Development Kit (CDK) sample showing how to configure the aws_lambda extension to send outgoing webhooks via Amazon EventBri…☆16Feb 25, 2026Updated last month
- ☆13Dec 15, 2025Updated 3 months ago
- Awesome List / Resources for Account Abstraction☆11Dec 14, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆15May 8, 2025Updated 10 months ago
- This project shows how to implement a simple SaaS metering system on AWS☆14Apr 15, 2025Updated 11 months ago
- This repository aims to showcase how to finetune a FM model in Amazon EKS cluster using, JupyterHub to provision notebooks and craft both…☆52Jun 17, 2025Updated 9 months ago
- Run Haystack Pipelines on Ray☆20Oct 16, 2024Updated last year
- ☆11Jun 1, 2022Updated 3 years ago
- This project is an example of using AWS Step functions to manage and track a series of AWS Batch jobs in N_TO_N mode.☆15Jan 20, 2026Updated 2 months ago
- Deploy and manage a self-hosted LLM using EKS.