aws-solutions-library-samples / guidance-for-scalable-model-inference-and-agentic-ai-on-amazon-eksView external linksLinks
Comprehensive, scalable ML inference architecture using Amazon EKS, leveraging Graviton processors for cost-effective CPU-based inference and GPU instances for accelerated inference. Guidance provides a complete end-to-end platform for deploying LLMs with agentic AI capabilities, including RAG and MCP
☆20Updated this week
Alternatives and similar repositories for guidance-for-scalable-model-inference-and-agentic-ai-on-amazon-eks
Users that are interested in guidance-for-scalable-model-inference-and-agentic-ai-on-amazon-eks are comparing it to the libraries listed below
Sorting:
- A comprehensive toolkit for deploying production-ready Generative AI infrastructure on Amazon EKS. Includes pre-configured components for…☆35Updated this week
- Amazon ECS Auto Scaling for GPU-based Machine Learning Workloads☆17Jan 29, 2024Updated 2 years ago
- This repository provides a deployable solution using Infrastructure-as-Code (IaC) templates with AWS CloudFormation to help you automate …☆11Mar 13, 2024Updated last year
- This Guidance demonstrates how enterprises can unlock the value of their data through the powerful generative AI capabilities of Amazon Q…☆19Jun 24, 2025Updated 7 months ago
- ☆76Feb 10, 2026Updated last week
- Chainlit application built using AWS CDK, secured with Amazon Cognito, that allows you to interact with Anthropic's Claude language model…☆36Jan 28, 2026Updated 2 weeks ago
- Content repository for Community.aws☆49Nov 27, 2024Updated last year
- This Guidance demonstrates how to securely run Model Context Protocol (MCP) servers on the AWS Cloud using containerized architecture. It…☆138Feb 4, 2026Updated last week
- Cloudflare Workers + Alpaca API☆31Mar 14, 2021Updated 4 years ago
- ☆11Nov 30, 2019Updated 6 years ago
- Spatial search using Elastic Search☆12Dec 27, 2014Updated 11 years ago
- Run Eliza plugins inside OpenClaw — wallets, connectors, services, and more☆34Updated this week
- This Guidance shows how to build an Amazon Elastic Compute Cloud (Amazon EC2) Spot placement score tracker to monitor unused Amazon EC2 S…☆48Updated this week
- ⌛ a web app to import raw text and then convert it to linear issues☆14Jun 21, 2025Updated 7 months ago
- The great gaming migration to Linux☆22Dec 17, 2025Updated 2 months ago
- A really simple Spring Boot app, for demos. Displays information about cheese, for some reason.☆10Jan 24, 2026Updated 3 weeks ago
- ☆15May 8, 2025Updated 9 months ago
- ☆10Mar 30, 2020Updated 5 years ago
- Jupyter notebook containing code from text preprocessing blog post☆10Nov 29, 2016Updated 9 years ago
- 3D virtual room where AI agents walk, chat, and collaborate as animated lobster avatars☆33Feb 10, 2026Updated last week
- ☆13Apr 6, 2023Updated 2 years ago
- ☆10Feb 7, 2025Updated last year
- ☆11Jun 1, 2022Updated 3 years ago
- ☆16Jul 20, 2025Updated 6 months ago
- Remote Components demo using Next.js App Router apps☆26Dec 9, 2025Updated 2 months ago
- ☆39Updated this week
- This GenAI solution enables users to extract insights from diverse data formats (video, audio, PDFs, text) through a unified interface. U…☆16Updated this week
- This Guidance demonstrates how to deploy a machine learning inference architecture on Amazon Elastic Kubernetes Service (Amazon EKS). It …☆46May 29, 2025Updated 8 months ago
- Tutorials and labs focused on educating users☆11Sep 6, 2023Updated 2 years ago
- Implementing a fast scaling and low cost Stable Diffusion inference solution with serverless and containers on AWS☆41May 21, 2024Updated last year
- OpenAI on AWS examples for Bedrock & SageMaker☆16Dec 15, 2025Updated 2 months ago
- This Guidance demonstrates how to deploy Cloud Intelligence Dashboards in your AWS environment using AWS CloudFormation templates or comm…☆70Jan 20, 2026Updated 3 weeks ago
- Practical content from 6.419 - Data Analysis: Statistical Modeling and Computation in Applications course from the MicroMasters in Statis…☆10Mar 30, 2021Updated 4 years ago
- An AWS Cloud Development Kit (CDK) sample showing how to configure the aws_lambda extension to send outgoing webhooks via Amazon EventBri…☆16Apr 15, 2025Updated 10 months ago
- ☆12Dec 7, 2024Updated last year
- "Docs 2.1" docs-as-code boilerplate☆18Mar 9, 2023Updated 2 years ago
- a small moltbook running on your own envionment☆26Feb 10, 2026Updated last week
- Basic Operator Building Tutorial☆11Jun 17, 2020Updated 5 years ago
- Awesome List / Resources for Account Abstraction☆11Dec 14, 2022Updated 3 years ago