aws-solutions-library-samples / Cost_Effective_and_Scalable_Models_Inference_on_AWS_GravitonLinks
comprehensive, scalable ML inference architecture using Amazon EKS, leveraging both Graviton processors for cost-effective CPU-based inference and GPU instances for accelerated inference. Guidance provides a complete end-to-end platform for deploying LLMs with agentic AI capabilities, including RAG and MCP
☆15Updated last week
Alternatives and similar repositories for Cost_Effective_and_Scalable_Models_Inference_on_AWS_Graviton
Users that are interested in Cost_Effective_and_Scalable_Models_Inference_on_AWS_Graviton are comparing it to the libraries listed below
Sorting:
- This repository aims to showcase how to finetune a FM model in Amazon EKS cluster using, JupyterHub to provision notebooks and craft both…☆48Updated 3 months ago
- ☆39Updated 11 months ago
- CDK AWS Observability Accelerator☆149Updated last month
- ☆63Updated 2 years ago
- Mistral on AWS examples for Bedrock & SageMaker☆84Updated last month
- Build a custom user interface for more tailored, controlled, and consolidated interactions with Amazon Q business.☆51Updated 11 months ago
- Content repository for Community.aws☆49Updated 10 months ago
- OpenAI on AWS examples for Bedrock & SageMaker☆16Updated 3 weeks ago
- aws-solutions-library-samples / guidance-for-a-multi-tenant-generative-ai-gateway-with-cost-and-usage-tracking-on-awsThis Guidance demonstrates how to build an internal Software-as-a-Service (SaaS) platform that provides access to foundation models, like…☆85Updated 11 months ago
- This Guidance demonstrates how to securely run Model Context Protocol (MCP) servers on the AWS Cloud using containerized architecture. It…☆110Updated 2 weeks ago
- ☆21Updated 3 weeks ago
- Foundation model benchmarking tool. Run any model on any AWS platform and benchmark for performance across instance type and serving stac…☆254Updated 5 months ago
- ☆19Updated last year
- ☆36Updated last year
- AWS Well-Architected Framework Review (WAFR) Acceleration with Generative AI (GenAI) sample.☆77Updated last week
- Building SA AI Agent v2☆79Updated 11 months ago
- This repository contains sample code demonstrating various use cases leveraging Amazon Bedrock and Generative AI. Each sample is a separa…☆383Updated last week
- aws-solutions-library-samples / guidance-for-multimodal-data-processing-using-amazon-bedrock-data-automationThis Guidance shows how Amazon Bedrock Data Automation streamlines the generation of valuable insights from unstructured multimodal conte…☆49Updated 6 months ago
- A demo ChatBot application developed using Amazon Bedrock service's KnowledgeBase, Agent and other AWS's serveless GenAI solution.☆117Updated 2 months ago
- ☆34Updated 3 months ago
- A collection of recommended practices to accelerate the building of secure data science environments in regulated environments.☆49Updated 2 years ago
- ☆55Updated 2 weeks ago
- ☆89Updated 2 years ago
- aws-solutions-library-samples / guidance-for-developing-data-and-ai-foundation-with-amazon-sagemakerDAIVI is a reference solution with IAC modules to accelerate development of Data, Analytics, AI and Visualization applications on AWS usi…☆33Updated 2 weeks ago
- End to end deployment and observability of polyglot microservices in Amazon EKS using AWS App Mesh, AWS Fargate, Amazon Cloudwatch Contai…☆73Updated 7 months ago
- The Automated Data Analytics on AWS solution provides an end-to-end data platform for ingesting, transforming, managing and querying data…☆90Updated last year
- ☆67Updated last year
- This Guidance demonstrates how to streamline access to numerous large language models (LLMs) through a unified, industry-standard API gat…☆160Updated 2 months ago
- This solution demonstrates the setup and deployment of Amazon SageMaker Studio into a private VPC and implementation of multi-layer secur…☆21Updated 3 years ago
- ☆81Updated last year