aws-solutions-library-samples / Cost_Effective_and_Scalable_Models_Inference_on_AWS_Graviton

A comprehensive, scalable ML inference architecture using Amazon EKS, leveraging both Graviton processors for cost-effective CPU-based inference and GPU instances for accelerated inference. The Guidance provides a complete end-to-end platform for deploying LLMs with agentic AI capabilities, including RAG and MCP.

☆15

Alternatives and similar repositories for Cost_Effective_and_Scalable_Models_Inference_on_AWS_Graviton

Users who are interested in Cost_Effective_and_Scalable_Models_Inference_on_AWS_Graviton are comparing it to the libraries listed below.

aws-samples / gen-ai-on-eks
This repository aims to showcase how to fine-tune a foundation model (FM) in an Amazon EKS cluster, using JupyterHub to provision notebooks and craft both…
☆48 Updated 3 months ago
aws-samples / genai-model-evaluator
☆39 Updated 11 months ago
aws-observability / cdk-aws-observability-accelerator
CDK AWS Observability Accelerator
☆149 Updated last month
aristsakpinis93 / generative-ai-immersion-day
☆63 Updated 2 years ago
aws-samples / mistral-on-aws
Mistral on AWS examples for Bedrock & SageMaker
☆84 Updated last month
aws-samples / custom-web-experience-with-amazon-q-business
Build a custom user interface for more tailored, controlled, and consolidated interactions with Amazon Q Business.
☆51 Updated 11 months ago
build-on-aws / content
Content repository for Community.aws
☆49 Updated 10 months ago
aws-samples / sample-openai-on-aws
OpenAI on AWS examples for Bedrock & SageMaker
☆16 Updated 3 weeks ago
aws-solutions-library-samples / guidance-for-a-multi-tenant-generative-ai-gateway-with-cost-and-usage-tracking-on-aws
This Guidance demonstrates how to build an internal Software-as-a-Service (SaaS) platform that provides access to foundation models, like…
☆85 Updated 11 months ago
aws-solutions-library-samples / guidance-for-deploying-model-context-protocol-servers-on-aws
This Guidance demonstrates how to securely run Model Context Protocol (MCP) servers on the AWS Cloud using containerized architecture. It…
☆110 Updated 2 weeks ago
aws / modern-data-architecture-accelerator
☆21 Updated 3 weeks ago
aws-samples / foundation-model-benchmarking-tool
Foundation model benchmarking tool. Run any model on any AWS platform and benchmark for performance across instance type and serving stac…
☆254 Updated 5 months ago
aws-samples / mlops-terraform-template
☆19 Updated last year
aws-samples / generative-ai-applications-foundational-architecture
☆36 Updated last year
aws-samples / sample-well-architected-acceleration-with-generative-ai
AWS Well-Architected Framework Review (WAFR) Acceleration with Generative AI (GenAI) sample.
☆77 Updated last week
viktoriasemaan / sa-ai-agent
Building SA AI Agent v2
☆79 Updated 11 months ago
aws-samples / genai-quickstart-pocs
This repository contains sample code demonstrating various use cases leveraging Amazon Bedrock and Generative AI. Each sample is a separa…
☆383 Updated last week
aws-solutions-library-samples / guidance-for-multimodal-data-processing-using-amazon-bedrock-data-automation
This Guidance shows how Amazon Bedrock Data Automation streamlines the generation of valuable insights from unstructured multimodal conte…
☆49 Updated 6 months ago
awslabs / genai-bedrock-agent-chatbot
A demo chatbot application developed using Amazon Bedrock's Knowledge Bases, Agents, and other AWS serverless GenAI solutions.
☆117 Updated 2 months ago
aws-samples / multi-region-resiliency-reference-implementation
☆34 Updated 3 months ago
aws-samples / secure-data-science-reference-architecture
A collection of recommended practices to accelerate the building of secure data science environments in regulated environments.
☆49 Updated 2 years ago
aws-samples / awsome-inference
☆55 Updated 2 weeks ago
aws-samples / amazon-sagemaker-secure-mlops
☆89 Updated 2 years ago
aws-solutions-library-samples / guidance-for-developing-data-and-ai-foundation-with-amazon-sagemaker
DAIVI is a reference solution with IAC modules to accelerate development of Data, Analytics, AI and Visualization applications on AWS usi…
☆33 Updated 2 weeks ago
aws-containers / eks-app-mesh-polyglot-demo
End-to-end deployment and observability of polyglot microservices in Amazon EKS using AWS App Mesh, AWS Fargate, Amazon CloudWatch Contai…
☆73 Updated 7 months ago
aws-solutions / automated-data-analytics-on-aws
The Automated Data Analytics on AWS solution provides an end-to-end data platform for ingesting, transforming, managing and querying data…
☆90 Updated last year
aws-samples / amazon-bedrock-prompting
☆67 Updated last year
aws-solutions-library-samples / guidance-for-multi-provider-generative-ai-gateway-on-aws
This Guidance demonstrates how to streamline access to numerous large language models (LLMs) through a unified, industry-standard API gat…
☆160 Updated 2 months ago
aws-samples / amazon-sagemaker-studio-vpc-networkfirewall
This solution demonstrates the setup and deployment of Amazon SageMaker Studio into a private VPC and implementation of multi-layer secur…
☆21 Updated 3 years ago
aws-samples / multi-tenant-chatbot-using-rag-with-amazon-bedrock
☆81 Updated last year