aws-samples / awsome-inference
☆45 Updated 2 months ago
Alternatives and similar repositories for awsome-inference:
Users who are interested in awsome-inference are comparing it to the libraries listed below.
- This Guidance demonstrates how to deploy a machine learning inference architecture on Amazon Elastic Kubernetes Service (Amazon EKS). It … ☆42 Updated 2 months ago
- Mistral on AWS examples for Bedrock & SageMaker ☆66 Updated this week
- ☆90 Updated this week
- Create, List, Update, Delete Amazon EKS clusters. Deploy and manage software on EKS. Run distributed model training and inference example… ☆55 Updated 3 weeks ago
- ☆61 Updated 9 months ago
- This repository aims to showcase how to fine-tune an FM in an Amazon EKS cluster, using JupyterHub to provision notebooks and craft both… ☆44 Updated 9 months ago
- Some crazy experiments ☆33 Updated 2 months ago
- ☆21 Updated last month
- ☆25 Updated last week
- aws-solutions-library-samples / guidance-for-a-multi-tenant-generative-ai-gateway-with-cost-and-usage-tracking-on-aws: This Guidance demonstrates how to build an internal Software-as-a-Service (SaaS) platform that provides access to foundation models, like… ☆74 Updated 5 months ago
- ☆79 Updated last week
- Foundation model benchmarking tool. Run any model on any AWS platform and benchmark for performance across instance type and serving stac… ☆238 Updated this week
- Run FMBench across multiple Amazon EC2 machines to benchmark an FM across multiple serving stacks simultaneously ☆14 Updated this week
- ☆40 Updated 2 weeks ago
- Example code for AWS Neuron SDK developers building inference and training applications ☆140 Updated last week
- ☆17 Updated last year
- EFA/NCCL base AMI build Packer and CodeBuild/Pipeline files. Also base Docker build files to enable EFA/NCCL in containers ☆42 Updated last year
- ☆60 Updated last year
- ☆37 Updated 5 months ago
- AWS Generative AI Conversational RAG Reference (Galileo) ☆74 Updated last week
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆13 Updated last week
- ☆29 Updated 11 months ago
- ☆30 Updated 3 months ago
- The repository includes integrations with Amazon Bedrock and its included LLM, such as Amazon Titan and vector and graph database for a R… ☆8 Updated 5 months ago
- ☆21 Updated last week
- ☆25 Updated 3 weeks ago
- Try Artifacts and Code Interpreter Tool with Amazon Bedrock. ☆95 Updated 2 months ago
- ☆12 Updated last year
- ☆28 Updated last year
- Pre-built examples of Generative AI agents with Bedrock across multiple industries. ☆38 Updated last week