awslabs / llm-hosting-container
Large Language Model Hosting Container
☆80Updated last week
Alternatives and similar repositories for llm-hosting-container:
Users that are interested in llm-hosting-container are comparing it to the libraries listed below
- Easy, fast and very cheap training and inference on AWS Trainium and Inferentia chips.☆216Updated this week
- Example code for AWS Neuron SDK developers building inference and training applications☆132Updated 2 weeks ago
- A generative AI-powered framework for testing virtual agents.☆160Updated last month
- ☆60Updated last month
- ☆33Updated 2 months ago
- ☆51Updated last month
- ☆40Updated 3 months ago
- ☆22Updated 9 months ago
- ☆38Updated this week
- Toolkit for allowing inference and serving with PyTorch on SageMaker. Dockerfiles used for building SageMaker Pytorch Containers are at h…☆136Updated 3 months ago
- ☆102Updated 2 weeks ago
- Hands-on workshop for distributed training and hosting on SageMaker☆130Updated 2 months ago
- MLOps End-to-End Example using Amazon SageMaker Pipeline, AWS CodePipeline and AWS CDK☆135Updated last month
- Use LLMs for building real-world apps☆111Updated 2 weeks ago
- ☆21Updated last year
- ☆88Updated last year
- Foundation Model Evaluations Library☆227Updated 2 weeks ago
- AIOps modules is a collection of reusable Infrastructure as Code (IaC) modules for Machine Learning (ML), Foundation Models (FM), Large L…☆65Updated this week
- A do-framework project to simplify deployment of Kubeflow on Amazon EKS☆20Updated 2 weeks ago
- Applying Evaluation Driven Development (EDD) to aid in the design decision of RAG pipelines☆31Updated last year
- ☆66Updated 7 months ago
- Foundation model benchmarking tool. Run any model on any AWS platform and benchmark for performance across instance type and serving stac…☆220Updated this week
- A simple Streamlit application that helps visualize document chunks and queries in embedding space 🗺️🔍☆12Updated 5 months ago
- This repository is intended for those looking to dive deep on advanced Text-to-SQL concepts.☆99Updated 2 months ago
- ☆243Updated 3 months ago
- AWS Generative AI Conversational RAG Reference (Galileo)☆73Updated 3 weeks ago
- This project simplifies personalized Gen-AI SaaS apps. We fine-tune pre-trained models for users, use single GPUs, and ensure real-time r…☆20Updated last year
- Use natural language to Generate Amazon Athena SQL queries to fetch data.☆59Updated 2 months ago
- ☆57Updated 2 years ago
- Context is Key: Combining Embedding-based Retrieval with LLMs for Comprehensive Knowledge Enrichment☆32Updated last year