awslabs / llm-hosting-container
Large Language Model Hosting Container
☆87Updated this week
Alternatives and similar repositories for llm-hosting-container:
Users that are interested in llm-hosting-container are comparing it to the libraries listed below
- Easy, fast and very cheap training and inference on AWS Trainium and Inferentia chips.☆226Updated this week
- Example code for AWS Neuron SDK developers building inference and training applications☆140Updated last week
- ☆53Updated 4 months ago
- ☆104Updated 3 months ago
- ☆36Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆13Updated last week
- Use LLMs for building real-world apps☆110Updated 3 months ago
- ☆61Updated 2 weeks ago
- Context is Key: Combining Embedding-based Retrieval with LLMs for Comprehensive Knowledge Enrichment☆32Updated last year
- ☆22Updated last year
- ☆42Updated 5 months ago
- Hands-on workshop for distributed training and hosting on SageMaker☆134Updated 2 weeks ago
- Amazon SageMaker Managed Spot Training Examples☆51Updated 9 months ago
- Foundation Model Evaluations Library☆243Updated last week
- A generative AI-powered framework for testing virtual agents.☆216Updated 2 weeks ago
- Applying Evaluation Driven Development (EDD) to aid in the design decision of RAG pipelines☆31Updated last year
- ☆253Updated 2 weeks ago
- experiments with inference on llama☆104Updated 10 months ago
- Toolkit for allowing inference and serving with PyTorch on SageMaker. Dockerfiles used for building SageMaker Pytorch Containers are at h…☆138Updated 6 months ago
- Foundation model benchmarking tool. Run any model on any AWS platform and benchmark for performance across instance type and serving stac…☆238Updated this week
- This is a short example showing how to utilize Amazon SageMaker's real time endpoints with OpenAI's open source Whisper model for audio t…☆67Updated last year
- Repository for training and deploying Generative AI models, including text-text, text-to-image generation and prompt engineering playgrou…☆143Updated this week
- ☆24Updated last year
- ☆88Updated last year
- A simple Streamlit application to visualize document chunks and queries in embedding space 🗺️🔍☆13Updated this week
- ☆57Updated 2 years ago
- Question Answering Generative AI application with Large Language Models (LLMs) and Amazon OpenSearch Service☆24Updated 4 months ago
- ☆54Updated last week
- ☆45Updated 2 months ago
- ☆36Updated last year