awslabs / llm-hosting-container
Large Language Model Hosting Container
☆89 Updated last week
Alternatives and similar repositories for llm-hosting-container
Users who are interested in llm-hosting-container are comparing it to the repositories listed below.
- Easy, fast and very cheap training and inference on AWS Trainium and Inferentia chips. ☆227 Updated last week
- Example code for AWS Neuron SDK developers building inference and training applications ☆146 Updated this week
- ☆62 Updated last month
- ☆54 Updated 6 months ago
- Applying Evaluation Driven Development (EDD) to aid in the design decision of RAG pipelines ☆31 Updated last year
- ☆108 Updated 4 months ago
- ☆54 Updated last year
- ☆22 Updated last year
- ☆44 Updated 7 months ago
- Context is Key: Combining Embedding-based Retrieval with LLMs for Comprehensive Knowledge Enrichment ☆32 Updated last year
- Use LLMs for building real-world apps ☆112 Updated 4 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆16 Updated last week
- Leverage your LangChain trace data for fine tuning ☆41 Updated 10 months ago
- ☆88 Updated last year
- A do-framework project to simplify deployment of Kubeflow on Amazon EKS ☆21 Updated 3 months ago
- ☆60 Updated 2 months ago
- Toolkit for allowing inference and serving with PyTorch on SageMaker. Dockerfiles used for building SageMaker Pytorch Containers are at h… ☆142 Updated 7 months ago
- A generative AI-powered framework for testing virtual agents. ☆239 Updated 2 months ago
- ☆72 Updated 11 months ago
- Check for data drift between two OpenAI multi-turn chat jsonl files. ☆37 Updated last year
- ☆24 Updated this week
- ☆262 Updated last month
- ☆18 Updated 4 months ago
- ☆51 Updated last month
- Question Answering Generative AI application with Large Language Models (LLMs) and Amazon OpenSearch Service ☆25 Updated 6 months ago
- A simple Streamlit application to visualize document chunks and queries in embedding space 🗺️🔍 ☆13 Updated last month
- Hands-on workshop for distributed training and hosting on SageMaker ☆139 Updated last week
- ☆24 Updated last year
- ☆39 Updated last month
- AWS Generative AI Conversational RAG Reference (Galileo) ☆76 Updated this week