awslabs / llm-hosting-containerLinks
Large Language Model Hosting Container
☆91Updated 3 months ago
Alternatives and similar repositories for llm-hosting-container
Users that are interested in llm-hosting-container are comparing it to the libraries listed below
Sorting:
- Training and inference on AWS Trainium and Inferentia chips.☆254Updated last week
- ☆56Updated 6 months ago
- ☆64Updated 8 months ago
- Leverage your LangChain trace data for fine tuning☆46Updated last year
- Streamlit app for recommending eval functions using prompt diffs☆30Updated 2 years ago
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access…☆114Updated last year
- ☆111Updated last year
- Example code for AWS Neuron SDK developers building inference and training applications☆155Updated this week
- Recipes and resources for building, deploying, and fine-tuning generative AI with Fireworks.☆131Updated last month
- Context is Key: Combining Embedding-based Retrieval with LLMs for Comprehensive Knowledge Enrichment☆31Updated 2 years ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆51Updated last year
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆32Updated last year
- Applying Evaluation Driven Development (EDD) to aid in the design decision of RAG pipelines☆31Updated 2 years ago
- Deploy llama.cpp compatible Generative AI LLMs on AWS Lambda!☆177Updated last year
- 💙 Unstructured Data Connectors for Haystack 2.0☆17Updated 2 years ago
- The backend behind the LLM-Perf Leaderboard☆11Updated last year
- ☆54Updated last year
- Question Answering Generative AI application with Large Language Models (LLMs) and Amazon OpenSearch Service☆28Updated last year
- ☆89Updated 2 years ago
- Using LlamaIndex with Ray for productionizing LLM applications☆71Updated 2 years ago
- Just a bunch of benchmark logs for different LLMs☆119Updated last year
- CodeSage: Code Representation Learning At Scale (ICLR 2024)☆115Updated last year
- ☆46Updated last year
- experiments with inference on llama☆103Updated last year
- Tutorial to get started with SkyPilot!☆58Updated last year
- ☆147Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆25Updated 2 months ago
- Mixing Language Models with Self-Verification and Meta-Verification☆111Updated last year
- ☆21Updated last year
- A simple Streamlit application to visualize document chunks and queries in embedding space 🗺️🔍☆13Updated 9 months ago