awslabs / llm-hosting-containerLinks

Large Language Model Hosting Container

☆90

Alternatives and similar repositories for llm-hosting-container

Users that are interested in llm-hosting-container are comparing it to the libraries listed below

Sorting:

huggingface / optimum-neuron
Easy, fast and very cheap training and inference on AWS Trainium and Inferentia chips.
☆235Updated this week
awslabs / extending-the-context-length-of-open-source-llms
☆56Updated last month
aws-neuron / aws-neuron-samples
Example code for AWS Neuron SDK developers building inference and training applications
☆148Updated this week
aws-samples / llm-evaluation-methodology
☆44Updated 9 months ago
parlance-labs / langfree
Leverage your LangChain trace data for fine tuning
☆42Updated last year
langchain-ai / prompt-eval-recommendation
Streamlit app for recommending eval functions using prompt diffs
☆29Updated last year
cohere-ai / cohere-aws
☆62Updated 3 months ago
aws-neuron / transformers-neuronx
☆112Updated 6 months ago
arunprsh / knowledge-augmented-LLMs
Context is Key: Combining Embedding-based Retrieval with LLMs for Comprehensive Knowledge Enrichment
☆32Updated 2 years ago
JGalego / RAGmap
A simple Streamlit application to visualize document chunks and queries in embedding space 🗺️🔍
☆13Updated 3 months ago
parea-ai / parea-sdk-py
Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)
☆78Updated 5 months ago
davanstrien / data-for-fine-tuning-llms
☆79Updated last year
hamelsmu / ft-drift
Check for data drift between two OpenAI multi-turn chat jsonl files.
☆37Updated last year
titanml / takeoff-community
TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access…
☆114Updated last year
anyscale / e2e-llm-workflows
Fine-tune an LLM to perform batch inference and online serving.
☆112Updated 2 months ago
fw-ai / cookbook
Recipes and resources for building, deploying, and fine-tuning generative AI with Fireworks.
☆120Updated this week
aws-samples / aiml-genai-multimodal-agent
☆54Updated last year
aws-samples / llm-based-advanced-summarization
☆50Updated 2 months ago
IlyasMoutawwakil / llm-perf-backend
The backend behind the LLM-Perf Leaderboard
☆10Updated last year
automix-llm / automix
Mixing Language Models with Self-Verification and Meta-Verification
☆105Updated 7 months ago
wenqiglantz / edd-recursive-doc-agent-vs-metadata-replacement
Applying Evaluation Driven Development (EDD) to aid in the design decision of RAG pipelines
☆31Updated last year
anyscale / ray-summit-2023-training
☆87Updated last year
aws / sagemaker-huggingface-inference-toolkit
☆264Updated 3 months ago
louisbrulenaudet / ragoon
High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡
☆66Updated 9 months ago
substratusai / vllm-docker
☆63Updated 4 months ago
NielsRogge / awesome-huggingface
Repository containing awesome resources regarding Hugging Face tooling.
☆47Updated last year
aws-samples / amazon-sagemaker-managed-spot-training
Amazon SageMaker Managed Spot Training Examples
☆51Updated last year
cfahlgren1 / observers
A Lightweight Library for AI Observability
☆250Updated 5 months ago
lancedb / ragged
☆20Updated 9 months ago
deshwalmahesh / PHUDGE
Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…
☆49Updated last year