substratusai / vllm-dockerLinks

☆63

Alternatives and similar repositories for vllm-docker

Users that are interested in vllm-docker are comparing it to the libraries listed below

Sorting:

aniketmaurya / fastserve-ai
Machine Learning Serving focused on GenAI with simplicity as the top priority.
☆59Updated 3 weeks ago
bentoml / BentoVLLM
Self-host LLMs with vLLM and BentoML
☆139Updated last week
h2oai / enterprise-h2ogpte
Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform
☆87Updated last month
weaviate / structured-rag
Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models
☆111Updated 3 months ago
premAI-io / benchmarks
🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.
☆137Updated last year
shivamsanju / ragswift
🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform
☆38Updated last year
SalesforceAIResearch / SFR-RAG
☆77Updated 6 months ago
etalab-ia / albert-models
Deployment a light and full OpenAI API for production with vLLM to support /v1/embeddings with all embeddings models.
☆42Updated last year
mani-kantap / llm-inference-solutions
A collection of all available inference solutions for the LLMs
☆91Updated 5 months ago
amogkam / llama_index_ray
Using LlamaIndex with Ray for productionizing LLM applications
☆71Updated 2 years ago
ianhohoho / auto-hyde
🔎 A deep-dive into HyDE for Advanced LLM RAG + 💡 Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, covera…
☆32Updated last year
backprop-ai / vllm-benchmark
Benchmarking the serving capabilities of vLLM
☆48Updated 11 months ago
IBM / text-generation-inference
IBM development fork of https://github.com/huggingface/text-generation-inference
☆61Updated 2 months ago
anyscale / llm-router
Tutorial for building LLM router
☆220Updated last year
LLM360 / amber-data-prep
Data preparation code for Amber 7B LLM
☆91Updated last year
nyunAI / PruneGPT
☆51Updated last year
Preemo-Inc / text-generation-inference
☆199Updated last year
substratusai / helm
☆18Updated 11 months ago
argilla-io / argilla-cookbook
Simple examples using Argilla tools to build AI
☆53Updated 8 months ago
titanml / takeoff-community
TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access…
☆114Updated last year
asprenger / ray_vllm_inference
A simple service that integrates vLLM with Ray Serve for fast and scalable LLM serving.
☆69Updated last year
langchain-ai / langchain-elastic
Elasticsearch integration into LangChain
☆58Updated 5 months ago
zozoheir / tinyllm
Develop, evaluate and monitor LLM applications at scale
☆100Updated 8 months ago
log10-io / log10
Python client library for improving your LLM app accuracy
☆98Updated 5 months ago
stephenleo / llm-structured-output-benchmarks
Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…
☆173Updated 10 months ago
ServiceNow / Fast-LLM
Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research
☆217Updated this week
guidance-ai / jsonschemabench
☆51Updated last month
argilla-io / notus
Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…
☆168Updated last year
IlyasMoutawwakil / llm-perf-backend
The backend behind the LLM-Perf Leaderboard
☆10Updated last year
qdrant / bm42_eval
Evaluation of bm42 sparse indexing algorithm
☆68Updated last year