bentoml/OpenLLM

Run any open-source LLM, such as DeepSeek and Llama, as an OpenAI-compatible API endpoint in the cloud.

☆12,406

Alternatives and similar repositories for OpenLLM

Users who are interested in OpenLLM compare it to the libraries listed below.

run-llama / llama_index
LlamaIndex is the leading document agent and OCR platform
☆51,145 · Updated this week
huggingface / text-generation-inference
Large Language Model Text Generation Inference
☆10,882 · Mar 21, 2026 · Updated 4 months ago
bentoml / BentoML
The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!
☆8,737 · Jul 20, 2026 · Updated last week
vllm-project / vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
☆87,317 · Updated this week
lm-sys / FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
☆39,505 · May 1, 2026 · Updated 2 months ago
guidance-ai / guidance
A guidance language for controlling large language models.
☆21,693 · May 21, 2026 · Updated 2 months ago
ShishirPatil / gorilla
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
☆12,968 · Apr 13, 2026 · Updated 3 months ago
microsoft / autogen
A programming framework for agentic AI
☆60,029 · Apr 15, 2026 · Updated 3 months ago
langchain-ai / langchain
The agent engineering platform.
☆142,699 · Updated this week
oobabooga / textgen
Open-source desktop app for local LLMs. Text, vision, tool-calling, OpenAI/Anthropic-compatible API. 100% private.
☆47,492 · Jun 2, 2026 · Updated last month
BerriAI / litellm
The fastest, litest AI Gateway. Rust core with Python SDK. Call 100+ LLM APIs in OpenAI (or native) format with cost tracking, guardrails…
☆54,835 · Updated this week
deepset-ai / haystack
Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and a…
☆26,032 · Updated this week
mudler / LocalAI
LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.
☆47,928 · Updated this week
mlc-ai / mlc-llm
Universal LLM Deployment Engine with ML Compilation
☆23,002 · Updated this week
FlowiseAI / Flowise
Build AI Agents, Visually
☆54,965 · Updated this week
nomic-ai / gpt4all
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
☆77,403 · May 27, 2025 · Updated last year
zylon-ai / private-gpt
Complete API layer for private AI applications on local models: RAG, skills, tools, MCP, text-to-sql, and more. Works with any OpenAI-com…
☆57,380 · Updated this week
ggml-org / llama.cpp
LLM inference in C/C++
☆121,787 · Updated this week
stanfordnlp / dspy
DSPy: The framework for programming—not prompting—language models
☆36,410 · Updated this week
letta-ai / letta
Platform for stateful agents: AI with advanced memory that can learn and self-improve over time.
☆23,985 · Jul 22, 2026 · Updated last week
QuivrHQ / quivr
Opinionated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products wi…
☆39,362 · Jul 9, 2025 · Updated last year
nlpxucan / WizardLM
LLMs built upon Evol Instruct: WizardLM, WizardCoder, WizardMath
☆9,481 · Jun 7, 2025 · Updated last year
Lightning-AI / litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
☆13,587 · Jul 20, 2026 · Updated last week
eugeneyan / open-llms
📋 A list of open LLMs available for commercial use.
☆12,840 · Feb 13, 2025 · Updated last year
skypilot-org / skypilot
The AI Compute Platform for frontier teams. SkyPilot turns fragmented AI compute into one AI supercomputer, so frontier AI teams build cu…
☆10,410 · Updated this week
mem0ai / mem0
Universal memory layer for AI Agents
☆61,841 · Updated this week
TransformerOptimus / SuperAGI
<⚡️> SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agent…
☆17,641 · Jan 22, 2025 · Updated last year
dottxt-ai / outlines
Structured Outputs
☆15,354 · Updated this week
ollama / ollama
Get up and running with Kimi-K2.6, GLM-5.2, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.
☆176,993 · Updated this week
zilliztech / GPTCache
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
☆8,112 · Jul 11, 2025 · Updated last year
openlm-research / open_llama
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
☆7,533 · Jul 16, 2023 · Updated 3 years ago
unslothai / unsloth
Unsloth is a local UI for training and running Gemma 4, Qwen3.6, DeepSeek, Kimi, GLM and other models.
☆68,965 · Updated this week
AntonOsika / gpt-engineer
CLI platform to experiment with codegen. Precursor to: https://lovable.dev
☆55,190 · May 14, 2025 · Updated last year
mlc-ai / web-llm
High-performance In-browser LLM Inference Engine
☆18,467 · Jun 9, 2026 · Updated last month
chroma-core / chroma
Search infrastructure for AI
☆28,890 · Updated this week
Mintplex-Labs / anything-llm
Stop renting your intelligence. Own it with AnythingLLM. Everything you need for a powerful local-first agent experience
☆63,878 · Updated this week
openinterpreter / openinterpreter
A coding agent for open models like Kimi K3
☆67,341 · Updated this week
bigscience-workshop / petals
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
☆10,452 · Sep 7, 2024 · Updated last year
neuml / txtai
💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows
☆12,760 · Updated this week