bentoml / OpenLLMLinks
Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.
☆11,824Updated this week
Alternatives and similar repositories for OpenLLM
Users that are interested in OpenLLM are comparing it to the libraries listed below
Sorting:
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)☆12,451Updated last week
- Large Language Model Text Generation Inference☆10,550Updated 3 weeks ago
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading☆9,804Updated last year
- LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath☆9,453Updated 4 months ago
- Python bindings for llama.cpp☆9,635Updated last month
- OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset☆7,521Updated 2 years ago
- Universal LLM Deployment Engine with ML Compilation☆21,440Updated this week
- A guidance language for controlling large language models.☆20,817Updated last week
- Open-source search and retrieval database for AI applications.☆23,665Updated this week
- Semantic cache for LLMs. Fully integrated with LangChain and llama_index.☆7,783Updated 2 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆59,413Updated this week
- QLoRA: Efficient Finetuning of Quantized LLMs☆10,680Updated last year
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.☆39,144Updated 4 months ago
- Go ahead and axolotl questions☆10,551Updated this week
- 📋 A list of open LLMs available for commercial use.☆12,429Updated 7 months ago
- Open source codebase powering the HuggingChat app☆9,197Updated this week
- LlamaIndex is the leading framework for building LLM-powered agents over your data.☆44,575Updated this week
- Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 17+ clouds, o…☆8,786Updated last week
- OpenChat: Advancing Open-source Language Models with Imperfect Data☆5,430Updated last year
- Home of StarCoder: fine-tuning & inference!☆7,459Updated last year
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆12,817Updated last week
- An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.☆4,951Updated 5 months ago
- Letta is the platform for building stateful agents: open AI with advanced memory that can learn and self-improve over time.☆18,684Updated this week
- H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/☆4,653Updated last week
- Structured Outputs☆12,648Updated this week
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…☆17,925Updated this week
- High-performance In-browser LLM Inference Engine☆16,601Updated 3 weeks ago
- A fast inference library for running LLMs locally on modern consumer-class GPUs☆4,336Updated last month
- Tensor library for machine learning☆13,230Updated last week
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.☆46,548Updated this week