The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!
☆8,520 Mar 16, 2026 Updated this week
Alternatives and similar repositories for BentoML
Users that are interested in BentoML are comparing it to the libraries listed below
- An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models ☆4,735 Updated this week
- Model Deployment at Scale on Kubernetes ☆838 May 8, 2024 Updated last year
- The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, a… ☆24,874 Updated this week
- The Open Source Feature Store for AI/ML ☆6,808 Updated this week
- Build, Manage and Deploy AI/ML Systems ☆9,956 Updated this week
- ZenML: One AI Platform from Pipelines to Agents. https://zenml.io. ☆5,281 Updated this week
- Data Versioning and ML Experiments ☆15,458 Updated this week
- Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud. ☆12,174 Updated this week
- Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads. ☆41,799 Updated this week
- Standardized Distributed Generative and Predictive AI Inference Platform for Scalable, Multi-Framework Deployment on Kubernetes ☆5,216 Updated this week
- Production infrastructure for machine learning at scale ☆8,028 Jun 12, 2024 Updated last year
- A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning ☆20,255 Mar 5, 2026 Updated 2 weeks ago
- ClearML - Auto-Magical CI/CD to streamline your AI workload. Experiment Management, Data Management, Pipeline, Orchestration, Scheduling … ☆6,582 Updated this week
- Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and… ☆10,798 Updated this week
- Evidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. Fro… ☆7,308 Mar 10, 2026 Updated last week
- The Triton Inference Server provides an optimized cloud and edge inferencing solution. ☆10,446 Updated this week
- Serve, optimize and scale PyTorch models in production ☆4,360 Aug 6, 2025 Updated 7 months ago
- Machine Learning Toolkit for Kubernetes ☆15,527 Jan 5, 2026 Updated 2 months ago
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python. ☆21,910 Updated this week
- Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes. ☆30,952 Updated this week
- ♾️ CML - Continuous Machine Learning | CI/CD for ML ☆4,170 Jun 2, 2025 Updated 9 months ago
- Dynamic, resilient AI orchestration. Coordinate data, models, and compute as you build AI workflows. Flyte 2 now available locally: https… ☆6,896 Updated this week
- Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and a… ☆24,585 Updated this week
- A curated list of references for MLOps ☆13,813 Nov 21, 2024 Updated last year
- Streamlit - A faster way to build and share data apps. ☆43,928 Updated this week
- Low-code framework for building custom LLMs, neural networks, and other AI models ☆11,657 Updated this week
- Build and share delightful machine learning apps, all in Python. Star to support our work! ☆42,053 Updated this week
- ☁️ Build multimodal AI applications with cloud-native stack ☆21,849 Mar 24, 2025 Updated 11 months ago
- Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, Slurm, 20+ cl… ☆9,664 Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆73,479 Updated this week
- Always know what to expect from your data. ☆11,280 Updated this week
- Modin: Scale your Pandas workflows by changing a single line of code ☆10,363 Feb 10, 2026 Updated last month
- Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML va… ☆3,994 Dec 28, 2025 Updated 2 months ago
- MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle ☆3,697 Mar 9, 2026 Updated last week
- Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search ☆43,435 Updated this week
- the GPU-native, sandboxed Postgres for AI agents ☆9,040 Feb 16, 2026 Updated last month
- Label Studio is a multi-type data labeling and annotation tool with standardized output format ☆26,791 Updated this week
- Aim - An easy-to-use & supercharged open-source experiment tracker. ☆6,043 Updated this week
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. ☆41,869 Updated this week