bentoml / YataiView external linksLinks
Model Deployment at Scale on Kubernetes π¦οΈ
β833May 8, 2024Updated last year
Alternatives and similar repositories for Yatai
Users that are interested in Yatai are comparing it to the libraries listed below
Sorting:
- Fast model deployment on any cloud πβ176Feb 25, 2024Updated last year
- The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!β8,435Updated this week
- π Launching Bento in a Kubernetes clusterβ17Mar 16, 2025Updated 10 months ago
- An MLOps framework to package, deploy, monitor and manage thousands of production machine learning modelsβ4,719Updated this week
- Standardized Distributed Generative and Predictive AI Inference Platform for Scalable, Multi-Framework Deployment on Kubernetesβ5,108Updated this week
- Sentence Embedding as a Serviceβ15Jun 30, 2025Updated 7 months ago
- ποΈ Reproducible development environment for humans and agentsβ2,181Updated this week
- Simple dependency injection framework for Pythonβ21May 15, 2024Updated last year
- ZenML π: One AI Platform from Pipelines to Agents. https://zenml.io.β5,194Feb 6, 2026Updated last week
- A small utility module to make it simple to build BentoML Services into images inside Kubernetes clusters.β10Dec 15, 2020Updated 5 years ago
- An inference server for your machine learning models, including support for multiple frameworks, multi-model serving and moreβ873Feb 6, 2026Updated last week
- Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.β12,099Jan 26, 2026Updated 2 weeks ago
- The Open Source Feature Store for AI/MLβ6,702Updated this week
- BentoML Example Projects π¨β142Jan 6, 2025Updated last year
- A bridge between different serde implementations.β16Sep 8, 2025Updated 5 months ago
- Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 20+ clouds, oβ¦β9,442Updated this week
- A high-performance ML model serving framework, offers dynamic batching and CPU/GPU pipelines to fully exploit your compute machineβ895Feb 4, 2026Updated last week
- Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.β6,715Updated this week
- Automated Machine Learning on Kubernetesβ1,658Feb 6, 2026Updated last week
- ML pipeline orchestration and model deployments on Kubernetes.β435Aug 18, 2023Updated 2 years ago
- Data Agent Ready Warehouse : One for Analytics, Search, AI, Python Sandbox. β rebuilt from scratch. Unified architecture on your S3.β9,148Updated this week
- Build, Manage and Deploy AI/ML Systemsβ9,746Feb 5, 2026Updated last week
- Machine Learning Toolkit for Kubernetesβ15,446Jan 5, 2026Updated last month
- Autoscale LLM (vLLM, SGLang, LMDeploy) inferences on Kubernetes (and others)β280Nov 3, 2023Updated 2 years ago
- The Virtual Feature Store. Turn your existing data infrastructure into a feature store.β1,964Jul 3, 2025Updated 7 months ago
- The open source developer platform to build AI agents and models with confidence. Enhance your AI applications with end-to-end tracking, β¦β24,051Updated this week
- The Triton Inference Server provides an optimized cloud and edge inferencing solution.β10,334Feb 6, 2026Updated last week
- ClearML - Auto-Magical CI/CD to streamline your AI workload. Experiment Management, Data Management, Pipeline, Orchestration, Scheduling β¦β6,501Feb 6, 2026Updated last week
- Yet another virtualization runtime, make Virtual Machine greate again!β28Jan 20, 2026Updated 3 weeks ago
- An open-source data logging library for machine learning models and data pipelines. π Provides visibility into data quality & model perfβ¦β2,795Jan 10, 2025Updated last year
- An embeddable graph database for large-scale vertices and edgesβ74Apr 16, 2023Updated 2 years ago
- A must-have configuration for Spacemacs users after defecting to Vimβ294Aug 15, 2025Updated 5 months ago
- π³ Build OCI images for Bentos in k8sβ20Jan 23, 2026Updated 3 weeks ago
- Docker for Your ML/DL Models Based on OCI Artifactsβ474Jan 26, 2024Updated 2 years ago
- Event streaming platform for agents, apps, and analytics. Continuously ingest, transform, and serve event data in real time, at scale.β8,774Updated this week
- MLRun is an open source MLOps platform for quickly building and managing continuous ML applications across their lifecycle. MLRun integraβ¦β1,649Feb 5, 2026Updated last week
- Apache OpenDAL: One Layer, All Storage.β4,885Updated this week
- π¦ Data Versioning and ML Experimentsβ15,347Feb 1, 2026Updated last week
- πΆ A tool to package, serve, and deploy any ML model on any platform. Archived to be resurrected one dayπ€β719Sep 13, 2023Updated 2 years ago