Model Deployment at Scale on Kubernetes π¦οΈ
β836May 8, 2024Updated last year
Alternatives and similar repositories for Yatai
Users that are interested in Yatai are comparing it to the libraries listed below
Sorting:
- Fast model deployment on any cloud πβ176Feb 25, 2024Updated 2 years ago
- The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!β8,487Updated this week
- π Launching Bento in a Kubernetes clusterβ17Mar 16, 2025Updated 11 months ago
- An MLOps framework to package, deploy, monitor and manage thousands of production machine learning modelsβ4,731Updated this week
- Standardized Distributed Generative and Predictive AI Inference Platform for Scalable, Multi-Framework Deployment on Kubernetesβ5,162Updated this week
- Sentence Embedding as a Serviceβ15Jun 30, 2025Updated 8 months ago
- ποΈ Reproducible development environment for humans and agentsβ2,188Updated this week
- Simple dependency injection framework for Pythonβ21May 15, 2024Updated last year
- ZenML π: One AI Platform from Pipelines to Agents. https://zenml.io.β5,245Updated this week
- A small utility module to make it simple to build BentoML Services into images inside Kubernetes clusters.β10Dec 15, 2020Updated 5 years ago
- An inference server for your machine learning models, including support for multiple frameworks, multi-model serving and moreβ876Updated this week
- Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.β12,148Updated this week
- The Open Source Feature Store for AI/MLβ6,756Updated this week
- BentoML Example Projects π¨β142Jan 6, 2025Updated last year
- A bridge between different serde implementations.β16Sep 8, 2025Updated 5 months ago
- Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 20+ clouds, oβ¦β9,516Updated this week
- A high-performance ML model serving framework, offers dynamic batching and CPU/GPU pipelines to fully exploit your compute machineβ892Updated this week
- Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.β6,767Updated this week
- Automated Machine Learning on Kubernetesβ1,660Feb 28, 2026Updated last week
- ML pipeline orchestration and model deployments on Kubernetes.β435Aug 18, 2023Updated 2 years ago
- Data Agent Ready Warehouse : One for Analytics, Search, AI, Python Sandbox. β rebuilt from scratch. Unified architecture on your S3.β9,174Updated this week
- Build, Manage and Deploy AI/ML Systemsβ9,903Updated this week
- Machine Learning Toolkit for Kubernetesβ15,482Jan 5, 2026Updated 2 months ago
- Autoscale LLM (vLLM, SGLang, LMDeploy) inferences on Kubernetes (and others)β281Nov 3, 2023Updated 2 years ago
- The Virtual Feature Store. Turn your existing data infrastructure into a feature store.β1,967Jul 3, 2025Updated 8 months ago
- The open source developer platform to build AI agents and models with confidence. Enhance your AI applications with end-to-end tracking, β¦β24,485Updated this week
- The Triton Inference Server provides an optimized cloud and edge inferencing solution.β10,406Updated this week
- ClearML - Auto-Magical CI/CD to streamline your AI workload. Experiment Management, Data Management, Pipeline, Orchestration, Scheduling β¦β6,548Feb 28, 2026Updated last week
- Yet another virtualization runtime, make Virtual Machine greate again!β28Jan 20, 2026Updated last month
- An open-source data logging library for machine learning models and data pipelines. π Provides visibility into data quality & model perfβ¦β2,800Jan 10, 2025Updated last year
- An embeddable graph database for large-scale vertices and edgesβ74Apr 16, 2023Updated 2 years ago
- A must-have configuration for Spacemacs users after defecting to Vimβ296Aug 15, 2025Updated 6 months ago
- π³ Build OCI images for Bentos in k8sβ20Updated this week
- Docker for Your ML/DL Models Based on OCI Artifactsβ474Jan 26, 2024Updated 2 years ago
- Event streaming platform for agents, apps, and analytics. Continuously ingest, transform, and serve event data in real time, at scale.β8,839Updated this week
- MLRun is an open source MLOps platform for quickly building and managing continuous ML applications across their lifecycle. MLRun integraβ¦β1,655Updated this week
- Apache OpenDAL: One Layer, All Storage.β4,923Updated this week
- π¦ Data Versioning and ML Experimentsβ15,404Feb 27, 2026Updated last week
- πΆ A tool to package, serve, and deploy any ML model on any platform. Archived to be resurrected one dayπ€β718Sep 13, 2023Updated 2 years ago