Open Model Engine (OME) — Kubernetes operator for LLM serving, GPU scheduling, and model lifecycle management. Works with SGLang, vLLM, TensorRT-LLM, and Triton
☆461Jun 3, 2026Updated this week
Alternatives and similar repositories for ome
Users that are interested in ome are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Genai-bench is a powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serv…☆300May 14, 2026Updated 3 weeks ago
- Following the same workflows as Kubernetes. Widely used in InftyAI community.☆13May 31, 2026Updated last week
- A workload for deploying LLM inference services on Kubernetes☆229Updated this week
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication☆732Updated this week
- Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond☆1,064Updated this week
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Gateway API Inference Extension