opea-project / GenAIInfra
Containerization and cloud native suite for OPEA
☆37Updated this week
Alternatives and similar repositories for GenAIInfra:
Users that are interested in GenAIInfra are comparing it to the libraries listed below
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.☆82Updated this week
- Evaluation, benchmark, and scorecard, targeting for performance on throughput and latency, accuracy on popular evaluation harness, safety…☆26Updated this week
- Intent Driven Orchestration enables management of applications through their Service Level Objectives, while minimizing developer and adm…☆36Updated last month
- Model Server for Kepler☆27Updated last week
- InstaSlice Operator facilitates slicing of accelerators using stable APIs☆27Updated this week
- InstaSlice facilitates the use of Dynamic Resource Allocation (DRA) on Kubernetes clusters for GPU sharing☆27Updated 2 months ago
- Examples for building and running LLM services and applications locally with Podman☆132Updated this week
- ☆105Updated this week
- ☆13Updated 8 months ago
- GenAI Studio is a low code platform to enable users to construct, evaluate, and benchmark GenAI applications. The platform also provide c…☆25Updated this week
- Enabling Kubernetes to make pod placement decisions with platform intelligence.☆174Updated 3 weeks ago
- ☆19Updated last month
- Extensible generative AI platform on Kubernetes with OpenAI-compatible APIs.☆48Updated this week
- ☆85Updated 5 months ago
- Envoy AI Gateway is an open source project for using Envoy Gateway to handle request traffic from application clients to Generative AI se…☆132Updated this week
- Carbon Limiting Auto Tuning for Kubernetes☆33Updated 3 months ago
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication☆257Updated this week
- Repository to demo GPU Sharing with Time Slicing, MPS, MIG and others☆32Updated 4 months ago
- ☆34Updated 2 weeks ago
- ☆21Updated 2 months ago
- Repository for open inference protocol specification☆46Updated 6 months ago
- ☆34Updated last week
- ☆82Updated 2 months ago
- 🏃🏿♀️🏃🏽♀️🏃🏻♂️🕒CNCF Technical Advisory Group for Runtime☆90Updated 2 months ago
- ☆49Updated 11 months ago
- AI cloud native pipeline for confidential and sustainable computing☆39Updated 4 months ago
- Operator to deploy confidential containers runtime☆120Updated last week
- Smart Kubernetes Scheduling☆73Updated this week
- K8s device plugin for GPU sharing☆99Updated last year
- Slurm in Kubernetes☆42Updated 2 months ago