opea-project / GenAIInfra
Containerization and cloud native suite for OPEA
☆43Updated this week
Alternatives and similar repositories for GenAIInfra:
Users that are interested in GenAIInfra are comparing it to the libraries listed below
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.☆87Updated this week
- Evaluation, benchmark, and scorecard, targeting for performance on throughput and latency, accuracy on popular evaluation harness, safety…☆28Updated this week
- Model Server for Kepler☆27Updated last month
- InstaSlice Operator facilitates slicing of accelerators using stable APIs☆29Updated this week
- InstaSlice facilitates the use of Dynamic Resource Allocation (DRA) on Kubernetes clusters for GPU sharing☆27Updated 3 months ago
- ☆19Updated last week
- ☆34Updated this week
- Gateway API Inference Extension☆176Updated this week
- The kernel module management operator builds, signs and loads kernel modules in Kubernetes clusters.☆96Updated this week
- Example DRA driver that developers can fork and modify to get them started writing their own.☆63Updated this week
- Smart Kubernetes Scheduling☆76Updated this week
- ☆14Updated 9 months ago
- ☆93Updated last month
- Carbon Limiting Auto Tuning for Kubernetes☆35Updated 4 months ago
- ☆107Updated this week
- ☆50Updated last year
- WG Serving☆20Updated 3 weeks ago
- Envoy AI Gateway is an open source project for using Envoy Gateway to handle request traffic from application clients to Generative AI se…☆164Updated this week
- Generative AI Examples is a collection of GenAI examples such as ChatQnA, Copilot, which illustrate the pipeline capabilities of the Open…☆382Updated this week
- GenAI Studio is a low code platform to enable users to construct, evaluate, and benchmark GenAI applications. The platform also provide c…☆33Updated 3 weeks ago
- ☆36Updated this week
- Repository to demo GPU Sharing with Time Slicing, MPS, MIG and others☆33Updated 4 months ago
- A toolkit for discovering cluster network topology.☆37Updated this week
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.☆62Updated last month
- ☆85Updated 6 months ago
- 🏃🏿♀️🏃🏽♀️🏃🏻♂️🕒CNCF Technical Advisory Group for Runtime☆91Updated 2 months ago
- Enabling Kubernetes to make pod placement decisions with platform intelligence.☆174Updated last month
- ☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!☆92Updated this week
- JobSet: a k8s native API for distributed ML training and HPC workloads☆194Updated this week
- Intent Driven Orchestration enables management of applications through their Service Level Objectives, while minimizing developer and adm…☆36Updated 2 months ago