opea-project / GenAIInfra
Containerization and cloud native suite for OPEA
☆44 Updated this week
Alternatives and similar repositories for GenAIInfra:
Users who are interested in GenAIInfra are comparing it to the repositories listed below.
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment. ☆88 Updated this week
- Evaluation, benchmark, and scorecard, targeting for performance on throughput and latency, accuracy on popular evaluation harness, safety… ☆28 Updated last week
- Model Server for Kepler ☆27 Updated last month
- 🏃🏿♀️🏃🏽♀️🏃🏻♂️🕒CNCF Technical Advisory Group for Runtime ☆91 Updated 3 months ago
- 🎉 An awesome & curated list of best LLMOps tools. ☆63 Updated this week
- ☆19 Updated this week
- Envoy AI Gateway is an open source project for using Envoy Gateway to handle request traffic from application clients to Generative AI se… ☆189 Updated this week
- Repository for open inference protocol specification ☆50 Updated 8 months ago
- Gateway API Inference Extension ☆183 Updated this week
- A collection of YAML files, Helm Charts, Operator code, and guides to act as an example reference implementation for NVIDIA NIM deploymen… ☆177 Updated 2 weeks ago
- InstaSlice facilitates the use of Dynamic Resource Allocation (DRA) on Kubernetes clusters for GPU sharing ☆27 Updated 3 months ago
- ☆107 Updated this week
- InstaSlice Operator facilitates slicing of accelerators using stable APIs ☆30 Updated this week
- Extensible generative AI platform on Kubernetes with OpenAI-compatible APIs. ☆63 Updated this week
- ☆85 Updated 6 months ago
- Smart Kubernetes Scheduling ☆76 Updated this week
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication ☆346 Updated this week
- Example DRA driver that developers can fork and modify to get them started writing their own. ☆63 Updated this week
- A toolkit for discovering cluster network topology. ☆39 Updated this week
- ☆50 Updated last year
- ☆14 Updated 9 months ago
- GenAI inference performance benchmarking tool ☆20 Updated this week
- ☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work! ☆105 Updated this week
- ☆34 Updated this week
- Generative AI Examples is a collection of GenAI examples such as ChatQnA, Copilot, which illustrate the pipeline capabilities of the Open… ☆395 Updated this week
- Cloud Native Artificial Intelligence Model Format Specification ☆32 Updated this week
- Repository to demo GPU Sharing with Time Slicing, MPS, MIG and others ☆34 Updated 5 months ago
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes. ☆64 Updated last week
- ☆36 Updated last week
- This repo contains documents of the OPEA project ☆30 Updated this week