opea-project / GenAIInfra
Containerization and cloud native suite for OPEA
☆57Updated this week
Alternatives and similar repositories for GenAIInfra:
Users that are interested in GenAIInfra are comparing it to the libraries listed below
- Evaluation, benchmark, and scorecard, targeting for performance on throughput and latency, accuracy on popular evaluation harness, safety…☆34Updated this week
- This repo contains documents of the OPEA project☆38Updated this week
- Generative AI Examples is a collection of GenAI examples such as ChatQnA, Copilot, which illustrate the pipeline capabilities of the Open…☆441Updated this week
- GenAI components at micro-service level; GenAI service composer to create mega-service☆145Updated this week
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.☆100Updated this week
- GenAI Studio is a low code platform to enable users to construct, evaluate, and benchmark GenAI applications. The platform also provide c…☆38Updated this week
- GenAI inference performance benchmarking tool☆41Updated this week
- Helm charts for the KubeRay project☆43Updated last month
- Model Registry provides a single pane of glass for ML model developers to index and manage models, versions, and ML artifacts metadata. I…☆119Updated this week
- Cloud Native Artifacial Intelligence Model Format Specification☆41Updated this week
- A collection of YAML files, Helm Charts, Operator code, and guides to act as an example reference implementation for NVIDIA NIM deploymen…☆179Updated this week
- ☆40Updated last month
- ☆81Updated 5 months ago
- A toolkit for discovering cluster network topology.☆46Updated last week
- 🎉 An awesome & curated list of best LLMOps tools.☆97Updated this week
- ☆19Updated 2 weeks ago
- ☆24Updated this week
- Repository for open inference protocol specification☆54Updated 9 months ago
- Extensible generative AI platform on Kubernetes with OpenAI-compatible APIs.☆67Updated this week
- ☆85Updated 8 months ago
- Model Server for Kepler☆27Updated this week
- WG Serving☆24Updated 3 weeks ago
- ☆36Updated this week
- AppWrapper controller for Kueue☆13Updated last week
- Example DRA driver that developers can fork and modify to get them started writing their own.☆69Updated this week
- Accelerate your Gen AI with NVIDIA NIM and NVIDIA AI Workbench☆159Updated last week
- Smart Kubernetes Scheduling☆78Updated this week
- InstaSlice facilitates the use of Dynamic Resource Allocation (DRA) on Kubernetes clusters for GPU sharing☆27Updated 5 months ago
- K8s device plugin for GPU sharing☆100Updated 2 years ago
- InstaSlice Operator facilitates slicing of accelerators using stable APIs☆35Updated this week