sustainable-computing-io / kepler-model-serverLinks
Model Server for Kepler
☆29Updated last month
Alternatives and similar repositories for kepler-model-server
Users that are interested in kepler-model-server are comparing it to the libraries listed below
Sorting:
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.☆134Updated last week
- Holistic job manager on Kubernetes☆116Updated last year
- Enabling Kubernetes to make pod placement decisions with platform intelligence.☆176Updated 9 months ago
- InstaSlice facilitates the use of Dynamic Resource Allocation (DRA) on Kubernetes clusters for GPU sharing☆30Updated 11 months ago
- A Topology-Aware Custom Scheduler For Kubernetes☆66Updated 2 years ago
- 🏃🏿♀️🏃🏽♀️🏃🏻♂️🕒CNCF Technical Advisory Group for Runtime☆95Updated 7 months ago
- Intent Driven Orchestration enables management of applications through their Service Level Objectives, while minimizing developer and adm…☆44Updated 2 months ago
- ☆87Updated last year
- InstaSlice Operator facilitates slicing of accelerators using stable APIs☆47Updated this week
- ☆15Updated last year
- Example DRA driver that developers can fork and modify to get them started writing their own.☆105Updated 3 weeks ago
- Artifacts for the Distributed Workloads stack as part of ODH☆33Updated this week
- Container Level Energy-efficient VPA Recommender☆25Updated last month
- More Flexible Device Extension Capability in Kubernetes (DevicePlugins++)☆23Updated 2 years ago
- llm-d benchmark scripts and tooling☆33Updated this week
- ☆20Updated this week
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.☆71Updated 4 months ago
- ☆38Updated last week
- ☆40Updated 2 weeks ago
- A collection of community maintained NRI plugins☆97Updated this week
- Carbon Limiting Auto Tuning for Kubernetes☆37Updated last year
- Enables multicluster application delivery.☆44Updated 6 months ago
- Test Orchestrator for Performance and Scalability of AI pLatforms☆16Updated last week
- GenAI inference performance benchmarking tool☆123Updated this week
- PEAKS: Power Efficiency Aware Kubernetes Scheduler☆36Updated last year
- WG Serving☆31Updated last month
- llm-d helm charts and deployment examples☆46Updated last month
- Managing the lifecycle for a group of operands☆34Updated this week
- open-cluster-management governance material.☆64Updated 3 weeks ago
- Cloud Native Artifacial Intelligence Model Format Specification☆116Updated last week