rh-aiservices-bu / gpu-partitioning-guideLinks
Repository to demo GPU Sharing with Time Slicing, MPS, MIG and others
☆48Updated 11 months ago
Alternatives and similar repositories for gpu-partitioning-guide
Users that are interested in gpu-partitioning-guide are comparing it to the libraries listed below
Sorting:
- InstaSlice Operator facilitates slicing of accelerators using stable APIs☆44Updated this week
- ☆19Updated this week
- Resources, demos, recipes,... to work with LLMs on OpenShift with OpenShift AI or Open Data Hub.☆136Updated 2 weeks ago
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.☆125Updated last week
- Cloud Native Artifacial Intelligence Model Format Specification☆94Updated last week
- Open Data Hub operator to manage ODH component integrations☆86Updated this week
- GenAI inference performance benchmarking tool☆95Updated this week
- This project makes running the InstructLab large language model (LLM) fine-tuning process easy and flexible on OpenShift☆27Updated 3 weeks ago
- Simplified model deployment on llm-d☆27Updated 2 months ago
- WG Serving☆30Updated last week
- Artifacts for the Distributed Workloads stack as part of ODH☆33Updated this week
- Model Registry provides a single pane of glass for ML model developers to index and manage models, versions, and ML artifacts metadata. I…☆149Updated this week
- InstaSlice facilitates the use of Dynamic Resource Allocation (DRA) on Kubernetes clusters for GPU sharing☆30Updated 9 months ago
- AI-on-OpenShift website source code☆92Updated last month
- ☆16Updated last week
- Example DRA driver that developers can fork and modify to get them started writing their own.☆89Updated last week
- ODH integration with AI at the Edge usecases☆12Updated 10 months ago
- ☆20Updated 7 months ago
- Test Orchestrator for Performance and Scalability of AI pLatforms☆15Updated last week
- open-cluster-management governance material.☆64Updated this week
- ☆59Updated last year
- The kernel module management operator builds, signs and loads kernel modules in Kubernetes clusters.☆107Updated this week
- Operator for NooBaa - object data service for hybrid and multi cloud environments☆117Updated this week
- 🏃🏿♀️🏃🏽♀️🏃🏻♂️🕒CNCF Technical Advisory Group for Runtime☆95Updated 5 months ago
- Holistic job manager on Kubernetes☆116Updated last year
- CSI driver for sharing Secrets and ConfigMaps across namespaces.☆38Updated this week
- The AI Accelerator is a template project for setting up Red Hat OpenShift AI using GitOps☆56Updated this week
- JobSet: a k8s native API for distributed ML training and HPC workloads☆260Updated last week
- A global load balancer operator for OpenShift☆54Updated last year
- An operator to run descheduler on OpenShift.☆55Updated this week