rh-aiservices-bu / gpu-partitioning-guide
Repository to demo GPU Sharing with Time Slicing, MPS, MIG and others
☆54 Updated last year
Alternatives and similar repositories for gpu-partitioning-guide
Users that are interested in gpu-partitioning-guide are comparing it to the libraries listed below
- InstaSlice Operator facilitates slicing of accelerators using stable APIs ☆48 Updated last week
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment. ☆140 Updated last week
- GenAI inference performance benchmarking tool ☆134 Updated last week
- Resources, demos, recipes,... to work with LLMs on OpenShift with OpenShift AI or Open Data Hub. ☆142 Updated 3 months ago
- llm-d benchmark scripts and tooling ☆36 Updated this week
- Open Data Hub operator to manage ODH component integrations ☆90 Updated this week
- open-cluster-management governance material. ☆64 Updated last month
- ☆20 Updated last week
- InstaSlice facilitates the use of Dynamic Resource Allocation (DRA) on Kubernetes clusters for GPU sharing ☆30 Updated last year
- This project makes running the InstructLab large language model (LLM) fine-tuning process easy and flexible on OpenShift ☆27 Updated 3 months ago
- ☆17 Updated last week
- llm-d helm charts and deployment examples ☆47 Updated 2 weeks ago
- Cloud Native Artificial Intelligence Model Format Specification ☆147 Updated last week
- Holistic job manager on Kubernetes ☆115 Updated last year
- Model Registry provides a single pane of glass for ML model developers to index and manage models, versions, and ML artifacts metadata. I… ☆157 Updated this week
- Example DRA driver that developers can fork and modify to get them started writing their own. ☆109 Updated last month
- 🏃🏿♀️🏃🏽♀️🏃🏻♂️🕒 CNCF Technical Advisory Group for Runtime ☆95 Updated 7 months ago
- Enabling Kubernetes to make pod placement decisions with platform intelligence. ☆176 Updated 10 months ago
- The kernel module management operator builds, signs and loads kernel modules in Kubernetes clusters. ☆110 Updated this week
- Simplified model deployment on llm-d ☆27 Updated 5 months ago
- ☆33 Updated 3 weeks ago
- A collection of community maintained NRI plugins ☆98 Updated last week
- Deploy Development Builds of Open Cluster Management (OCM) on Red Hat OpenShift Container Platform ☆169 Updated last month
- AI-on-OpenShift website source code ☆100 Updated last week
- ☆20 Updated 10 months ago
- An operator to run descheduler on OpenShift. ☆56 Updated this week
- WG Serving ☆31 Updated last month
- ☆89 Updated last week
- ODH integration with AI at the Edge use cases ☆12 Updated last year
- An intuitive, easy-to-use Python interface for batch resource requesting, access, job submission, and observation. Simplifying the develo… ☆32 Updated last week