rh-aiservices-bu / gpu-partitioning-guideLinks
Repository to demo GPU Sharing with Time Slicing, MPS, MIG and others
☆48Updated 11 months ago
Alternatives and similar repositories for gpu-partitioning-guide
Users that are interested in gpu-partitioning-guide are comparing it to the libraries listed below
Sorting:
- InstaSlice Operator facilitates slicing of accelerators using stable APIs☆45Updated this week
- Resources, demos, recipes,... to work with LLMs on OpenShift with OpenShift AI or Open Data Hub.☆136Updated last month
- This project makes running the InstructLab large language model (LLM) fine-tuning process easy and flexible on OpenShift☆27Updated last month
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.☆129Updated last week
- open-cluster-management governance material.☆64Updated 3 weeks ago
- Open Data Hub operator to manage ODH component integrations☆86Updated this week
- ☆19Updated this week
- llm-d benchmark scripts and tooling☆30Updated last week
- AI-on-OpenShift website source code☆94Updated 2 months ago
- Example DRA driver that developers can fork and modify to get them started writing their own.☆94Updated 3 weeks ago
- Deploy Development Builds of Open Cluster Management (OCM) on RedHat Openshift Container Platform☆166Updated 3 weeks ago
- A global load balancer operator for OpenShift☆54Updated last year
- ☆16Updated this week
- Repository to deploy LLMs with Multi-GPUs in distributed Kubernetes nodes☆27Updated 9 months ago
- GenAI inference performance benchmarking tool☆99Updated last week
- Various custom Workbenches and Runtimes for Open Data Hub and OpenShift Data Science☆48Updated last year
- Red Hat Advanced Cluster Management for Kubernetes documentation☆75Updated this week
- InstaSlice facilitates the use of Dynamic Resource Allocation (DRA) on Kubernetes clusters for GPU sharing☆30Updated 10 months ago
- DPTP Tooling☆46Updated this week
- ☆88Updated this week
- ☆26Updated 2 years ago
- Kubernetes integration for OVN☆86Updated last week
- tektoncd-pipeline operator for Kubernetes to manage installation, updation and uninstallation of tekton-cd pipelines.☆54Updated 4 years ago
- OADP Operator☆86Updated this week
- Lifecycle manager for internet-disconnected OpenShift environments☆106Updated this week
- ☆20Updated 8 months ago
- Operator for NooBaa - object data service for hybrid and multi cloud environments☆118Updated last week
- WG Serving☆30Updated last week
- CLI for the Red Hat OpenShift Cluster Manager☆86Updated this week
- Collection of demos for building Llama Stack based apps on OpenShift☆55Updated 2 weeks ago