rh-aiservices-bu / gpu-partitioning-guideLinks
Repository to demo GPU Sharing with Time Slicing, MPS, MIG and others
☆55Updated last year
Alternatives and similar repositories for gpu-partitioning-guide
Users that are interested in gpu-partitioning-guide are comparing it to the libraries listed below
Sorting:
- InstaSlice Operator facilitates slicing of accelerators using stable APIs☆48Updated this week
- llm-d helm charts and deployment examples☆48Updated 2 weeks ago
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.☆140Updated 2 weeks ago
- open-cluster-management governance material.☆64Updated 2 months ago
- GenAI inference performance benchmarking tool☆137Updated last week
- Inference scheduler for llm-d☆111Updated last week
- Resources, demos, recipes,... to work with LLMs on OpenShift with OpenShift AI or Open Data Hub.☆144Updated 3 months ago
- llm-d benchmark scripts and tooling☆40Updated last week
- ☆20Updated last week
- Open Data Hub operator to manage ODH component integrations☆90Updated this week
- Holistic job manager on Kubernetes☆115Updated last year
- Simplified model deployment on llm-d☆28Updated 5 months ago
- Example DRA driver that developers can fork and modify to get them started writing their own.☆110Updated 2 months ago
- This project makes running the InstructLab large language model (LLM) fine-tuning process easy and flexible on OpenShift☆27Updated 4 months ago
- Test Orchestrator for Performance and Scalability of AI pLatforms☆16Updated last week
- Kubernetes integration for OVN☆91Updated this week
- Sherlock - a set of script to assess database performance on OCP/k8s☆31Updated last year
- Helm charts for llm-d☆50Updated 5 months ago
- DPTP Tooling☆45Updated last week
- API driven OpenShift cluster provisioning and management☆268Updated this week
- A global load balancer operator for OpenShift☆56Updated last year
- Cloud Native Benchmarking of Foundation Models☆44Updated 5 months ago
- Cloud Native Artifacial Intelligence Model Format Specification☆157Updated 2 weeks ago
- Red Hat Advanced Cluster Management for Kubernetes documentation☆75Updated last week
- Deploy Development Builds of Open Cluster Management (OCM) on RedHat Openshift Container Platform☆170Updated 2 months ago
- InstaSlice facilitates the use of Dynamic Resource Allocation (DRA) on Kubernetes clusters for GPU sharing☆30Updated last year
- Enabling Kubernetes to make pod placement decisions with platform intelligence.☆176Updated 11 months ago
- Model Registry provides a single pane of glass for ML model developers to index and manage models, versions, and ML artifacts metadata. I…☆158Updated last week
- JobSet: a k8s native API for distributed ML training and HPC workloads☆291Updated last week
- Operator for RHOCS☆92Updated 2 weeks ago