Repository to deploy LLMs with Multi-GPUs in distributed Kubernetes nodes
☆30Dec 10, 2024Updated last year
Alternatives and similar repositories for multi-gpu-llms
Users interested in multi-gpu-llms are comparing it to the repositories listed below
- ☆20Dec 12, 2025Updated 2 months ago
- A lab/workshop for Red Hat OpenShift Data Science using simple fraud detection as an example workload☆21Jul 18, 2024Updated last year
- ☆24Jan 12, 2026Updated last month
- AI-on-OpenShift website source code☆105Dec 9, 2025Updated 2 months ago
- ☆17Jan 13, 2026Updated last month
- ODH Tools & Extensions Companion☆29Feb 8, 2026Updated 3 weeks ago
- Artifacts for the Distributed Workloads stack as part of ODH☆33Feb 26, 2026Updated last week
- Source for "Streamlining insurance claims with OpenShift AI" Lab☆41Aug 20, 2024Updated last year
- Collection of demos for building Llama Stack based apps on OpenShift☆61Feb 26, 2026Updated last week
- OpenShift Migration UI☆11Jan 27, 2026Updated last month
- Repository to demo GPU Sharing with Time Slicing, MPS, MIG and others☆59Oct 17, 2024Updated last year
- Vanilla configurations for a RHOAI instance to deploy a GenAI POC. This will deploy a vector database (Milvus), a GenAI interface (Anythi…☆14Apr 18, 2025Updated 10 months ago
- An intuitive, easy-to-use python interface for batch resource requesting, access, job submission, and observation. Simplifying the develo…☆34Feb 27, 2026Updated last week
- The AI Accelerator is a template project for setting up Red Hat OpenShift AI using GitOps☆65Feb 25, 2026Updated last week
- Source for the "Parasol Insurance" Lab☆100Nov 6, 2025Updated 4 months ago
- ☆20Nov 20, 2025Updated 3 months ago
- ☆55Apr 15, 2024Updated last year
- ☆17Jun 3, 2019Updated 6 years ago
- This repository contains the code developed for the talk "AI at the Edge with MicroShift" developed by Miguel Angel Ajo and Ricardo Norie…☆25May 8, 2023Updated 2 years ago
- LiteMaaS is a proof-of-concept application for managing LLM subscriptions, API keys, and usage tracking. It seamlessly integrates with Li…☆49Updated this week
- WG Serving☆34Dec 15, 2025Updated 2 months ago
- ☆24Jan 22, 2018Updated 8 years ago
- OpenShift Pipelines Examples based on Tekton☆31May 12, 2022Updated 3 years ago
- Setting up an OpenShift cluster using Kustomize and ArgoCD☆34May 4, 2022Updated 3 years ago
- ☆11Sep 21, 2022Updated 3 years ago
- This is the Hobbyist Guide to Installing and Configuring RHOAI for customers. Bring your towel.☆14Apr 15, 2025Updated 10 months ago
- This is AutoGenDemo☆11Mar 12, 2024Updated last year
- InstaSlice Operator facilitates slicing of accelerators using stable APIs☆50Updated this week
- Kubernetes service bindings utility☆15Feb 14, 2026Updated 2 weeks ago
- HashiCorp Vault configuration provider implementation for Microsoft.Extensions.Configuration.☆11Jun 23, 2022Updated 3 years ago
- Red Hat Enterprise Linux AI -- Developer Preview☆171Jul 9, 2024Updated last year
- OpenShift and HashiCorp Vault Integration☆39Dec 9, 2025Updated 2 months ago
- ☆10Mar 20, 2020Updated 5 years ago
- R package for building flexible workflows☆13Oct 16, 2019Updated 6 years ago
- This repo contains the follow-along student instructions for the lab. https://rhoai-mlops.github.io/lab-instructions/☆14Feb 26, 2026Updated last week
- ☆11Apr 29, 2024Updated last year
- A Grafana Dockerfile for OpenShift☆10Mar 13, 2020Updated 5 years ago
- Tiny evaluation of leading LLMs on competitive programming problems☆14Nov 28, 2024Updated last year
- Official repository for Activation-Informed Merging (AIM) of Large Language Models☆21Feb 10, 2025Updated last year