openshift-psap / llm-load-test
☆40Updated last month
Alternatives and similar repositories for llm-load-test:
Users that are interested in llm-load-test are comparing it to the libraries listed below
- Repository to demo GPU Sharing with Time Slicing, MPS, MIG and others☆36Updated 6 months ago
- This project makes running the InstructLab large language model (LLM) fine-tuning process easy and flexible on OpenShift☆24Updated last week
- ☆117Updated this week
- Artifacts for the Distributed Workloads stack as part of ODH☆29Updated last week
- InstaSlice Operator facilitates slicing of accelerators using stable APIs☆33Updated this week
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.☆92Updated this week
- Resources, demos, recipes,... to work with LLMs on OpenShift with OpenShift AI or Open Data Hub.☆118Updated 3 weeks ago
- Repository to deploy LLMs with Multi-GPUs in distributed Kubernetes nodes☆22Updated 4 months ago
- AI-on-OpenShift website source code☆81Updated 2 weeks ago
- Containerization and cloud native suite for OPEA☆54Updated this week
- ☆16Updated last week
- ☆19Updated last week
- Models as a Service☆51Updated last month
- An intuitive, easy-to-use python interface for batch resource requesting, access, job submission, and observation. Simplifying the develo…☆26Updated last week
- ☆23Updated last week
- Examples for building and running LLM services and applications locally with Podman☆149Updated this week
- Improve ROSA customer experience (and customer retention) by leveraging foundation models to do “gpt-chat” style search of Red Hat custo…☆27Updated last year
- The AI Accelerator is a template project for setting up Red Hat OpenShift AI using GitOps☆49Updated last week
- Open Data Hub operator to manage ODH component integrations☆76Updated this week
- Source for "Streamlining insurance claims with OpenShift AI" Lab☆39Updated 8 months ago
- ODH integration with AI at the Edge usecases☆12Updated 5 months ago
- Test Orchestrator for Performance and Scalability of AI pLatforms☆14Updated this week
- Model Server for Kepler☆27Updated 2 weeks ago
- Source for the "Parasol Insurance" Lab☆65Updated 3 months ago
- Controller for ModelMesh☆228Updated last month
- Collection of demos for building Llama Stack based apps on OpenShift☆20Updated last week
- A repository for Open Data Hub Kustomize manifests extending upstream Kubeflow manifests☆62Updated last year
- GenAI inference performance benchmarking tool☆39Updated 3 weeks ago
- Distributed Model Serving Framework☆163Updated last month
- Summarize Financial Data with a RAG workflow using Weaviate, Red Hat OpenShift and Red Hat Build of Apache Camel.☆20Updated last month