openshift-psap / llm-load-test
☆49 · Updated last month
Alternatives and similar repositories for llm-load-test
Users who are interested in llm-load-test are comparing it to the libraries listed below.
- llm-d enables high-performance distributed LLM inference on Kubernetes ☆1,755 · Updated this week
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs ☆577 · Updated this week
- ☆19 · Updated this week
- Helm charts for llm-d ☆50 · Updated last month
- This project makes running the InstructLab large language model (LLM) fine-tuning process easy and flexible on OpenShift ☆27 · Updated 3 weeks ago
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment. ☆125 · Updated last week
- Taxonomy tree that will allow you to create models tuned with your data ☆281 · Updated last week
- Resources, demos, recipes,... to work with LLMs on OpenShift with OpenShift AI or Open Data Hub. ☆136 · Updated 2 weeks ago
- Examples for building and running LLM services and applications locally with Podman ☆176 · Updated last month
- A collection of YAML files, Helm Charts, Operator code, and guides to act as an example reference implementation for NVIDIA NIM deploymen… ☆191 · Updated last week
- Repository to deploy LLMs with Multi-GPUs in distributed Kubernetes nodes ☆27 · Updated 9 months ago
- Improve ROSA customer experience (and customer retention) by leveraging foundation models to do “gpt-chat” style search of Red Hat custo… ☆27 · Updated last year
- Kubernetes enhancements for Network Topology Aware Gang Scheduling & Autoscaling ☆46 · Updated this week
- ☆241 · Updated last week
- Artifacts for the Distributed Workloads stack as part of ODH ☆33 · Updated this week
- Red Hat Enterprise Linux AI -- Developer Preview ☆167 · Updated last year
- Model Registry provides a single pane of glass for ML model developers to index and manage models, versions, and ML artifacts metadata. I… ☆149 · Updated this week
- AI-on-OpenShift website source code ☆92 · Updated last month
- Distributed Model Serving Framework ☆177 · Updated this week
- Test Orchestrator for Performance and Scalability of AI pLatforms ☆15 · Updated last week
- Collection of demos for building Llama Stack based apps on OpenShift ☆55 · Updated 3 weeks ago
- InstaSlice Operator facilitates slicing of accelerators using stable APIs ☆44 · Updated this week
- Containerization and cloud native suite for OPEA ☆70 · Updated 3 weeks ago
- GenAI inference performance benchmarking tool ☆95 · Updated this week
- Models as a Service ☆68 · Updated this week
- ☆16 · Updated last week
- Caikit is an AI toolkit that enables users to manage models through a set of developer friendly APIs. ☆107 · Updated 2 months ago
- Controller for ModelMesh ☆238 · Updated 3 months ago
- llm-d benchmark scripts and tooling ☆25 · Updated this week
- IBM development fork of https://github.com/huggingface/text-generation-inference ☆61 · Updated this week