openshift-psap / llm-load-testLinks
☆51Updated 4 months ago
Alternatives and similar repositories for llm-load-test
Users that are interested in llm-load-test are comparing it to the libraries listed below
Sorting:
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs☆730Updated last week
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.☆140Updated last week
- ☆20Updated last week
- Resources, demos, recipes,... to work with LLMs on OpenShift with OpenShift AI or Open Data Hub.☆142Updated 3 months ago
- Helm charts for llm-d☆50Updated 4 months ago
- Repository to deploy LLMs with Multi-GPUs in distributed Kubernetes nodes☆28Updated 11 months ago
- Model Registry provides a single pane of glass for ML model developers to index and manage models, versions, and ML artifacts metadata. I…☆157Updated this week
- This project makes running the InstructLab large language model (LLM) fine-tuning process easy and flexible on OpenShift☆27Updated 3 months ago
- Achieve state of the art inference performance with modern accelerators on Kubernetes☆2,120Updated last week
- llm-d helm charts and deployment examples☆47Updated 2 weeks ago
- Repository to demo GPU Sharing with Time Slicing, MPS, MIG and others☆54Updated last year
- Collection of demos for building Llama Stack based apps on OpenShift☆56Updated last month
- Distributed Model Serving Framework☆179Updated 2 months ago
- Artifacts for the Distributed Workloads stack as part of ODH☆33Updated last week
- llm-d benchmark scripts and tooling☆36Updated this week
- Controller for ModelMesh☆242Updated 5 months ago
- GenAI inference performance benchmarking tool☆134Updated last week
- Taxonomy tree that will allow you to create models tuned with your data☆287Updated 3 months ago
- ☆268Updated last week
- Kubernetes enhancements for Network Topology Aware Gang Scheduling & Autoscaling☆119Updated this week
- Improve ROSA customer experience (and customer retention) by leveraging foundation models to do “gpt-chat” style search of Red Hat custo…☆28Updated last year
- Test Orchestrator for Performance and Scalability of AI pLatforms☆16Updated this week
- Examples for building and running LLM services and applications locally with Podman☆184Updated 4 months ago
- Red Hat Enterprise Linux AI -- Developer Preview☆168Updated last year
- Models as a Service☆72Updated last month
- AI-on-OpenShift website source code☆100Updated this week
- OME is a Kubernetes operator for enterprise-grade management and serving of Large Language Models (LLMs)☆324Updated this week
- InstaSlice Operator facilitates slicing of accelerators using stable APIs☆48Updated last week
- Containerization and cloud native suite for OPEA☆72Updated 2 months ago
- A collection of YAML files, Helm Charts, Operator code, and guides to act as an example reference implementation for NVIDIA NIM deploymen…☆216Updated last week