openshift-psap / llm-load-test
☆47 Updated last week
Alternatives and similar repositories for llm-load-test
Users who are interested in llm-load-test are comparing it to the repositories listed below.
- llm-d is a Kubernetes-native high-performance distributed LLM inference framework ☆1,488 Updated last week
- ☆19 Updated this week
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment. ☆120 Updated this week
- Taxonomy tree that will allow you to create models tuned with your data ☆274 Updated last week
- Repository to deploy LLMs with Multi-GPUs in distributed Kubernetes nodes ☆25 Updated 7 months ago
- Artifacts for the Distributed Workloads stack as part of ODH ☆32 Updated this week
- This project makes running the InstructLab large language model (LLM) fine-tuning process easy and flexible on OpenShift ☆26 Updated last month
- Model Registry provides a single pane of glass for ML model developers to index and manage models, versions, and ML artifacts metadata. I… ☆138 Updated last week
- Caikit is an AI toolkit that enables users to manage models through a set of developer friendly APIs. ☆106 Updated last month
- Resources, demos, recipes,... to work with LLMs on OpenShift with OpenShift AI or Open Data Hub. ☆132 Updated last month
- Helm charts for llm-d ☆51 Updated 2 weeks ago
- AI-on-OpenShift website source code ☆91 Updated last week
- Improve ROSA customer experience (and customer retention) by leveraging foundation models to do “gpt-chat” style search of Red Hat custo… ☆27 Updated last year
- Collection of demos for building Llama Stack based apps on OpenShift ☆51 Updated last week
- GenAI inference performance benchmarking tool ☆71 Updated this week
- Distributed Model Serving Framework ☆174 Updated 2 months ago
- InstaSlice Operator facilitates slicing of accelerators using stable APIs ☆41 Updated last week
- Examples for building and running LLM services and applications locally with Podman ☆170 Updated last week
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs ☆461 Updated last week
- Gateway API Inference Extension ☆423 Updated this week
- Models as a Service ☆67 Updated last week
- An intuitive, easy-to-use python interface for batch resource requesting, access, job submission, and observation. Simplifying the develo… ☆29 Updated last week
- ☆232 Updated this week
- ☆16 Updated 3 weeks ago
- llm-d benchmark scripts and tooling ☆21 Updated this week
- Controller for ModelMesh ☆237 Updated last month
- Repository to demo GPU Sharing with Time Slicing, MPS, MIG and others ☆47 Updated 9 months ago
- Test Orchestrator for Performance and Scalability of AI pLatforms ☆15 Updated this week
- Source for the "Parasol Insurance" Lab ☆81 Updated last week
- NVIDIA DRA Driver for GPUs ☆402 Updated last week