openshift-psap / llm-load-test
☆40 Updated last month
Alternatives and similar repositories for llm-load-test
Users who are interested in llm-load-test are comparing it to the libraries listed below.
- Artifacts for the Distributed Workloads stack as part of ODH ☆30 Updated last week
- ☆19 Updated 3 weeks ago
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment. ☆103 Updated this week
- InstaSlice Operator facilitates slicing of accelerators using stable APIs ☆35 Updated this week
- Model Registry provides a single pane of glass for ML model developers to index and manage models, versions, and ML artifacts metadata. I… ☆125 Updated this week
- This project makes running the InstructLab large language model (LLM) fine-tuning process easy and flexible on OpenShift ☆25 Updated this week
- Repository to demo GPU Sharing with Time Slicing, MPS, MIG and others ☆40 Updated 6 months ago
- ☆16 Updated this week
- Examples for building and running LLM services and applications locally with Podman ☆153 Updated this week
- Models as a Service ☆55 Updated 2 months ago
- Resources, demos, recipes,... to work with LLMs on OpenShift with OpenShift AI or Open Data Hub. ☆121 Updated 2 weeks ago
- An intuitive, easy-to-use python interface for batch resource requesting, access, job submission, and observation. Simplifying the develo… ☆27 Updated last week
- Test Orchestrator for Performance and Scalability of AI pLatforms ☆14 Updated this week
- Repository to deploy LLMs with Multi-GPUs in distributed Kubernetes nodes ☆22 Updated 5 months ago
- Distributed Model Serving Framework ☆165 Updated this week
- ☆23 Updated 3 weeks ago
- Containerization and cloud native suite for OPEA ☆61 Updated this week
- Controller for ModelMesh ☆229 Updated this week
- Caikit is an AI toolkit that enables users to manage models through a set of developer friendly APIs. ☆105 Updated 7 months ago
- AI-on-OpenShift website source code ☆81 Updated 2 weeks ago
- Open Data Hub operator to manage ODH component integrations ☆77 Updated this week
- ☆207 Updated last week
- Cloud Native Benchmarking of Foundation Models ☆33 Updated 6 months ago
- IBM development fork of https://github.com/huggingface/text-generation-inference ☆60 Updated last week
- Source for the "Parasol Insurance" Lab ☆67 Updated 2 weeks ago
- Collection of demos for building Llama Stack based apps on OpenShift ☆24 Updated this week
- ☆15 Updated 11 months ago
- Improve ROSA customer experience (and customer retention) by leveraging foundation models to do “gpt-chat” style search of Red Hat custo… ☆27 Updated last year
- Gateway API Inference Extension ☆272 Updated this week
- A tool to detect infrastructure issues on cloud native AI systems ☆35 Updated last week