Auto-tuning for vllm. Getting the best performance out of your LLM deployment (vllm+guidellm+optuna)
☆57Jun 12, 2026Updated this week
Alternatives and similar repositories for auto-tuning-vllm
Users that are interested in auto-tuning-vllm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- llm-d benchmark scripts and tooling☆63Updated this week
- A Python-based tool, trained on the state-of-the-art Google Pegasus model, specializing in generating abstracts from given YouTube video …☆10Aug 6, 2023Updated 2 years ago
- ☆12May 26, 2026Updated 3 weeks ago
- Community maintained hardware plugin for vLLM on AWS Neuron☆31May 28, 2026Updated 3 weeks ago
- AI21 Typescript SDK☆13Dec 18, 2025Updated 6 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆58Aug 1, 2025Updated 10 months ago
- MicroShift in Container☆55Jun 5, 2026Updated 2 weeks ago
- AI21's Jamba models tokenizers☆33Oct 27, 2025Updated 7 months ago
- An ansible role which configures metrics collection.☆17Updated this week
- Simplified model deployment on llm-d☆29Jul 2, 2025Updated 11 months ago
- Redis Labs Test Framework☆22May 29, 2026Updated 2 weeks ago
- This project defines a json ontology standard describing a power consumption measure in a given software/hardware context, noticeably in …☆19Jun 3, 2026Updated 2 weeks ago
- Skydive WebUI☆18Jan 7, 2023Updated 3 years ago
- Pure Java Protobuf tools☆34Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Performance dashboards from the Perf & Scale team☆20May 21, 2026Updated 3 weeks ago
- Tips for running linux containers (LXC) on ChromeOS via Crostini☆18Feb 14, 2022Updated 4 years ago
- A benchmarking tool to evaluate Knative performance☆39Sep 15, 2023Updated 2 years ago
- AI21 Python SDK☆70Jan 28, 2026Updated 4 months ago
- Scan any running MCP server to produce an actionable security report of vulnerabilities and misconfigurations.☆21Nov 17, 2025Updated 7 months ago
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs☆1,254Updated this week
- Experimental extraction/refactoring of the Operator SDK's ansible operator plugin☆13May 20, 2026Updated 3 weeks ago
- DBus daemon for doing package action with the dnf package manager☆11Dec 20, 2023Updated 2 years ago
- Cobra is a realtime messaging server using Python3, WebSockets and Redis☆33Aug 1, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- SnapDocs - A Modern, Open-Source Document Workspace☆25Sep 7, 2025Updated 9 months ago
- ☆10Mar 28, 2018Updated 8 years ago
- A Go implementation of the PCP instrumentation API☆36Jul 22, 2021Updated 4 years ago
- Linux System Roles website☆29May 12, 2026Updated last month
- ☆17Jun 11, 2026Updated last week
- Configuration to use gpg smartcards for ssh authentication☆17Nov 22, 2020Updated 5 years ago
- Development containers for triton and triton-cpu☆28Jun 3, 2026Updated 2 weeks ago
- ⚙️ Lightweight & smart Bun & Browser configuration loader.☆16Updated this week
- Default parameter values for Java via annotation processing☆36Jan 9, 2026Updated 5 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [Deprecated] Vulnerability scanner for containers and images☆13Oct 26, 2015Updated 10 years ago
- International System of Units☆26May 18, 2026Updated last month
- DEPRECATED: Python client for the Google TV Pairing and Anymote protocols.☆22May 5, 2020Updated 6 years ago
- ☆10Jun 3, 2026Updated 2 weeks ago
- AI agent platform for building multi-agent systems with orchestration, memory, RAG, workflows, and enterprise observability.☆41Oct 27, 2025Updated 7 months ago
- If you need a bicycle instead of a satellite☆15Mar 26, 2019Updated 7 years ago
- Model explanation provides the ability to interpret the effect of the predictors on the composition of an individual score.☆13Jan 21, 2021Updated 5 years ago