Auto-tuning for vllm. Getting the best performance out of your LLM deployment (vllm+guidellm+optuna)
☆51Mar 17, 2026Updated last month
Alternatives and similar repositories for auto-tuning-vllm
Users who are interested in auto-tuning-vllm are comparing it to the libraries listed below.
- llm-d benchmark scripts and tooling☆55Apr 11, 2026Updated last week
- A Python-based tool, trained on the state-of-the-art Google Pegasus model, specializing in generating abstracts from given YouTube video …☆10Aug 6, 2023Updated 2 years ago
- ☆12Updated this week
- Community maintained hardware plugin for vLLM on AWS Neuron☆28Mar 20, 2026Updated 3 weeks ago
- AI21 Typescript SDK☆13Dec 18, 2025Updated 4 months ago
- A Golang library for analyzing k8s connectivity-configuration resources (a.k.a. network policies)☆19Feb 1, 2026Updated 2 months ago
- A performance testing and analysis automation framework☆15Updated this week
- Simplified model deployment on llm-d☆28Jul 2, 2025Updated 9 months ago
- Redis Labs Test Framework☆22Apr 9, 2026Updated last week
- Ansible roles for the Performance Co-Pilot toolkit☆22Apr 10, 2026Updated last week
- A place for large proposed changes for Valkey.☆21Oct 27, 2025Updated 5 months ago
- ☆41Updated this week
- PCP BCC PMDA☆17Oct 1, 2018Updated 7 years ago
- A side project where my friend and I try to generate synthetic data in Bangla from deepseek-r1, so that it can be used for model di…☆11Jun 28, 2025Updated 9 months ago
- A Rust implementation of the PCP instrumentation API☆35Jan 13, 2018Updated 8 years ago
- Skydive WebUI☆18Jan 7, 2023Updated 3 years ago
- Train and fine-tune text-to-speech models for Bengali and many other languages!☆18Apr 2, 2025Updated last year
- Ultra-fast audio super resolution custom node for ComfyUI, powered by the NovaSR model.☆30Feb 12, 2026Updated 2 months ago
- Tips for running linux containers (LXC) on ChromeOS via Crostini☆18Feb 14, 2022Updated 4 years ago
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs☆1,016Updated this week
- 💻 SETA: Scaling Environments for Terminal Agents - Environments☆126Feb 16, 2026Updated 2 months ago
- A local OSBS development environment☆10Jan 4, 2022Updated 4 years ago
- Data generation and training repository for SERA: Soft-Verified Efficient Repository Agents.☆138Mar 8, 2026Updated last month
- SnapDocs - A Modern, Open-Source Document Workspace☆25Sep 7, 2025Updated 7 months ago
- Practical Machine Learning with LightGBM and Python, published by Packt☆28Dec 15, 2025Updated 4 months ago
- Memory optimized Mixture of Experts☆75Jul 25, 2025Updated 8 months ago
- A Go implementation of the PCP instrumentation API☆36Jul 22, 2021Updated 4 years ago
- Development containers for triton and triton-cpu☆27Updated this week
- ⚙️ Lightweight & smart Bun & Browser configuration loader.☆15Updated this week
- [Deprecated] Vulnerability scanner for containers and images☆13Oct 26, 2015Updated 10 years ago
- Content for website and man pages☆40Mar 17, 2026Updated last month
- Fine-tuned Llama 3 models for context-based question answering in the Bengali language.☆18Oct 14, 2024Updated last year
- A collection of single-host tests for Atomic Host☆18Nov 18, 2021Updated 4 years ago
- A Partytown plugin for Fresh☆12Oct 10, 2023Updated 2 years ago
- A sub-RFC1928 SOCKS5 server implementation in Go with zero external dependencies.☆13Sep 5, 2023Updated 2 years ago
- ☆11Apr 9, 2026Updated last week
- caro: fast Rust CLI that turns natural‑language tasks into a safe POSIX command. Built for macOS (MLX/Metal) with a built‑in model; suppo…☆31Updated this week
- AI agent platform for building multi-agent systems with orchestration, memory, RAG, workflows, and enterprise observability.☆29Oct 27, 2025Updated 5 months ago
- Model explanation provides the ability to interpret the effect of the predictors on the composition of an individual score.☆13Jan 21, 2021Updated 5 years ago