Auto-tuning for vllm. Getting the best performance out of your LLM deployment (vllm+guidellm+optuna)
☆42Mar 2, 2026Updated last week
Alternatives and similar repositories for auto-tuning-vllm
Users that are interested in auto-tuning-vllm are comparing it to the libraries listed below
Sorting:
- llm-d benchmark scripts and tooling☆48Mar 2, 2026Updated last week
- SnapDocs - A Modern, Open-Source Document Workspace☆25Sep 7, 2025Updated 6 months ago
- Saurus CMS Community Edition☆26Aug 11, 2015Updated 10 years ago
- Protocol buffers and other common resources.☆13Mar 2, 2026Updated last week
- This project is based on the [LTX-Video](https://github.com/Lightricks/LTX-Video) algorithm of the diffusers and optimized and accelerate…☆13Dec 31, 2024Updated last year
- ☆11Feb 27, 2026Updated last week
- ☆54Aug 1, 2025Updated 7 months ago
- Find the idea for your next project/startup posted by people all over the world. Alternatively, post your idea over the platform and allo…☆11May 13, 2023Updated 2 years ago
- Ibexa Experience is a modern modular Digital Experience Platform (DXP) designed for customer-centric companies and organizations who want…☆10Updated this week
- 🚀 LLM inference optimization simulator, modeling compute-bound prefill and memory-bound decode phases.☆13Jul 12, 2025Updated 7 months ago
- Model explanation provides the ability to interpret the effect of the predictors on the composition of an individual score.☆13Jan 21, 2021Updated 5 years ago
- Vectorize HTML files and generate embeddings with structural and semantic expression (WIP)☆11Feb 16, 2023Updated 3 years ago
- redis module unit tests with python (deprecated) please see RLTest☆12Sep 8, 2019Updated 6 years ago
- OpenCopilot flows editor☆11Oct 31, 2023Updated 2 years ago
- Python script to generate geomantic divination charts☆10Nov 12, 2019Updated 6 years ago
- This project defines a json ontology standard describing a power consumption measure in a given software/hardware context, noticeably in …☆15Mar 2, 2026Updated last week
- The repo of the Doc2SoarGraph framework☆10Sep 17, 2024Updated last year
- A Partytown plugin for Fresh☆12Oct 10, 2023Updated 2 years ago
- Astrology app, with birth chart calculation based on your time and place of birth.☆12Aug 24, 2021Updated 4 years ago
- [ICML 2025] Efficiently Serving Large Multimodal Models Using EPD Disaggregation☆22May 29, 2025Updated 9 months ago
- ☆13Jan 7, 2025Updated last year
- Memory optimized Mixture of Experts☆74Jul 25, 2025Updated 7 months ago
- ANDROID APP to AUTO GENERATE SUBTITLE FILE and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any audio/vide…☆21May 5, 2024Updated last year
- Deprecated version of CSK, see new one here:☆14Feb 18, 2025Updated last year
- OpenAI compatible API for open source LLMs☆16Oct 30, 2023Updated 2 years ago
- A data preprocessor for the Quranic Treebank using neural networks. Divides longer verses into smaller chunks.☆12Jul 4, 2023Updated 2 years ago
- Pure Java Protobuf tools☆29Updated this week
- AutoML 2024: HPOD: Hyperparameter Optimization for Unsupervised Outlier Detection☆12Jul 12, 2024Updated last year
- ☆11Apr 21, 2020Updated 5 years ago
- ☆16Oct 27, 2025Updated 4 months ago
- Adds some mediawiki features to meteorpedia, {{}}, [[]], categories, tables, links, etc.☆17Feb 3, 2014Updated 12 years ago
- Multicoin is a branch of Bitcoin a crypto currency client with experimental and advanced features of multi currency and escrow and soon o…☆22Aug 25, 2021Updated 4 years ago
- DSPy Experiments☆10Aug 28, 2025Updated 6 months ago
- A JavaScript library for creating and editing videos in the browser.☆20Updated this week
- Enterprise Learner Portal☆16Updated this week
- This repository contains a ready-to-use boilerplate for quickly setting up and working with crewai. It provides essential configurations …☆11Sep 11, 2024Updated last year
- A statistical framework for graph anomaly detection.☆17Sep 23, 2018Updated 7 years ago
- 🚸 Introducing Lifetable, add the missing all-in-one community to the spreadsheet database ecology and so much more. Based on Next.js 14 …☆12May 14, 2024Updated last year
- A modern theme for MediaWiki, built on Bootystrap 3 and Skinny.☆12Oct 27, 2016Updated 9 years ago