An intelligent tuner for vLLM that automatically monitors GPU metrics, uses Bayesian optimization to tune parameters
☆60Mar 12, 2026Updated last week
Alternatives and similar repositories for vllm-tuner
Users that are interested in vllm-tuner are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- hello-mcp is a tour and guide for beginners to Claude Desktop MCP Config Manager, designed to help them understand MCP (Model Context Pro…☆13Mar 29, 2025Updated 11 months ago
- Apple's Cut Cross Entropy☆30Jan 19, 2025Updated last year
- coded with and corrected by Google Anti-Gravity☆13Nov 23, 2025Updated 4 months ago
- Load embeddings and featurize your sentences.☆31Oct 23, 2024Updated last year
- ☆27Feb 24, 2024Updated 2 years ago
- A collection of agent-optimized LangChain, LangGraph and LangSmith skills for AI coding assistants.☆85Feb 17, 2026Updated last month
- ☆12Aug 27, 2024Updated last year
- ☆12Oct 28, 2019Updated 6 years ago
- hwpxlib 패키지 python에서 쉽게 사용 할수 있게 만든 github repo 입니다.☆36Mar 29, 2025Updated 11 months ago
- Easily stand up Keycloak and SPIRE for testing AI Agents☆29Sep 18, 2025Updated 6 months ago
- Quantnet: SFE quantlets☆11Oct 27, 2025Updated 4 months ago
- CUDA Open Source miner project, for most nvidia cards☆31Nov 30, 2018Updated 7 years ago
- Code showing how to use a model based on the ML model base class.☆10Sep 30, 2022Updated 3 years ago
- Knwler is a lightweight, single-file Python tool that extracts structured knowledge graphs from documents using AI. Feed it a PDF or text…☆61Updated this week
- A lightweight adjustment tool for smoothing token probabilities in the Qwen models to encourage balanced multilingual generation.☆104Jul 9, 2025Updated 8 months ago
- Repo for Getting Started☆13Updated this week
- ☆12Jun 10, 2024Updated last year
- wide-dhcpv6 for Android and Chrome OS because Google won't do it.☆13Jan 3, 2019Updated 7 years ago
- An MCP server implementation providing a standardized interface for LLMs to interact with the Atla API.☆17Jul 21, 2025Updated 8 months ago
- ☆15Feb 5, 2020Updated 6 years ago
- convert OVF vm packages to smartos compatible images☆29Feb 4, 2016Updated 10 years ago
- NVIDIA Fleet Command is a hybrid-cloud platform for securely and remotely deploying, managing, and scaling AI across dozens or up to thou…☆14Jul 20, 2022Updated 3 years ago
- HPy porting of https://github.com/esnme/ultrajson☆16May 8, 2023Updated 2 years ago
- Roboflow's inference server to analyze video streams. This project extracts insights from video frames at defined intervals and generates…☆13May 21, 2024Updated last year
- ☆11Sep 29, 2014Updated 11 years ago
- ISS Tracker for the Cardputer Adv☆36Jan 19, 2026Updated 2 months ago
- [RA-L] SHeRLoc: Synchronized Heterogeneous Radar Place Recognition for Cross-Modal Localization☆28Nov 24, 2025Updated 4 months ago
- [CVPR'26 Findings] Source code for "RADSeg Unleashing Parameter and Compute Efficient Zero-Shot Open-Vocabulary Segmentation Using Agglom…☆36Mar 7, 2026Updated 2 weeks ago
- Multi-site Network Emulation, Kubeadm-installed Kubernetes, NVMe over Fabrics☆18Feb 8, 2021Updated 5 years ago
- ☆14Mar 28, 2014Updated 11 years ago
- Terminal UI for GCP (tgcp) - A terminal-based GCP resource viewer and manager☆38Mar 17, 2026Updated last week
- A simple reddit client written as a vue component.☆18Dec 12, 2025Updated 3 months ago
- Ace-Step Dataset Generator☆23Sep 27, 2025Updated 5 months ago
- 🍨 Gelato — From Data Curation to Reinforcement Learning: Building a Strong Grounding Model for Computer-Use Agents☆39Dec 22, 2025Updated 3 months ago
- An eternal dialogue between AI models across versions. Started by Claude Opus 4 with 50 minutes to create a legacy.☆13Jun 2, 2025Updated 9 months ago
- ☆11Jan 31, 2015Updated 11 years ago
- This repository contains the official implementation of the paper "LandSegmenter: Towards a Flexible Foundation Model for Land Use and La…☆27Dec 8, 2025Updated 3 months ago
- ☆55Mar 5, 2026Updated 2 weeks ago
- ☆12Jun 17, 2023Updated 2 years ago