aigw-project/aigw

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/aigw-project/aigw)

aigw-project / aigw

The Intelligent Inference Scheduler for Large-scale Inference Services.

☆68

Alternatives and similar repositories for aigw

Users that are interested in aigw are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

higress-group / higress-ops-mcp-server
View on GitHub
A Model Context Protocol (MCP) server implementation that enables comprehensive configuration and management of Higress.
☆22Mar 29, 2025Updated last year
higress-group / mock-server
View on GitHub
An LLM Mock Server that supports simulating the protocols of all LLM providers.
☆15Updated this week
stackitcloud / kubectl-get-all
View on GitHub
Like `kubectl get all`, but get really all resources
☆34Jun 30, 2026Updated last week
skyzh / pebble
View on GitHub
RocksDB/LevelDB inspired key-value database in Go
☆10Nov 3, 2020Updated 5 years ago
pacoxu / developers-conferences-agenda
View on GitHub
中国开发者活动日程（关注点：开源、开发者、云原生）
☆26Jun 26, 2026Updated last week
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
hhftechnology / bandwidthlimiter
View on GitHub
bandwidth limiting middleware plugin for Traefik that provides fine-grained control over data transfer rates. This plugin supports per-ba…
☆15Apr 20, 2026Updated 2 months ago
shzshi / TestEnvironmentBooking
View on GitHub
Test Environment Booking tool
☆14Nov 16, 2020Updated 5 years ago
aibrix / PrisKV
View on GitHub
High Performance KV Cache Store for LLM
☆56May 20, 2026Updated last month
knoway-dev / knoway
View on GitHub
An Envoy inspired, ultimate LLM-first gateway for LLM serving and downstream application developers and enterprises
☆27Apr 24, 2025Updated last year
koupleless / module-controller
View on GitHub
Koupleless serving system.
☆12Oct 11, 2025Updated 8 months ago
leaningtech / cheerp-libs
View on GitHub
Helper libraries for Cheerp
☆29Jun 9, 2026Updated last month
kn71026 / awesome-programming-books-1
View on GitHub
📚 经典技术书籍 PDF 文件，持续更新...
☆13Jan 21, 2019Updated 7 years ago
mosn / htnn
View on GitHub
HTNN: A cloud-native gateway offering seamless extensibility for Istio and Envoy, in a native way by Go.
☆124Jul 2, 2026Updated last week
LinkinStars / daily-cards
View on GitHub
☆12Jan 31, 2026Updated 5 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
DremyGit / dremy-blog
View on GitHub
Dremy's 博客，React同构Web App
☆11Sep 6, 2017Updated 8 years ago
ome-projects / ome
View on GitHub
Open Model Engine (OME) — Kubernetes operator for LLM serving, GPU scheduling, and model lifecycle management. Works with SGLang, vLLM, T…
☆478Updated this week
kanade2010 / TimerQueue
View on GitHub
A TimerQueue Based on Poll
☆14May 13, 2019Updated 7 years ago
warlo / rocksdb-statistics
View on GitHub
db_bench log parser
☆18Apr 6, 2023Updated 3 years ago
hongliang5316 / lua-resty-ftpclient
View on GitHub
lua-resty-ftpclient - Lua ftp client driver for the ngx_lua based on the cosocket API
☆15Feb 26, 2023Updated 3 years ago
openresty / openresty-survey
View on GitHub
OpenResty Web App for OpenResty User Survey
☆87Dec 18, 2016Updated 9 years ago
xLLM-AI / xllm-service
View on GitHub
A flexible serving framework that delivers efficient and fault-tolerant LLM inference for clustered deployments.
☆94Jun 30, 2026Updated last week
chartbeat-labs / parselmouth
View on GitHub
An object-oriented interface for abstracting away the ugly parts of ad server APIs
☆14Apr 8, 2016Updated 10 years ago
sgl-project / rbg
View on GitHub
A workload for deploying LLM inference services on Kubernetes
☆254Updated this week
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
llm-d / llm-d-model-service
View on GitHub
Simplified model deployment on llm-d
☆29Jul 2, 2025Updated last year
higress-group / proxy-wasm-go-sdk
View on GitHub
WebAssembly for Proxies (Go SDK)
☆20May 25, 2026Updated last month
stormrabbit / angular4-gundam-meister
View on GitHub
Angular4 练习
☆14Jun 20, 2017Updated 9 years ago
ericaloha / L2SM
View on GitHub
L2SM (Log-assisted LSM tree) is built based on LevelDB, which is a prototype propesed to adopt a SST log structure to isolate selected ke…
☆11Jan 17, 2022Updated 4 years ago
davelet / git-intelligence-message
View on GitHub
An advanced Git commit message generation utility designed to automatically craft high-quality commit messages with precision and sophist…
☆16May 20, 2026Updated last month
nacos-group / nacos-mcp-wrapper-python
View on GitHub
Nacos mcp wrapper Python sdk
☆27Dec 23, 2025Updated 6 months ago
user-ZJ / machine_learning_resource
View on GitHub
机器学习资源
☆16May 12, 2020Updated 6 years ago
kubernetes-sigs / wg-serving
View on GitHub
WG Serving
☆38Mar 24, 2026Updated 3 months ago
api7 / apisix-mesh-agent
View on GitHub
☆79Aug 2, 2021Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
taosdata / vscode-tdengine
View on GitHub
visual studio code extension for TDengine
☆10Mar 21, 2023Updated 3 years ago
scPointer / maturin
View on GitHub
☆13Jun 6, 2024Updated 2 years ago
SemiAnalysisAI / InferenceX
View on GitHub
Open Source Continuous Inference Benchmark Research Platform — Kimi K2.7-Code, MiniMax M3, DeepSeekv4, GLM5 - GB200 NVL72 vs MI355X vs B2…
☆1,219Updated this week
skyzh / skyzh-site
View on GitHub
Alex Chi's personal site
☆23Aug 17, 2025Updated 10 months ago
copilot-io / runtime-copilot
View on GitHub
The main purpose of runtime copilot is to assist with node runtime management tasks such as configuring registries, upgrading versions, i…
☆13May 16, 2023Updated 3 years ago
bella-top / claude-code-with-bella
View on GitHub
Bella Openapi 实现了Claude Code依赖的 /v1/messsages 接口。所有在Bella-Openapi中接入的LLM协议均可使用Claude Code，不仅仅支持Claude系列模型，同时支持了Openai全系列、Gemini、DeepSeek、…
☆17Nov 24, 2025Updated 7 months ago
zwwhdls / csibuilder
View on GitHub
csibuilder - SDK for building CSI Driver
☆36Nov 22, 2023Updated 2 years ago