The Intelligent Inference Scheduler for Large-scale Inference Services.
☆66Feb 12, 2026Updated 2 months ago
Alternatives and similar repositories for aigw
Users that are interested in aigw are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Model Context Protocol (MCP) server implementation that enables comprehensive configuration and management of Higress.☆22Mar 29, 2025Updated last year
- An LLM Mock Server that supports simulating the protocols of all LLM providers.☆12Oct 18, 2025Updated 5 months ago
- Kubernetes CSI Driver for serving OCI model artifacts☆25Mar 23, 2026Updated 3 weeks ago
- Like `kubectl get all`, but get really all resources☆30Apr 8, 2026Updated last week
- RocksDB/LevelDB inspired key-value database in Go☆10Nov 3, 2020Updated 5 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- 中国开发者活动日程(关注点:开源、开发者、云原生)☆24Updated this week
- Test Environment Booking tool☆14Nov 16, 2020Updated 5 years ago
- High Performance KV Cache Store for LLM☆53Apr 6, 2026Updated last week
- Helper libraries for Cheerp☆28Apr 1, 2026Updated 2 weeks ago
- Volcengine TOS C++ SDK☆11Mar 30, 2026Updated 2 weeks ago
- An Envoy inspired, ultimate LLM-first gateway for LLM serving and downstream application developers and enterprises☆26Apr 24, 2025Updated 11 months ago
- Koupleless serving system.☆11Oct 11, 2025Updated 6 months ago
- Inference scheduler for llm-d☆163Updated this week
- ☆10Aug 25, 2025Updated 7 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- HTNN: A cloud-native gateway offering seamless extensibility for Istio and Envoy, in a native way by Go.☆123Apr 8, 2026Updated last week
- ☆12Jan 31, 2026Updated 2 months ago
- Open Model Engine (OME) — Kubernetes operator for LLM serving, GPU scheduling, and model lifecycle management. Works with SGLang, vLLM, T…☆422Updated this week
- Dremy's 博客,React同构Web App☆11Sep 6, 2017Updated 8 years ago
- A workload for deploying LLM inference services on Kubernetes☆203Updated this week
- WebAssembly for Proxies (Go SDK)☆20Nov 3, 2025Updated 5 months ago
- Offline optimization of your disaggregated Dynamo graph☆255Updated this week
- Angular4 练习☆14Jun 20, 2017Updated 8 years ago
- 🧯 Kubernetes coverage for fault awareness and recovery, works for any LLMOps, MLOps, AI workloads.☆35Mar 31, 2026Updated 2 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Experimental Bookie C++ implementation☆15Jun 28, 2016Updated 9 years ago
- ☆15Jul 5, 2024Updated last year
- A Read/Write-Optimized Tree Index for Non-Volatile Memory☆16Jul 15, 2024Updated last year
- WG Serving☆34Mar 24, 2026Updated 3 weeks ago
- ☆79Aug 2, 2021Updated 4 years ago
- visual studio code extension for TDengine☆10Mar 21, 2023Updated 3 years ago
- A playground to experiment with Raft proposal pipeline optimization☆16Nov 4, 2022Updated 3 years ago
- CUE配置语言资源精选(欢迎投稿)☆11Jul 22, 2024Updated last year
- Alex Chi's personal site☆23Aug 17, 2025Updated 7 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Bella Openapi 实现了Claude Code依赖的 /v1/messsages 接口。所有在Bella-Openapi中接入的LLM协议均可使用Claude Code,不仅仅支持Claude系列模型,同时支持了Openai全系列、Gemini、DeepSeek、…☆16Nov 24, 2025Updated 4 months ago
- ☆19Dec 31, 2022Updated 3 years ago
- The main purpose of runtime copilot is to assist with node runtime management tasks such as configuring registries, upgrading versions, i…☆12May 16, 2023Updated 2 years ago
- 🎨 Readability enhanced, clean, nice looking colors palette we used across our Project AIRI projects!☆22Nov 25, 2025Updated 4 months ago
- 2020华为软件精英挑战赛,西北赛区,一方通行☆18Jun 27, 2020Updated 5 years ago
- LazyXds enables Istio only push needed xDS to sidecars to reduce resource consumption and speed up xDS configuration propagation.☆27Jul 5, 2023Updated 2 years ago
- Genai-bench is a powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serv…☆291Apr 2, 2026Updated 2 weeks ago