The Intelligent Inference Scheduler for Large-scale Inference Services.
☆66Feb 12, 2026Updated 2 months ago
Alternatives and similar repositories for aigw
Users that are interested in aigw are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Model Context Protocol (MCP) server implementation that enables comprehensive configuration and management of Higress.☆21Mar 29, 2025Updated last year
- An LLM Mock Server that supports simulating the protocols of all LLM providers.☆14Oct 18, 2025Updated 6 months ago
- Kubernetes CSI Driver for serving OCI model artifacts☆25Apr 29, 2026Updated last week
- RocksDB/LevelDB inspired key-value database in Go☆10Nov 3, 2020Updated 5 years ago
- 中国开发者活动日程(关注点:开源、开发者、云原生)☆24Apr 27, 2026Updated last week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- High Performance KV Cache Store for LLM☆53Apr 6, 2026Updated last month
- Volcengine TOS C++ SDK☆11Apr 29, 2026Updated last week
- An Envoy inspired, ultimate LLM-first gateway for LLM serving and downstream application developers and enterprises☆26Apr 24, 2025Updated last year
- Koupleless serving system.☆12Oct 11, 2025Updated 6 months ago
- Helper libraries for Cheerp☆28Apr 1, 2026Updated last month
- Inference scheduler for llm-d☆176Updated this week
- ☆10Aug 25, 2025Updated 8 months ago
- My notes about Higress (https://higress.io/)☆25Jul 27, 2025Updated 9 months ago
- HTNN: A cloud-native gateway offering seamless extensibility for Istio and Envoy, in a native way by Go.☆122Apr 8, 2026Updated 3 weeks ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 📚 经典技术书籍 PDF 文件,持续更新...☆13Jan 21, 2019Updated 7 years ago
- ☆12Jan 31, 2026Updated 3 months ago
- Dremy's 博客,React同构Web App☆11Sep 6, 2017Updated 8 years ago
- Open Model Engine (OME) — Kubernetes operator for LLM serving, GPU scheduling, and model lifecycle management. Works with SGLang, vLLM, T…☆440Updated this week
- A TimerQueue Based on Poll☆14May 13, 2019Updated 6 years ago
- db_bench log parser☆18Apr 6, 2023Updated 3 years ago
- A workload for deploying LLM inference services on Kubernetes☆212Updated this week
- A flexible serving framework that delivers efficient and fault-tolerant LLM inference for clustered deployments.☆92Apr 15, 2026Updated 3 weeks ago
- Simplified model deployment on llm-d☆28Jul 2, 2025Updated 10 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- WebAssembly for Proxies (Go SDK)☆20Nov 3, 2025Updated 6 months ago
- Angular4 练习☆14Jun 20, 2017Updated 8 years ago
- 🧯 Kubernetes coverage for fault awareness and recovery, works for any LLMOps, MLOps, AI workloads.☆35Updated this week
- Offline optimization of your disaggregated Dynamo graph☆280Updated this week
- A Claude Code skill that generates interactive HTML walkthroughs with clickable Mermaid diagrams to explain codebase features, flows, and…☆40Mar 23, 2026Updated last month
- An advanced Git commit message generation utility designed to automatically craft high-quality commit messages with precision and sophist…☆14Apr 8, 2026Updated 3 weeks ago
- Nacos mcp wrapper Python sdk☆26Dec 23, 2025Updated 4 months ago
- A Read/Write-Optimized Tree Index for Non-Volatile Memory☆16Jul 15, 2024Updated last year
- 机器学习资源☆16May 12, 2020Updated 5 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- WG Serving☆35Mar 24, 2026Updated last month
- ☆78Aug 2, 2021Updated 4 years ago
- visual studio code extension for TDengine☆10Mar 21, 2023Updated 3 years ago
- ☆13Jun 6, 2024Updated last year
- Open Source Continuous Inference Benchmarking Qwen3.5, DeepSeek, GPTOSS - GB200 NVL72 vs MI355X vs B200 vs GB300 NVL72 vs H100 & soon™ TP…☆924Updated this week
- CUE配置语言资源精选(欢迎投稿)☆11Jul 22, 2024Updated last year
- Bella Openapi 实现了Claude Code依赖的 /v1/messsages 接口。所有在Bella-Openapi中接入的LLM协议均可使用Claude Code,不仅仅支持Claude系列模型,同时支持了Openai全系列、Gemini、DeepSeek、…☆16Nov 24, 2025Updated 5 months ago