The Intelligent Inference Scheduler for Large-scale Inference Services.
☆65Feb 12, 2026Updated last month
Alternatives and similar repositories for aigw
Users that are interested in aigw are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Model Context Protocol (MCP) server implementation that enables comprehensive configuration and management of Higress.☆22Mar 29, 2025Updated 11 months ago
- An LLM Mock Server that supports simulating the protocols of all LLM providers.☆11Oct 18, 2025Updated 5 months ago
- Kubernetes CSI Driver for serving OCI model artifacts☆24Updated this week
- Like `kubectl get all`, but get really all resources☆29Mar 20, 2026Updated last week
- RocksDB/LevelDB inspired key-value database in Go☆10Nov 3, 2020Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- bandwidth limiting middleware plugin for Traefik that provides fine-grained control over data transfer rates. This plugin supports per-ba…☆15May 12, 2025Updated 10 months ago
- Test Environment Booking tool☆14Nov 16, 2020Updated 5 years ago
- Helper libraries for Cheerp☆28Mar 11, 2026Updated 2 weeks ago
- An Envoy inspired, ultimate LLM-first gateway for LLM serving and downstream application developers and enterprises☆26Apr 24, 2025Updated 11 months ago
- Koupleless serving system.☆11Oct 11, 2025Updated 5 months ago
- ☆10Aug 25, 2025Updated 7 months ago
- A tool for encypt or decrypt file dedicated for our forked TensorFlow Serving | 加解密文件工具,用于加密模型pb文件,提供给加密定制后的TensorFlow Serving服务☆12Jan 16, 2023Updated 3 years ago
- My notes about Higress (https://higress.io/)☆24Jul 27, 2025Updated 8 months ago
- HTNN: A cloud-native gateway offering seamless extensibility for Istio and Envoy, in a native way by Go.☆123Mar 19, 2026Updated last week
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 📚 经典技术书籍 PDF 文件,持续更新...☆13Jan 21, 2019Updated 7 years ago
- Open Model Engine (OME) — Kubernetes operator for LLM serving, GPU scheduling, and model lifecycle management. Works with SGLang, vLLM, T…☆404Updated this week
- Drawing Comparison Figures in Scientific Research Papers, includes lines and bars.☆11Mar 22, 2024Updated 2 years ago
- db_bench log parser☆18Apr 6, 2023Updated 2 years ago
- A workload for deploying LLM inference services on Kubernetes☆192Updated this week
- An object-oriented interface for abstracting away the ugly parts of ad server APIs☆14Apr 8, 2016Updated 9 years ago
- Simplified model deployment on llm-d☆28Jul 2, 2025Updated 8 months ago
- WebAssembly for Proxies (Go SDK)☆19Nov 3, 2025Updated 4 months ago
- vLLM Daily Summarization of Merged PRs☆48Updated this week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Angular4 练习☆14Jun 20, 2017Updated 8 years ago
- 🧯 Kubernetes coverage for fault awareness and recovery, works for any LLMOps, MLOps, AI workloads.☆35Mar 14, 2026Updated last week
- Experimental Bookie C++ implementation☆15Jun 28, 2016Updated 9 years ago
- Open Source Continuous Inference Benchmarking Qwen3.5, DeepSeek, GPTOSS - GB200 NVL72 vs MI355X vs B200 vs GB300 NVL72 vs H100 & soon™ TP…☆717Updated this week
- An advanced Git commit message generation utility designed to automatically craft high-quality commit messages with precision and sophist…☆16Mar 19, 2026Updated last week
- A Read/Write-Optimized Tree Index for Non-Volatile Memory☆16Jul 15, 2024Updated last year
- WG Serving☆34Mar 5, 2026Updated 3 weeks ago
- 机器学习资源☆16May 12, 2020Updated 5 years ago
- ☆79Aug 2, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A playground to experiment with Raft proposal pipeline optimization☆16Nov 4, 2022Updated 3 years ago
- ☆19Dec 31, 2022Updated 3 years ago
- The main purpose of runtime copilot is to assist with node runtime management tasks such as configuring registries, upgrading versions, i…☆12May 16, 2023Updated 2 years ago
- 🎨 Readability enhanced, clean, nice looking colors palette we used across our Project AIRI projects!☆21Nov 25, 2025Updated 4 months ago
- LazyXds enables Istio only push needed xDS to sidecars to reduce resource consumption and speed up xDS configuration propagation.☆27Jul 5, 2023Updated 2 years ago
- ☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!☆292Jan 26, 2026Updated 2 months ago
- space-agent, as the carrier of AO.space all-in-one, mainly provides a unified entrance for AO.space server to start.☆10Feb 6, 2026Updated last month