AI-Hypercomputer/inference-benchmark

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/AI-Hypercomputer/inference-benchmark)

AI-Hypercomputer / inference-benchmark

☆18

Alternatives and similar repositories for inference-benchmark

Users that are interested in inference-benchmark are comparing it to the libraries listed below

Sorting:

openshift-psap / topsail
View on GitHub
Test Orchestrator for Performance and Scalability of AI pLatforms
☆16Feb 20, 2026Updated last week
knoway-dev / knoway
View on GitHub
An Envoy inspired, ultimate LLM-first gateway for LLM serving and downstream application developers and enterprises
☆26Apr 24, 2025Updated 10 months ago
NVIDIA / k8s-operator-libs
View on GitHub
A collection of useful Go libraries to ease the development of NVIDIA Operators for GPU/NIC management.
☆29Feb 15, 2026Updated last week
kaniuse / kaniuse
View on GitHub
caniuse.com, but for kubernetes
☆27Dec 25, 2024Updated last year
kubernetes-sigs / wg-serving
View on GitHub
WG Serving
☆34Dec 15, 2025Updated 2 months ago
fleetboard-io / fleetboard
View on GitHub
Fleetboard establishes an independent and unified parallel network, facilitating cross-cluster service discovery even in cases of IP over…
☆31May 13, 2025Updated 9 months ago
openshift / open-service-broker-sdk
View on GitHub
A starting point for creating service brokers implementing the Open Service Broker API
☆30Aug 11, 2017Updated 8 years ago
tensorchord / deepseek-api-arena
View on GitHub
A benchmarking tool for comparing different LLM API providers' DeepSeek model deployments.
☆30Mar 28, 2025Updated 10 months ago
llm-d-incubation / llm-d-infra
View on GitHub
llm-d helm charts and deployment examples
☆50Feb 19, 2026Updated last week
kubernetes-sigs / gateway-api-inference-extension
View on GitHub
Gateway API Inference Extension
☆594Updated this week
kubernetes-sigs / inference-perf
View on GitHub
GenAI inference performance benchmarking tool
☆151Updated this week
kubeflex-io / kubeflex
View on GitHub
☆13Dec 24, 2024Updated last year
lightseekorg / smg
View on GitHub
Shepherd Model Gateway
☆59Updated this week
Project-HAMi / dcu-vgpu-device-plugin
View on GitHub
☆15Aug 7, 2025Updated 6 months ago
volcengine / AICC-Trusted-MCP
View on GitHub
☆33Dec 26, 2025Updated 2 months ago
llm-d / llm-d-deployer
View on GitHub
Helm charts for llm-d
☆52Jul 22, 2025Updated 7 months ago
llm-d / llm-d-inference-sim
View on GitHub
A light weight vLLM simulator, for mocking out replicas.
☆87Updated this week
luskits / luscsi
View on GitHub
Provides deploy scripts and CSI for Lustre.
☆14Oct 27, 2025Updated 4 months ago
fleeto / issueflow
View on GitHub
Workflow based on github issues.
☆11Apr 30, 2019Updated 6 years ago
heiyhia / chinamap-panel
View on GitHub
请移步Echarts-panel
☆13Nov 7, 2017Updated 8 years ago
pdm-project / pdm-shear
View on GitHub
Detect and remove unused dependencies for Python projects
☆18Apr 5, 2025Updated 10 months ago
wy-z / tproto
View on GitHub
Parse golang data structure into proto3.
☆11Feb 6, 2018Updated 8 years ago
lfai / artwork
View on GitHub
LF AI & Data Foundation related logos and artwork
☆11Jan 30, 2026Updated 3 weeks ago
copilot-io / runtime-copilot
View on GitHub
The main purpose of runtime copilot is to assist with node runtime management tasks such as configuring registries, upgrading versions, i…
☆12May 16, 2023Updated 2 years ago
jbfavre / docker-dataiku-dss
View on GitHub
Docker image for Dataiku Science Studio
☆10Apr 20, 2017Updated 8 years ago
gdamore / chanstream
View on GitHub
Package chanstream implements an API compatible with and similiar to the TCP connection (and net.Conn as well) API, on top of Go channels…
☆14Sep 2, 2020Updated 5 years ago
shenxg13 / istio-no-best-practice
View on GitHub
☆10Mar 18, 2019Updated 6 years ago
georgetown-cset / CSET-AIID-harm-taxonomy
View on GitHub
Repo containing documentation and explanation for CSET's harm taxonomy of incidents from AIID.
☆18Jun 21, 2024Updated last year
ai-dynamo / modelexpress
View on GitHub
Model Express is a Rust-based component meant to be placed next to existing model inference systems to speed up their startup times and i…
☆30Updated this week
microsoft / NTT
View on GitHub
Navigation Turing Test (NTT): Learning to Evaluate Human-Like Navigation [ICML 2021]
☆15Jul 17, 2025Updated 7 months ago
iankoulski / do-framework
View on GitHub
Do Framework Definition
☆16Sep 13, 2024Updated last year
containerd / typeurl
View on GitHub
Go package for managing marshaled types to protobuf.Any
☆55Nov 7, 2024Updated last year
devfile / kubectl-debug-ide
View on GitHub
A kubectl plugin to debug Pods from an IDE rather than the CLI
☆10Dec 19, 2024Updated last year
opea-project / Enterprise-Inference
View on GitHub
Intel® AI for Enterprise Inference optimizes AI inference services on Intel hardware using Kubernetes Orchestration. It automates LLM mod…
☆36Feb 20, 2026Updated last week
AkihiroSuda / kina
View on GitHub
Kubernetes in Apple Containerization
☆41Nov 17, 2025Updated 3 months ago
run-ai / kwok-operator
View on GitHub
☆16Jul 18, 2025Updated 7 months ago
kubernetes-sigs / azurelustre-csi-driver
View on GitHub
☆16Feb 6, 2026Updated 3 weeks ago
bobmayuze / Earning-Sage
View on GitHub
An earning call robot built with LLM
☆10Aug 4, 2023Updated 2 years ago
viacoin / atomicswap
View on GitHub
Decred: On-chain atomic swaps for Viacoin, Litecoin and other cryptocurrencies.
☆12Jan 30, 2023Updated 3 years ago