tensorchord / ai-infra-landscape

This is a landscape of the infrastructure that powers the generative AI ecosystem

☆148

Alternatives and similar repositories for ai-infra-landscape

Users who are interested in ai-infra-landscape compare it to the libraries listed below.

tensorchord / openmodelz
Autoscale LLM (vLLM, SGLang, LMDeploy) inferences on Kubernetes (and others)
☆270 Updated last year
InftyAI / Awesome-LLMOps
🎉 An awesome & curated list of best LLMOps tools.
☆141 Updated this week
substratusai / runbooks
Finetune LLMs on K8s by using Runbooks
☆169 Updated 11 months ago
sgl-project / ome
OME is a Kubernetes operator for enterprise-grade management and serving of Large Language Models (LLMs)
☆202 Updated this week
intel / llm-on-ray
Pretrain, finetune and serve LLMs on Intel platforms with Ray
☆128 Updated 3 weeks ago
kubeagi / arcadia
A diverse, simple, and secure all-in-one LLMOps platform
☆107 Updated 10 months ago
run-ai / runai-model-streamer
☆231 Updated this week
project-codeflare / multi-cluster-app-dispatcher
Holistic job manager on Kubernetes
☆117 Updated last year
kserve / open-inference-protocol
Repository for open inference protocol specification
☆59 Updated 2 months ago
NVIDIA / k8s-nim-operator
An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.
☆119 Updated this week
llmariner / llmariner
Extensible generative AI platform on Kubernetes with OpenAI-compatible APIs.
☆87 Updated this week
prem-research / prem-operator
📡 Deploy AI models and apps to Kubernetes without developing a hernia
☆32 Updated last year
leptonai / gpud
GPUd automates monitoring, diagnostics, and issue identification for GPUs
☆405 Updated this week
traceloop / hub
High-scale LLM gateway, written in Rust. OpenTelemetry-based observability included
☆116 Updated 2 weeks ago
NVIDIA / knavigator
knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.
☆69 Updated 2 weeks ago
coreweave / ml-containers
☆38 Updated this week
rubra-ai / rubra
Open Weight, tool-calling LLMs
☆154 Updated 9 months ago
NVIDIA / ais-k8s
Kubernetes Operator, ansible playbooks, and production scripts for large-scale AIStore deployments on Kubernetes.
☆106 Updated this week
NVIDIA / topograph
A toolkit for discovering cluster network topology.
☆59 Updated last week
tensorchord / deepseek-api-arena
A benchmarking tool for comparing different LLM API providers' DeepSeek model deployments.
☆29 Updated 4 months ago
InftyAI / llmaz
☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!
☆228 Updated last week
ray-project / kuberay-helm
Helm charts for the KubeRay project
☆50 Updated 2 weeks ago
llm-d / llm-d-deployer
Helm charts for llm-d
☆51 Updated last week
kubernetes-sigs / jobset
JobSet: a k8s native API for distributed ML training and HPC workloads
☆246 Updated this week
bentoml / BentoVLLM
Self-host LLMs with vLLM and BentoML
☆138 Updated last week
flame-sh / flame
A distributed engine for elastic workload
☆27 Updated this week
chenhunghan / ialacol
🪶 Lightweight OpenAI drop-in replacement for Kubernetes
☆146 Updated last year
depot / depot.ai
Embed machine learning models in your Dockerfile
☆93 Updated last week
langfuse / oss-llmops-stack
Modular, open source LLMOps stack that separates concerns: LiteLLM unifies LLM APIs, manages routing and cost controls, and ensures high-…
☆107 Updated 5 months ago
gptscript-ai / knowledge
Knowledge for GPTScript
☆29 Updated 9 months ago