tensorchord / ai-infra-landscapeLinks
This is a landscape of the infrastructure that powers the generative AI ecosystem
β151Updated last year
Alternatives and similar repositories for ai-infra-landscape
Users that are interested in ai-infra-landscape are comparing it to the libraries listed below
Sorting:
- Autoscale LLM (vLLM, SGLang, LMDeploy) inferences on Kubernetes (and others)β279Updated 2 years ago
- π An awesome & curated list of best LLMOps tools.β183Updated 2 weeks ago
- Finetune LLMs on K8s by using Runbooksβ170Updated last year
- A diverse, simple, and secure all-in-one LLMOps platformβ109Updated last year
- Holistic job manager on Kubernetesβ115Updated last year
- This repository contains statistics about the AI Infrastructure products.β17Updated 10 months ago
- Open Model Engine (OME) β Kubernetes operator for LLM serving, GPU scheduling, and model lifecycle management. Works with SGLang, vLLM, Tβ¦β355Updated this week
- Pretrain, finetune and serve LLMs on Intel platforms with Rayβ131Updated 3 months ago
- β43Updated this week
- Modular, open source LLMOps stack that separates concerns: LiteLLM unifies LLM APIs, manages routing and cost controls, and ensures high-β¦β131Updated 10 months ago
- A Site Reliability Engineer AI agent that can monitor application and infrastructure logs, diagnose issues, and report on diagnostics.β130Updated last month
- Repository for open inference protocol specificationβ61Updated 8 months ago
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.β140Updated this week
- Extensible generative AI platform on Kubernetes with OpenAI-compatible APIs.β92Updated 3 months ago
- GPUd automates monitoring, diagnostics, and issue identification for GPUsβ468Updated last week
- βΈοΈ Easy, advanced inference platform for large language models on Kubernetes. π Star to support our work!β284Updated last month
- GenAI inference performance benchmarking toolβ140Updated 3 weeks ago
- agent-sandbox enables easy management of isolated, stateful, singleton workloads, ideal for use cases like AI agent runtimes.β667Updated this week
- Helm charts for llm-dβ52Updated 5 months ago
- β17Updated 6 months ago
- Your AI Kubernetes Expertβ186Updated 2 years ago
- WG Servingβ32Updated last month
- Open Weight, tool-calling LLMsβ155Updated last year
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.β73Updated 5 months ago
- Backend server for envdβ22Updated 2 years ago
- Securely run AI-generated code in stateful sandboxes that run forever.β223Updated 8 months ago
- Benchmarking suite for popular AI APIsβ88Updated 11 months ago
- πͺΆ Lightweight OpenAI drop-in replacement for Kubernetesβ147Updated last year
- Embed machine learning models in your Dockerfileβ101Updated 2 weeks ago
- A benchmarking tool for comparing different LLM API providers' DeepSeek model deployments.β30Updated 9 months ago