tensorchord / ai-infra-landscape
This is a landscape of the infrastructure that powers the generative AI ecosystem
☆150 · Updated last year
Alternatives and similar repositories for ai-infra-landscape
Users who are interested in ai-infra-landscape are comparing it to the libraries listed below.
- An awesome & curated list of best LLMOps tools. ☆172 · Updated this week
- Autoscale LLM (vLLM, SGLang, LMDeploy) inferences on Kubernetes (and others) ☆277 · Updated 2 years ago
- Finetune LLMs on K8s by using Runbooks ☆170 · Updated last year
- ☆42 · Updated last week
- Pretrain, finetune and serve LLMs on Intel platforms with Ray ☆130 · Updated 2 months ago
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment. ☆140 · Updated last week
- Repository for open inference protocol specification ☆60 · Updated 6 months ago
- A diverse, simple, and secure all-in-one LLMOps platform ☆109 · Updated last year
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes. ☆72 · Updated 4 months ago
- OME is a Kubernetes operator for enterprise-grade management and serving of Large Language Models (LLMs) ☆322 · Updated this week
- agent-sandbox enables easy management of isolated, stateful, singleton workloads, ideal for use cases like AI agent runtimes. ☆397 · Updated last week
- Holistic job manager on Kubernetes ☆115 · Updated last year
- High-scale LLM gateway, written in Rust. OpenTelemetry-based observability included ☆143 · Updated last week
- ☸️ Easy, advanced inference platform for large language models on Kubernetes. Star to support our work! ☆270 · Updated last week
- GPUd automates monitoring, diagnostics, and issue identification for GPUs ☆456 · Updated this week
- Helm charts for llm-d ☆50 · Updated 4 months ago
- Kubernetes Operator, ansible playbooks, and production scripts for large-scale AIStore deployments on Kubernetes. ☆113 · Updated last week
- A benchmarking tool for comparing different LLM API providers' DeepSeek model deployments. ☆30 · Updated 8 months ago
- llm-d helm charts and deployment examples ☆46 · Updated last week
- Benchmarking suite for popular AI APIs ☆88 · Updated 9 months ago
- A collection of YAML files, Helm Charts, Operator code, and guides to act as an example reference implementation for NVIDIA NIM deploymen… ☆213 · Updated last week
- A toolkit for discovering cluster network topology. ☆84 · Updated last week
- ☆267 · Updated last week
- ☆64 · Updated 8 months ago
- CUDA checkpoint and restore utility ☆393 · Updated 2 months ago
- A simple service that integrates vLLM with Ray Serve for fast and scalable LLM serving. ☆79 · Updated last year
- ☆17 · Updated 5 months ago
- Extensible generative AI platform on Kubernetes with OpenAI-compatible APIs. ☆90 · Updated last month
- GenAI inference performance benchmarking tool ☆133 · Updated last week
- JobSet: a k8s native API for distributed ML training and HPC workloads ☆282 · Updated last week