tensorchord / ai-infra-landscape
This is a landscape of the infrastructure that powers the generative AI ecosystem
☆151 Updated last year
Alternatives and similar repositories for ai-infra-landscape
Users who are interested in ai-infra-landscape are comparing it to the repositories listed below.
- Autoscale LLM (vLLM, SGLang, LMDeploy) inferences on Kubernetes (and others) ☆279 Updated 2 years ago
- Finetune LLMs on K8s by using Runbooks ☆170 Updated last year
- 🎉 An awesome & curated list of best LLMOps tools. ☆189 Updated last week
- A diverse, simple, and secure all-in-one LLMOps platform ☆109 Updated last year
- Kubernetes-native AI serving platform for scalable model serving. ☆198 Updated this week
- This repository contains statistics about the AI Infrastructure products. ☆17 Updated 11 months ago
- Holistic job manager on Kubernetes ☆116 Updated last year
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment. ☆142 Updated last week
- ☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work! ☆287 Updated last week
- Pretrain, finetune and serve LLMs on Intel platforms with Ray ☆131 Updated 4 months ago
- Repository for open inference protocol specification ☆63 Updated 8 months ago
- Open Model Engine (OME) — Kubernetes operator for LLM serving, GPU scheduling, and model lifecycle management. Works with SGLang, vLLM, T… ☆365 Updated last week
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes. ☆74 Updated 6 months ago
- Open Weight, tool-calling LLMs ☆156 Updated last year
- GPUd automates monitoring, diagnostics, and issue identification for GPUs ☆472 Updated this week
- Extensible generative AI platform on Kubernetes with OpenAI-compatible APIs. ☆93 Updated 3 months ago
- Kubernetes Operator, ansible playbooks, and production scripts for large-scale AIStore deployments on Kubernetes. ☆124 Updated this week
- A distributed system for Agentic AI ☆43 Updated this week
- GenAI inference performance benchmarking tool ☆141 Updated last week
- Your AI Kubernetes Expert ☆186 Updated 2 years ago
- Helm charts for llm-d ☆52 Updated 6 months ago
- Backend server for envd ☆22 Updated 2 years ago
- A toolkit for discovering cluster network topology. ☆96 Updated this week
- ☆44 Updated this week
- ☆67 Updated 10 months ago
- Cloud-native way to provide elastic Jupyter Notebooks on Kubernetes. Run remote kernels, natively. ☆204 Updated 3 years ago
- ☆278 Updated 2 weeks ago
- Custom Scheduler to deploy ML models to TRTIS for GPU Sharing ☆11 Updated 5 years ago
- Distributed KV cache scheduling & offloading libraries ☆98 Updated last week
- Documentation repository for NVIDIA Cloud Native Technologies ☆35 Updated last week