tensorchord / ai-infra-landscapeLinks
This is a landscape of the infrastructure that powers the generative AI ecosystem
β149Updated last year
Alternatives and similar repositories for ai-infra-landscape
Users that are interested in ai-infra-landscape are comparing it to the libraries listed below
Sorting:
- Autoscale LLM (vLLM, SGLang, LMDeploy) inferences on Kubernetes (and others)β275Updated last year
- π An awesome & curated list of best LLMOps tools.β164Updated last week
- Finetune LLMs on K8s by using Runbooksβ170Updated last year
- A diverse, simple, and secure all-in-one LLMOps platformβ107Updated last year
- Extensible generative AI platform on Kubernetes with OpenAI-compatible APIs.β90Updated 2 weeks ago
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.β130Updated this week
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.β70Updated 3 months ago
- This repository contains statistics about the AI Infrastructure products.β17Updated 7 months ago
- GPUd automates monitoring, diagnostics, and issue identification for GPUsβ440Updated this week
- Repository for open inference protocol specificationβ59Updated 5 months ago
- β258Updated this week
- Open Weight, tool-calling LLMsβ155Updated last year
- Holistic job manager on Kubernetesβ116Updated last year
- βΈοΈ Easy, advanced inference platform for large language models on Kubernetes. π Star to support our work!β260Updated last week
- JobSet: a k8s native API for distributed ML training and HPC workloadsβ268Updated last week
- Embed machine learning models in your Dockerfileβ95Updated last month
- A distributed engine for elastic workload, e.g. AI Agent, Robotβ31Updated 2 weeks ago
- High-scale LLM gateway, written in Rust. OpenTelemetry-based observability includedβ139Updated this week
- β37Updated this week
- Your AI Kubernetes Expertβ186Updated 2 years ago
- xet client tech, used in huggingface_hubβ302Updated this week
- Helm charts for the KubeRay projectβ51Updated 3 months ago
- A toolkit for discovering cluster network topology.β74Updated this week
- MaK(Mac+Kubernetes)llama - Running and orchestrating large language models (LLMs) on Kubernetes with macOS nodes.β42Updated last year
- Securely run AI-generated code in stateful sandboxes that run forever.β221Updated 6 months ago
- β174Updated last week
- Cloud-native way to provide elastic Jupyter Notebooks on Kubernetes. Run remote kernels, natively.β204Updated 3 years ago
- π‘ Deploy AI models and apps to Kubernetes without developing a herniaβ33Updated last year
- Self-host LLMs with vLLM and BentoMLβ152Updated 2 weeks ago
- GenAI inference performance benchmarking toolβ106Updated last week