tensorchord / ai-infra-landscapeLinks
This is a landscape of the infrastructure that powers the generative AI ecosystem
☆147Updated 8 months ago
Alternatives and similar repositories for ai-infra-landscape
Users that are interested in ai-infra-landscape are comparing it to the libraries listed below
Sorting:
- 🎉 An awesome & curated list of best LLMOps tools.☆122Updated last week
- Autoscale LLM (vLLM, SGLang, LMDeploy) inferences on Kubernetes (and others)☆269Updated last year
- Pretrain, finetune and serve LLMs on Intel platforms with Ray☆129Updated last month
- Repository for open inference protocol specification☆56Updated last month
- Cloud-native way to provide elastic Jupyter Notebooks on Kubernetes. Run remote kernels, natively.☆201Updated 3 years ago
- Finetune LLMs on K8s by using Runbooks☆170Updated 9 months ago
- 📡 Deploy AI models and apps to Kubernetes without developing a hernia☆32Updated last year
- A simple service that integrates vLLM with Ray Serve for fast and scalable LLM serving.☆67Updated last year
- A diverse, simple, and secure all-in-one LLMOps platform☆105Updated 9 months ago
- Extensible generative AI platform on Kubernetes with OpenAI-compatible APIs.☆82Updated this week
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.☆67Updated last month
- Helm charts for the KubeRay project☆44Updated this week
- 🧯 Kubernetes coverage for fault awareness and recovery, works for any LLMOps, MLOps, AI workloads.☆30Updated 6 months ago
- Holistic job manager on Kubernetes☆116Updated last year
- K8s device plugin for GPU sharing☆98Updated 2 years ago
- Using LlamaIndex with Ray for productionizing LLM applications☆71Updated last year
- A list of DevOps features enabled by awesome LLM software.☆54Updated 9 months ago
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.☆114Updated this week
- KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale☆656Updated this week
- ☆221Updated this week
- Open Weight, tool-calling LLMs☆152Updated 8 months ago
- JobSet: a k8s native API for distributed ML training and HPC workloads☆236Updated last week
- Modular, open source LLMOps stack that separates concerns: LiteLLM unifies LLM APIs, manages routing and cost controls, and ensures high-…☆105Updated 4 months ago
- Runner in charge of collecting metrics from LLM inference endpoints for the Unify Hub☆17Updated last year
- Rust crates for XetHub☆43Updated 8 months ago
- Chart for deploying ChromaDB in Kubernetes☆49Updated last month
- Knowledge for GPTScript☆29Updated 7 months ago
- elastic-gpu-scheduler is a Kubernetes scheduler extender for GPU resources scheduling.☆141Updated 2 years ago
- GenAI inference performance benchmarking tool☆58Updated this week
- Helm charts to deploy Weaviate to k8s☆59Updated last month