tensorchord / ai-infra-landscape
This is a landscape of the infrastructure that powers the generative AI ecosystem
☆135 Updated 3 months ago
Alternatives and similar repositories for ai-infra-landscape:
Users who are interested in ai-infra-landscape are comparing it to the libraries listed below:
- Finetune LLMs on K8s by using Runbooks ☆170 Updated 5 months ago
- Repository for open inference protocol specification ☆45 Updated 6 months ago
- Autoscale LLM (vLLM, SGLang, LMDeploy) inference on Kubernetes (and others) ☆248 Updated last year
- Helm charts for the KubeRay project ☆37 Updated last week
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication ☆176 Updated this week
- Open Weight, tool-calling LLMs ☆151 Updated 3 months ago
- 🎉 An awesome & curated list of the best LLMOps tools. ☆36 Updated this week
- Cluster/scheduler health monitoring for GPU jobs on k8s ☆47 Updated this week
- ChatData 🔍 📖 brings RAG to real applications with FREE✨ knowledge bases. Now enjoy your chat with 6 million Wikipedia pages and 2 milli… ☆161 Updated 2 months ago
- 📡 Deploy AI models and apps to Kubernetes without developing a hernia ☆32 Updated 8 months ago
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment. ☆79 Updated this week
- K8s device plugin for GPU sharing ☆99 Updated last year
- A simple DAG for executing LLM calls and using tools. ☆39 Updated last year
- Envoy AI Gateway is an open source project for using Envoy Gateway to handle request traffic from application clients to Generative AI se… ☆100 Updated this week
- Open source AI Agent Platform ☆90 Updated this week
- JobSet: a k8s native API for distributed ML training and HPC workloads ☆175 Updated this week
- ☆106 Updated 8 months ago
- ☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work! ☆53 Updated this week
- ☆153 Updated last week
- Knowledge for GPTScript ☆29 Updated 3 months ago
- Embed machine learning models in your Dockerfile ☆84 Updated this week
- Community-maintained Kubernetes config and Helm chart for Langfuse ☆68 Updated last week
- Workflow orchestrator written in Rust ☆45 Updated 3 weeks ago
- This repository contains statistics about the AI Infrastructure products. ☆18 Updated this week
- Sister project to OpenLLMetry, but in TypeScript. Open-source observability for your LLM application, based on OpenTelemetry ☆287 Updated this week
- OpenTelemetry Instrumentation for AI Observability ☆259 Updated this week
- Augment Swarm with durable execution to help you build reliable and scalable multi-agent systems. ☆88 Updated 2 months ago
- Holistic job manager on Kubernetes ☆111 Updated 11 months ago
- Action library for AI Agent ☆206 Updated this week
- Landscape2 is a tool that generates interactive landscape websites ☆160 Updated this week