substratusai / runbooks
Finetune LLMs on K8s by using Runbooks
☆170Updated 7 months ago
Alternatives and similar repositories for runbooks:
Users that are interested in runbooks are comparing it to the libraries listed below
- Fine-tuning and serving LLMs on any cloud☆89Updated last year
- This is a landscape of the infrastructure that powers the generative AI ecosystem☆141Updated 6 months ago
- A kubernetes operator you should never run under any circumstances☆125Updated last year
- Cedana: Access and run on compute anywhere in the world, on any provider. Migrate seamlessly between providers, arbitraging price/perform…☆58Updated last week
- Open Weight, tool-calling LLMs☆151Updated 5 months ago
- 🪶 Lightweight OpenAI drop-in replacement for Kubernetes☆144Updated last year
- Foyle is a copilot to help developers deploy and operate their applications.☆125Updated 3 weeks ago
- Helm charts to deploy Weaviate to k8s☆59Updated this week
- Constrain LLM output☆110Updated 8 months ago
- Agent accuracy measurements for LLMs☆205Updated 10 months ago
- ☆108Updated 11 months ago
- Self-Host Cloud-Native Apps with the Ease of PaaS☆188Updated this week
- Modular, open source LLMOps stack that separates concerns: LiteLLM unifies LLM APIs, manages routing and cost controls, and ensures high-…☆92Updated 2 months ago
- ☆16Updated 10 months ago
- Action library for AI Agent☆212Updated 2 weeks ago
- Chart for deploying ChromaDB in Kubernetes☆46Updated last month
- Definition for Open Weights LIcensing☆135Updated 5 months ago
- Sister project to OpenLLMetry, but in Typescript. Open-source observability for your LLM application, based on OpenTelemetry☆307Updated 3 weeks ago
- Code Indexer Loop is a Python library for indexing and retrieving source code files through an integrated vector database that's continuo…☆174Updated last year
- Extensible generative AI platform on Kubernetes with OpenAI-compatible APIs.☆65Updated this week
- 🐦 A open blazing-fast simple model gateway for rapid development of production GenAI apps☆144Updated 8 months ago
- 📡 Deploy AI models and apps to Kubernetes without developing a hernia☆32Updated 10 months ago
- Repository for open inference protocol specification☆53Updated 8 months ago
- Structured LLM APIs☆156Updated last year
- A simple DAG for executing LLM calls and using tools.☆41Updated last year
- Run GGML models with Kubernetes.☆174Updated last year
- [deprecated] AI Gateway - core infrastructure stack for building production-ready AI Applications☆158Updated last year
- AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-te…☆887Updated this week
- LLM fine-tuning and eval☆346Updated last year
- Kuberentes LangChain Agent - Interact with Kubernetes Clusters using LLMs☆26Updated 2 years ago