substratusai / runbooks
Finetune LLMs on K8s by using Runbooks
β170Updated 8 months ago
Alternatives and similar repositories for runbooks:
Users that are interested in runbooks are comparing it to the libraries listed below
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models accessβ¦β114Updated last year
- πͺΆ Lightweight OpenAI drop-in replacement for Kubernetesβ144Updated last year
- Open Weight, tool-calling LLMsβ151Updated 6 months ago
- Constrain LLM outputβ110Updated 9 months ago
- A kubernetes operator you should never run under any circumstancesβ125Updated 2 years ago
- Sister project to OpenLLMetry, but in Typescript. Open-source observability for your LLM application, based on OpenTelemetryβ311Updated last week
- β108Updated last year
- Definition for Open Weights LIcensingβ135Updated 6 months ago
- Modular, open source LLMOps stack that separates concerns: LiteLLM unifies LLM APIs, manages routing and cost controls, and ensures high-β¦β93Updated 2 months ago
- βΎοΈ Helix is a private GenAI stack for building AI applications with declarative pipelines, knowledge (RAG), API bindings, and first-classβ¦β491Updated this week
- Action library for AI Agentβ214Updated last month
- Foyle is a copilot to help developers deploy and operate their applications.β127Updated last month
- Fine-tuning and serving LLMs on any cloudβ89Updated last year
- AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-teβ¦β916Updated last week
- Chat strategies for LLMsβ94Updated 8 months ago
- β16Updated 11 months ago
- Extensible generative AI platform on Kubernetes with OpenAI-compatible APIs.β68Updated this week
- Agent accuracy measurements for LLMsβ205Updated 10 months ago
- Tutorial for building LLM routerβ198Updated 9 months ago
- Helm charts to deploy Weaviate to k8sβ60Updated 2 weeks ago
- Complex LLM Workflows from Simple JSON.β297Updated last year
- β163Updated last year
- This is a landscape of the infrastructure that powers the generative AI ecosystemβ144Updated 6 months ago
- AI-to-AI Testing | Simulation framework for LLM-based applicationsβ137Updated last year
- π‘ Deploy AI models and apps to Kubernetes without developing a herniaβ32Updated 11 months ago
- Repository for open inference protocol specificationβ54Updated 9 months ago
- Code Indexer Loop is a Python library for indexing and retrieving source code files through an integrated vector database that's continuoβ¦β175Updated last year
- Community-maintained Kubernetes config and Helm chart for Langfuseβ100Updated 2 weeks ago
- Implement recursion using English as the programming language and an LLM as the runtime.β232Updated 2 years ago
- Embed machine learning models in your Dockerfileβ88Updated last week