prem-research / prem-operator
📡 Deploy AI models and apps to Kubernetes without developing a hernia
☆33 Updated last year
Alternatives and similar repositories for prem-operator
Users who are interested in prem-operator are comparing it to the libraries listed below.
- Extensible generative AI platform on Kubernetes with OpenAI-compatible APIs. ☆90 Updated 2 weeks ago
- Open Weight, tool-calling LLMs ☆155 Updated last year
- Quickly and securely turn any Linux box into a build and deployment assistant ☆24 Updated last year
- Knowledge for GPTScript ☆29 Updated 11 months ago
- ☆18 Updated last year
- ☆12 Updated last year
- Self-host LLMs with vLLM and BentoML ☆152 Updated 2 weeks ago
- Foyle is a copilot to help developers deploy and operate their applications. ☆133 Updated 7 months ago
- The home of official Obot tools ☆30 Updated last week
- Finetune LLMs on K8s by using Runbooks ☆170 Updated last year
- Helm charts for llm-d ☆50 Updated 3 months ago
- ☆258 Updated this week
- 🎉 An awesome & curated list of best LLMOps tools. ☆164 Updated last week
- Modular, open source LLMOps stack that separates concerns: LiteLLM unifies LLM APIs, manages routing and cost controls, and ensures high-… ☆119 Updated 8 months ago
- ☆37 Updated this week
- Kubernetes LangChain Agent - Interact with Kubernetes Clusters using LLMs ☆26 Updated 2 years ago
- Routing on Random Forest (RoRF) ☆214 Updated last year
- Tutorial for building LLM router ☆231 Updated last year
- ☆138 Updated 6 months ago
- Smart Kubernetes Scheduling ☆81 Updated this week
- This is a landscape of the infrastructure that powers the generative AI ecosystem ☆149 Updated last year
- Framework-Agnostic RL Environments for LLM Fine-Tuning ☆37 Updated this week
- A framework for building large-scale, deterministic, interactive workflows with a fault-tolerant, conversational UX ☆40 Updated this week
- ☆25 Updated last year
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment. ☆130 Updated this week
- Vanilla-Python ergonomics on top of DSPy ☆37 Updated 4 months ago
- 🪶 Lightweight OpenAI drop-in replacement for Kubernetes ☆146 Updated last year
- ☆14 Updated last year
- Transformer GPU VRAM estimator ☆67 Updated last year
- A voice assistant application built with the LiveKit Agents framework, capable of using Model Context Protocol (MCP) tools to interact wi… ☆58 Updated last week