prem-research / prem-operatorLinks
π‘ Deploy AI models and apps to Kubernetes without developing a hernia
β33Updated last year
Alternatives and similar repositories for prem-operator
Users that are interested in prem-operator are comparing it to the libraries listed below
Sorting:
- Quickly and securely turn any Linux box into a build and deployment assistantβ25Updated last year
- Extensible generative AI platform on Kubernetes with OpenAI-compatible APIs.β93Updated 4 months ago
- Open Weight, tool-calling LLMsβ156Updated last year
- Knowledge for GPTScriptβ29Updated last year
- Agentkube - Run Kubernetes Like Never Beforeβ28Updated this week
- β18Updated last year
- β280Updated this week
- Helm charts for llm-dβ52Updated 6 months ago
- β25Updated last year
- Foyle is a copilot to help developers deploy and operate their applications.β133Updated 10 months ago
- Finetune LLMs on K8s by using Runbooksβ170Updated last year
- β12Updated last year
- Self-host LLMs with vLLM and BentoMLβ168Updated 2 weeks ago
- πͺΆ Lightweight OpenAI drop-in replacement for Kubernetesβ147Updated 2 years ago
- β100Updated 8 months ago
- Routing on Random Forest (RoRF)β239Updated last year
- Streamlit Web UI for AGiXTβ28Updated last month
- IBM development fork of https://github.com/huggingface/text-generation-inferenceβ63Updated 4 months ago
- The home of official Obot toolsβ33Updated this week
- InferX: Inference as a Service Platformβ156Updated this week
- β25Updated last year
- A voice assistant application built with the LiveKit Agents framework, capable of using Model Context Protocol (MCP) tools to interact wiβ¦β63Updated 3 months ago
- Vanilla-Python ergonomics on top of DSPyβ39Updated 8 months ago
- Route LLM requests to the best model for the task at hand.β177Updated 3 weeks ago
- Own your AI, search the web with itππβ94Updated last year
- β107Updated 3 months ago
- β68Updated last year
- Modular, open source LLMOps stack that separates concerns: LiteLLM unifies LLM APIs, manages routing and cost controls, and ensures high-β¦β133Updated 11 months ago
- β93Updated last week
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crewβ¦β59Updated last year