premAI-io / prem-operator
π‘ Deploy AI models and apps to Kubernetes without developing a hernia
β32Updated 10 months ago
Alternatives and similar repositories for prem-operator:
Users that are interested in prem-operator are comparing it to the libraries listed below
- Extensible generative AI platform on Kubernetes with OpenAI-compatible APIs.β63Updated this week
- Quickly and securely turn any Linux box into a build and deployment assistantβ24Updated 5 months ago
- Knowledge for GPTScriptβ29Updated 4 months ago
- Open Weight, tool-calling LLMsβ151Updated 5 months ago
- β12Updated 8 months ago
- A text-to-SQL prototype on the northwind sqlite datasetβ12Updated 6 months ago
- π An awesome & curated list of best LLMOps tools.β63Updated this week
- π€ MLOps with Hugging Face Spaces and Daggerβ46Updated last year
- Runner in charge of collecting metrics from LLM inference endpoints for the Unify Hubβ17Updated last year
- A command-line utility to manage MLX models between your Hugging Face cache and LM Studio.β32Updated last month
- Document parser for RAGβ22Updated 4 months ago
- Kuberentes LangChain Agent - Interact with Kubernetes Clusters using LLMsβ26Updated last year
- Tools for formatting large language model prompts.β12Updated last year
- Foyle is a copilot to help developers deploy and operate their applications.β121Updated last week
- β18Updated 7 months ago
- AI aware proxyβ18Updated 6 months ago
- β173Updated last week
- CLI for spins up a K8s cluster locally in 10 seconds.β16Updated 9 months ago
- Official Vectorize MCP Serverβ16Updated last week
- Smart Kubernetes Schedulingβ76Updated this week
- Python module for running GPTScriptβ14Updated this week
- Simplifying Kubernetes cluster management with fully-managed Spacesβ61Updated last year
- β24Updated 6 months ago
- Structured outputs from DSPy and Jinja2β23Updated 3 months ago
- The home of official Obot toolsβ22Updated this week
- Modular, open source LLMOps stack that separates concerns: LiteLLM unifies LLM APIs, manages routing and cost controls, and ensures high-β¦β89Updated last month
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Largβ¦β20Updated 2 weeks ago
- Embed machine learning models in your Dockerfileβ86Updated 2 weeks ago
- Embed anything.β29Updated 10 months ago
- k8sAI is a RAG-enabled GPT for working with k8sβ64Updated 10 months ago