chenhunghan / ialacol
πͺΆ Lightweight OpenAI drop-in replacement for Kubernetes
β144Updated last year
Alternatives and similar repositories for ialacol:
Users that are interested in ialacol are comparing it to the libraries listed below
- Open Weight, tool-calling LLMsβ151Updated 5 months ago
- Kuberentes LangChain Agent - Interact with Kubernetes Clusters using LLMsβ26Updated 2 years ago
- β‘Instant Stable Diffusion on k8s(Kubernetes) with Helmβ92Updated last year
- go-skynet helm chart repositoryβ63Updated 2 months ago
- Chart for deploying ChromaDB in Kubernetesβ46Updated this week
- An example of running local models with GGMLβ39Updated last year
- Finetune LLMs on K8s by using Runbooksβ170Updated 7 months ago
- π‘ Deploy AI models and apps to Kubernetes without developing a herniaβ32Updated 10 months ago
- β57Updated 3 weeks ago
- Extensible generative AI platform on Kubernetes with OpenAI-compatible APIs.β67Updated this week
- starcoder server for huggingface-vscdoe custom endpointβ171Updated last year
- Helm chart for Ollama on Kubernetesβ416Updated last week
- π’ Yet another operator for running large language models on Kubernetes with ease. Powered by Ollama! π«β184Updated this week
- Helm charts to deploy Weaviate to k8sβ59Updated this week
- A high performance batching router optimises max throughput for text inference workloadβ16Updated last year
- β39Updated last year
- β18Updated 8 months ago
- β20Updated last year
- Self-hosted LLM chatbot arena, with yourself as the only judgeβ39Updated last year
- β161Updated this week
- Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hubβ160Updated last year
- OpenAI compatible API for serving LLAMA-2 modelβ218Updated last year
- Host the GPTQ model using AutoGPTQ as an API that is compatible with text generation UI API.β91Updated last year
- Self-host LLMs with vLLM and BentoMLβ105Updated this week
- β145Updated last year
- An endpoint server for efficiently serving quantized open-source LLMs for code.β54Updated last year
- Local LLaMAs/Models in VSCodeβ53Updated last year
- TheBloke's Dockerfilesβ303Updated last year
- Knowledge for GPTScriptβ29Updated 5 months ago
- Open source stack for applying AI to workflows in secure environmentsβ164Updated 7 months ago