chenhunghan / ialacol
πͺΆ Lightweight OpenAI drop-in replacement for Kubernetes
β143Updated 9 months ago
Related projects β
Alternatives and complementary repositories for ialacol
- Open Weight, tool-calling LLMsβ149Updated last month
- go-skynet helm chart repositoryβ55Updated this week
- π‘ Deploy AI models and apps to Kubernetes without developing a herniaβ31Updated 6 months ago
- Helm charts to deploy Weaviate to k8sβ51Updated this week
- Kuberentes LangChain Agent - Interact with Kubernetes Clusters using LLMsβ23Updated last year
- β70Updated this week
- Helm chart for Ollama on Kubernetesβ262Updated this week
- β‘Instant Stable Diffusion on k8s(Kubernetes) with Helmβ90Updated last year
- AI Inference Operator for Kubernetesβ546Updated this week
- Finetune LLMs on K8s by using Runbooksβ169Updated 2 months ago
- Knowledge for GPTScriptβ29Updated 3 weeks ago
- β16Updated 3 months ago
- Chart for deploying ChromaDB in Kubernetesβ41Updated 2 months ago
- β47Updated this week
- β39Updated last year
- Community-maintained Kubernetes config and Helm chart for Langfuseβ55Updated this week
- A fast batching API to serve LLM modelsβ172Updated 6 months ago
- An OpenAI-like LLaMA inference APIβ111Updated last year
- β124Updated 9 months ago
- One Repo To Quickly Build One Docker File for HuggingChat Front and BackEndβ26Updated last year
- starcoder server for huggingface-vscdoe custom endpointβ167Updated last year
- β22Updated 2 months ago
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.β110Updated 6 months ago
- Host the GPTQ model using AutoGPTQ as an API that is compatible with text generation UI API.β91Updated last year
- ποΈ Fine-tune, build, and deploy open-source LLMs easily!β394Updated last week
- Multi-node production GenAI stack. Run the best of open source AI easily on your own servers. Easily add knowledge from documents and scrβ¦β350Updated this week
- Repository hosting Langchain helm charts.β41Updated this week
- Run any Large Language Model behind a unified APIβ160Updated last year
- Constrain LLM outputβ106Updated 4 months ago
- An endpoint server for efficiently serving quantized open-source LLMs for code.β54Updated last year