kubeagi / arcadia
A diverse, simple, and secure all-in-one LLMOps platform
β91Updated 3 months ago
Alternatives and similar repositories for arcadia:
Users that are interested in arcadia are comparing it to the libraries listed below
- βΈοΈ Easy, advanced inference platform for large language models on Kubernetes. π Star to support our work!β49Updated this week
- π An awesome & curated list of best LLMOps tools.β35Updated last month
- Chat to deploy and manage applications on any infrastructureβ126Updated last year
- Large language model fine-tuning capabilities based on cloud native and distributed computing.β91Updated 10 months ago
- Device-plugin for volcano vgpu which support hard resource isolationβ59Updated 2 weeks ago
- π« A lightweight p2p-based cache system for model distributions on Kubernetes. Reframing now to make it an unified cache system with POSIβ¦β19Updated last month
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replicationβ168Updated this week
- Yet another operator for running large language models on Kubernetes with ease. Powered by Ollama! π«β123Updated this week
- A simple, High-Performance, Scalable ML/DL Models Repository based on OCI Artifactsβ33Updated last year
- A simple service that integrates vLLM with Ray Serve for fast and scalable LLM serving.β60Updated 9 months ago
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.β58Updated 2 weeks ago
- Using CRDs to manage GPU resources in Kubernetes.β193Updated 2 years ago
- β86Updated this week
- Knowledge for GPTScriptβ29Updated 2 months ago
- Autoscale LLM (vLLM, SGLang, LMDeploy) inferences on Kubernetes (and others)β247Updated last year
- Envoy AI Gateway is an open source project for using Envoy Gateway to handle request traffic from application clients to Generative AI seβ¦β89Updated this week
- π§― Kubernetes coverage for fault awareness and recovery, works for any LLMOps, MLOps, AI workloads.β25Updated 3 weeks ago
- llm-inference is a platform for publishing and managing llm inference, providing a wide range of out-of-the-box features for model deployβ¦β74Updated 8 months ago
- Helm charts from doubanβ48Updated 2 months ago
- Declarative Workflow of KubeVela which can run as standalone.β115Updated last month
- Device plugins for Volcano, e.g. GPUβ113Updated 4 months ago
- Open-source observability for your LLM application.β47Updated 2 weeks ago
- Kubeflow helm chartβ142Updated last year
- GPUd automates monitoring, diagnostics, and issue identification for GPUsβ259Updated this week
- a local implementation of OpenAI Assistants API: myla stands for MY Local Assistantβ50Updated 4 months ago
- An extendable scheduling and scaling tool built on Kubernetesβ58Updated last year
- A Cloud-Native Service Catalog and Full Lifecycle Management Platform accross Multi-cloud and Edgeβ33Updated last year
- Your AI Kubernetes Expertβ175Updated last year
- Helm charts for the KubeRay projectβ36Updated 3 months ago
- Cloud-native way to provide elastic Jupyter Notebooks on Kubernetes. Run remote kernels, natively.β195Updated 2 years ago