kubeagi / arcadia
A diverse, simple, and secure all-in-one LLMOps platform
β101Updated 6 months ago
Alternatives and similar repositories for arcadia:
Users that are interested in arcadia are comparing it to the libraries listed below
- βΈοΈ Easy, advanced inference platform for large language models on Kubernetes. π Star to support our work!β123Updated this week
- β¨Kubewizard is An AI-Agent for automated Kubernetes troubleshooting, and management, based on LangChain and k8s related tools.β18Updated 3 months ago
- π An awesome & curated list of best LLMOps tools.β83Updated last week
- A simple, High-Performance, Scalable ML/DL Models Repository based on OCI Artifactsβ33Updated last year
- Large language model fine-tuning capabilities based on cloud native and distributed computing.β92Updated last year
- Chat to deploy and manage applications on any infrastructureβ126Updated last year
- A distributed engine for intelligent workloadβ27Updated 2 months ago
- π« A lightweight p2p-based cache system for model distributions on Kubernetes. Reframing now to make it an unified cache system with POSIβ¦β20Updated 4 months ago
- β36Updated 2 weeks ago
- A simple service that integrates vLLM with Ray Serve for fast and scalable LLM serving.β65Updated last year
- β66Updated this week
- This is a landscape of the infrastructure that powers the generative AI ecosystemβ141Updated 6 months ago
- a local implementation of OpenAI Assistants API: myla stands for MY Local Assistantβ53Updated 7 months ago
- AIEvo is a multi-agent framework open sourced by Ant Group. Through this framework, you can create a multi-agent application.β71Updated 3 weeks ago
- Knowledge for GPTScriptβ29Updated 5 months ago
- Using CRDs to manage GPU resources in Kubernetes.β197Updated 2 years ago
- An extendable scheduling and scaling tool built on Kubernetesβ58Updated last year
- Device-plugin for volcano vgpu which support hard resource isolationβ71Updated last month
- An Envoy inspired, ultimate LLM-first gateway for LLM serving and downstream application developers and enterprisesβ16Updated 4 months ago
- A Cloud-Native Service Catalog and Full Lifecycle Management Platform accross Multi-cloud and Edgeβ33Updated last year
- llm-inference is a platform for publishing and managing llm inference, providing a wide range of out-of-the-box features for model deployβ¦β80Updated 11 months ago
- Core API package for KubeVela https://github.com/kubevela/kubevelaβ24Updated last month
- Portal to let user explore, create, manage AI Agents/GPTsβ9Updated last year
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.β64Updated 3 weeks ago
- π’ Yet another operator for running large language models on Kubernetes with ease. Powered by Ollama! π«β184Updated this week
- Declarative Workflow of KubeVela which can run as standalone.β120Updated last month
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replicationβ397Updated this week
- Open-source observability for your LLM application.β51Updated 3 months ago
- A stress testing tool for the scheduler in a large-scale scenario.β16Updated 11 months ago
- Autoscale LLM (vLLM, SGLang, LMDeploy) inferences on Kubernetes (and others)β263Updated last year