lfedgeai / SPEARLinks
Distributed Cloud-Edge Collaborative AI Agent Platform
β28Updated last week
Alternatives and similar repositories for SPEAR
Users that are interested in SPEAR are comparing it to the libraries listed below
Sorting:
- A light weight vLLM simulator, for mocking out replicas.β30Updated this week
- π§― Kubernetes coverage for fault awareness and recovery, works for any LLMOps, MLOps, AI workloads.β30Updated last week
- OME is a Kubernetes operator for enterprise-grade management and serving of Large Language Models (LLMs)β174Updated this week
- The main purpose of runtime copilot is to assist with node runtime management tasks such as configuring registries, upgrading versions, iβ¦β12Updated 2 years ago
- Inference scheduler for llm-dβ67Updated this week
- Distributed KV cache coordinatorβ43Updated last week
- GenAI inference performance benchmarking toolβ66Updated last week
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.β68Updated 2 months ago
- A toolkit for discovering cluster network topology.β56Updated last week
- Cloud Native Benchmarking of Foundation Modelsβ38Updated last month
- Go Bindings for CRIUβ202Updated last month
- A tool for coordinated checkpoint/restore of distributed applications with CRIUβ25Updated last month
- β37Updated last week
- π An awesome & curated list of best LLMOps tools.β135Updated this week
- FaaSNet: Scalable and Fast Provisioning of Custom Serverless Container Runtimes at Alibaba Cloud Function Compute (USENIX ATC'21)β55Updated 3 years ago
- Holistic job manager on Kubernetesβ117Updated last year
- A tool to detect infrastructure issues on cloud native AI systemsβ42Updated last month
- Serverless Paper Reading and Discussionβ37Updated 2 years ago
- Go Abstraction for Allocating NVIDIA GPUs with Custom Policiesβ115Updated 2 weeks ago
- Example DRA driver that developers can fork and modify to get them started writing their own.β77Updated last week
- Kubernetes Container Runtime Interface proxy service with hardware resource aware workload placement policiesβ179Updated 3 months ago
- βΈοΈ Easy, advanced inference platform for large language models on Kubernetes. π Star to support our work!β224Updated last week
- Command-line tools for managing OCI model artifacts, which are bundled based on Model Specβ30Updated this week
- DraNet is a Kubernetes Network Driver that uses Dynamic Resource Allocation (DRA) to deliver high-performance networking for demanding apβ¦β88Updated this week
- Push-Button End-to-End Testing of Kubernetes Operators and Controllersβ126Updated this week
- GPUd automates monitoring, diagnostics, and issue identification for GPUsβ387Updated this week
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.β117Updated last week
- β23Updated 3 weeks ago
- Resource Exporter for volcano scheduling, e.g. NUMA-Aware scheduling.β17Updated last month
- Cloud Native Artifacial Intelligence Model Format Specificationβ76Updated last week