flame-sh / flame
A distributed system for intelligent workloads
☆19Updated 4 months ago
Related projects:
- The API (CRD) of Volcano☆33Updated this week
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.☆47Updated 3 weeks ago
- Resource Exporter for volcano scheduling, e.g. NUMA-Aware scheduling.☆16Updated 9 months ago
- Device plugins for Volcano, e.g. GPU☆98Updated this week
- ControllerMesh is a solution that helps developers manage their controllers/operators better with enhanced isolation.☆64Updated last year
- ☆105Updated last year
- A simulator of Kubernetes for batch and service workloads.☆45Updated 3 years ago
- ☆22Updated last week
- Holistic job manager on Kubernetes☆107Updated 6 months ago
- Tools to use with the Kruise libraries☆43Updated 2 weeks ago
- A general-purpose GPU monitor, which can monitor GPU cards and the usage of each pod or container.☆19Updated 2 years ago
- Resource Topology exporter for Topology Aware Scheduler☆13Updated 3 months ago
- ☸️ Easy, advanced inference platform for large language models on Kubernetes☆15Updated this week
- Multi-cluster api gateway based on apiserver-aggregation.☆92Updated 3 weeks ago
- NVIDIA device plugin for Kubernetes☆15Updated 5 years ago
- ☆31Updated 3 years ago
- katalyst aims to provide a universal solution to help improve resource utilization and optimize the overall costs in the cloud. This repo…☆36Updated last week
- Go Abstraction for Allocating NVIDIA GPUs with Custom Policies☆104Updated last month
- ☆14Updated 2 years ago
- ☆24Updated this week
- ☆53Updated last week
- A collection of community maintained NRI plugins☆54Updated this week
- a sample to showcase how to create a k8s scheduler extender☆55Updated 4 years ago
- Elastic Deep Learning Training based on Kubernetes by Leveraging EDL and Volcano☆31Updated last year
- elastic-gpu-agent is a Kubernetes device plugin for GPU resource allocation on nodes.☆54Updated 2 years ago
- a topology-scheduler and a descheduler extended from descheduler.☆15Updated 3 years ago
- RDMA CNI plugin for containerized workloads☆39Updated 2 weeks ago
- A Cloud-Native Service Catalog and Full Lifecycle Management Platform across Multi-cloud and Edge☆33Updated 11 months ago
- Libraries for implementing aggregated apiservers☆84Updated last month
- API and go SDK for KubeBrain☆17Updated last month