kaito-project / grit
CRIU based GPU workload migration in Kubernetes
☆15 Updated 4 months ago
Alternatives and similar repositories for grit
Users who are interested in grit are comparing it to the repositories listed below.
- Example DRA driver that developers can fork and modify to get them started writing their own. ☆88 Updated 2 weeks ago
- Cloud Native Artificial Intelligence Model Format Specification ☆91 Updated last week
- ☆25 Updated last week
- Node Resource Interface ☆328 Updated last week
- JobSet: a k8s native API for distributed ML training and HPC workloads ☆257 Updated this week
- A collection of community maintained NRI plugins ☆88 Updated last week
- ☆17 Updated last week
- GenAI inference performance benchmarking tool ☆93 Updated this week
- ☆140 Updated this week
- Enabling Kubernetes to make pod placement decisions with platform intelligence. ☆176 Updated 7 months ago
- Kubernetes Container Runtime Interface proxy service with hardware resource aware workload placement policies ☆177 Updated last month
- ☆258 Updated last week
- A benchmarking tool to evaluate Knative performance ☆38 Updated last year
- All the things to make the scheduler extendable with wasm. ☆129 Updated 2 months ago
- A containerd snapshotter with data deduplication and lazy loading in P2P fashion ☆201 Updated this week
- The kernel module management operator builds, signs and loads kernel modules in Kubernetes clusters. ☆107 Updated last week
- NVIDIA Network Operator ☆278 Updated last week
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes. ☆69 Updated last month
- NVIDIA DRA Driver for GPUs ☆434 Updated this week
- Manage K8S like managing local files ☆29 Updated 2 years ago
- 🧯 Kubernetes coverage for fault awareness and recovery, works for any LLMOps, MLOps, AI workloads. ☆32 Updated last week
- Holistic job manager on Kubernetes ☆116 Updated last year
- [Moved to https://github.com/kubernetes-sigs/kwok] This is a fake kubelet that can simulate any number of nodes and maintain pods on tho… ☆64 Updated 3 years ago
- Distributed KV cache coordinator ☆66 Updated this week
- Overlaybd: a block based remote image format. The storage backend of containerd/accelerated-container-image. ☆305 Updated 2 weeks ago
- Following the same workflows as Kubernetes. Widely used in InftyAI community. ☆14 Updated 2 months ago
- CNI DRA Driver ☆29 Updated 7 months ago
- CAPK is a provider for Cluster API (CAPI) that allows users to deploy fake, Kubemark-backed machines to their clusters. ☆79 Updated this week
- Inference scheduler for llm-d ☆86 Updated last week
- A toolkit for discovering cluster network topology. ☆65 Updated last week