checkpoint-restore / criu-coordinatorLinks
A tool for coordinated checkpoint/restore of distributed applications with CRIU
☆30Updated 3 months ago
Alternatives and similar repositories for criu-coordinator
Users that are interested in criu-coordinator are comparing it to the libraries listed below
Sorting:
- Lightweight daemon for monitoring CUDA runtime API calls with eBPF uprobes☆143Updated 8 months ago
- ☆37Updated 2 months ago
- An OS kernel module for fast **remote** fork using advanced datacenter networking (RDMA).☆69Updated 10 months ago
- Asynchronous Rust bindings for UCX☆78Updated 7 months ago
- Asynchronous Rust bindings for SPDK.☆17Updated 3 years ago
- A tool to detect infrastructure issues on cloud native AI systems☆52Updated 3 months ago
- ☆21Updated 5 months ago
- GPU scheduler for elastic/distributed deep learning workloads in Kubernetes cluster (IC2E'23)☆34Updated 2 years ago
- ☆27Updated 2 years ago
- A storage plugin that provided CRI-O/Podman with the ability to lazy mount nydus images.☆40Updated 7 months ago
- [NSDI '24] DINT: Fast In-Kernel Distributed Transactions with eBPF☆51Updated last year
- Repository linking to the software artifacts used for the MigrOS ATC 2021 paper☆18Updated 4 years ago
- ☆50Updated last year
- Source code for the FAST '23 paper “MadFS: Per-File Virtualization for Userspace Persistent Memory Filesystems”☆45Updated 2 years ago
- The official implementation of OSDI'25 paper BlitzScale☆37Updated 3 months ago
- An efficient GPU resource sharing system with fine-grained control for Linux platforms.☆87Updated last year
- Distributed KV cache coordinator☆92Updated this week
- Systematic and comprehensive benchmarks for LLM systems.☆44Updated 3 weeks ago
- Enables building safer SPDK-based Rust applications☆81Updated last week
- ☆21Updated 4 years ago
- A distributed system for Agentic AI☆34Updated this week
- A toolkit for discovering cluster network topology.☆86Updated last week
- ☆18Updated 2 years ago
- An I/O benchmark for deep Learning applications☆95Updated last week
- rFaaS: a high-performance FaaS platform with RDMA acceleration for low-latency invocations.☆58Updated 5 months ago
- InfiniStore: an elastic serverless cloud storage system (VLDB'23)☆24Updated 2 years ago
- A user level library for applications to transparently use Intel DSA.☆39Updated last month
- Lightning In-Memory Object Store☆47Updated 3 years ago
- CUDA checkpoint and restore utility☆397Updated 3 months ago
- NVIDIA GPUDirect Storage Driver☆310Updated this week