checkpoint-restore / criu-coordinator
A tool for coordinated checkpoint/restore of distributed applications with CRIU
☆30 Updated 3 months ago
Alternatives and similar repositories for criu-coordinator
Users who are interested in criu-coordinator are comparing it to the repositories listed below.
- Lightweight daemon for monitoring CUDA runtime API calls with eBPF uprobes ☆143 Updated 8 months ago
- An OS kernel module for fast **remote** fork using advanced datacenter networking (RDMA). ☆69 Updated 10 months ago
- The official implementation of OSDI'25 paper BlitzScale ☆37 Updated 3 months ago
- ☆37 Updated 2 months ago
- Asynchronous Rust bindings for UCX ☆78 Updated 7 months ago
- Source code for the FAST '23 paper “MadFS: Per-File Virtualization for Userspace Persistent Memory Filesystems” ☆45 Updated 2 years ago
- A tool to detect infrastructure issues on cloud native AI systems ☆52 Updated 3 months ago
- ☆21 Updated 5 months ago
- Repository linking to the software artifacts used for the MigrOS ATC 2021 paper ☆18 Updated 4 years ago
- ☆50 Updated last year
- [NSDI '24] DINT: Fast In-Kernel Distributed Transactions with eBPF ☆51 Updated last year
- ☆27 Updated 2 years ago
- ☆18 Updated 5 years ago
- Enables building safer SPDK-based Rust applications ☆81 Updated last week
- CUDA checkpoint and restore utility ☆397 Updated 3 months ago
- GPU scheduler for elastic/distributed deep learning workloads in Kubernetes cluster (IC2E'23) ☆34 Updated 2 years ago
- An efficient GPU resource sharing system with fine-grained control for Linux platforms. ☆87 Updated last year
- A storage plugin that provides CRI-O/Podman with the ability to lazy-mount nydus images. ☆40 Updated 7 months ago
- Systematic and comprehensive benchmarks for LLM systems. ☆44 Updated last month
- A file system over RDMA ☆28 Updated 3 years ago
- Asynchronous Rust bindings for SPDK. ☆17 Updated 3 years ago
- rFaaS: a high-performance FaaS platform with RDMA acceleration for low-latency invocations. ☆58 Updated 5 months ago
- Resource Allocation for Dynamic Demands ☆21 Updated last year
- Hooked CUDA-related dynamic libraries by using automated code generation tools. ☆172 Updated 2 years ago
- ☆78 Updated 2 years ago
- MCP Server for Linux Scheduler Management and Auto optimization ☆78 Updated 2 weeks ago
- Fast Container Provisioning on the Edge and over the WAN ☆50 Updated last month
- A distributed system for Agentic AI ☆34 Updated this week
- ☆53 Updated last week
- An I/O benchmark for deep learning applications ☆95 Updated last week