DMTCP-CRAC / CRAC-early-developmentLinks

☆24

Alternatives and similar repositories for CRAC-early-development

Users that are interested in CRAC-early-development are comparing it to the libraries listed below

Sorting:

Bruce-Lee-LY / cuda_hook
Hooked CUDA-related dynamic libraries by using automated code generation tools.
☆166Updated last year
NTHU-LSALAB / Gemini
An efficient GPU resource sharing system with fine-grained control for Linux platforms.
☆85Updated last year
cjg / GVirtuS
This repository is an archive. Refer to https://github.com/gvirtus/GVirtuS
☆45Updated 3 years ago
nchong / cudahook
Intercepting CUDA runtime calls with LD_PRELOAD
☆42Updated 11 years ago
pkusys / TGS
Artifacts for our NSDI'23 paper TGS
☆89Updated last year
Mellanox / nccl-rdma-sharp-plugins
RDMA and SHARP plugins for nccl library
☆209Updated last month
SymbioticLab / Salus
Fine-grained GPU sharing primitives
☆144Updated 2 months ago
stanford-futuredata / gavel
Code for "Heterogenity-Aware Cluster Scheduling Policies for Deep Learning Workloads", which appeared at OSDI 2020
☆130Updated last year
msr-fiddle / philly-traces
☆195Updated 6 years ago
Mellanox / gpu_direct_rdma_access
example code for using DC QP for providing RDMA READ and WRITE operations to remote GPU memory
☆145Updated last year
alibaba / GPU-scheduler-for-deep-learning
GPU-scheduler-for-deep-learning
☆210Updated 4 years ago
microsoft / NPKit
NCCL Profiling Kit
☆145Updated last year
casys-kaist / glet
☆52Updated 9 months ago
eth-easl / orion
An interference-aware scheduler for fine-grained GPU sharing
☆147Updated 8 months ago
RWTH-ACS / cricket
cricket is a virtualization solution for GPUs
☆214Updated last month
google / nccl-fastsocket
NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud.
☆121Updated last year
Mellanox / nv_peer_memory
☆375Updated last year
SymbioticLab / Tiresias
Tiresias is a GPU cluster manager for distributed deep learning training.
☆162Updated 5 years ago
sakjain92 / Fractional-GPUs
Splits single Nvidia GPU into multiple partitions with complete compute and memory isolation (wrt to performace) between the partitions
☆159Updated 6 years ago
eniac / paella
Paella: Low-latency Model Serving with Virtualized GPU Scheduling
☆62Updated last year
SJTU-IPADS / reef
REEF is a GPU-accelerated DNN inference serving system that enables instant kernel preemption and biased concurrent execution in GPU sche…
☆101Updated 2 years ago
stanford-mast / iBench
Suite of contentious microbenchmarks
☆55Updated 8 years ago
gpudirect / libgdsync
GPUDirect Async support for IB Verbs
☆131Updated 2 years ago
msr-fiddle / synergy
☆51Updated 2 years ago
S-Lab-System-Group / Lucid
Lucid: A Non-Intrusive, Scalable and Interpretable Scheduler for Deep Learning Training Jobs
☆55Updated 2 years ago
microsoft / msccl
Microsoft Collective Communication Library
☆363Updated 2 years ago
S-Lab-System-Group / HeliosData
Helios Traces from SenseTime
☆58Updated 3 years ago
JelixLi / Tetris
☆18Updated 2 years ago
netx-repo / PipeSwitch
PipeSwitch: Fast Pipelined Context Switching for Deep Learning Applications
☆126Updated 3 years ago
TankLabTJU / INFless
The source code of INFless，a native serverless platform for AI inference.
☆40Updated 3 years ago