Bruce-Lee-LY / cuda_hook

Hooks CUDA-related dynamic libraries using automated code generation tools.

☆167

Alternatives and similar repositories for cuda_hook

Users who are interested in cuda_hook are comparing it to the repositories listed below.

NTHU-LSALAB / Gemini
An efficient GPU resource sharing system with fine-grained control for Linux platforms.
☆85 Updated last year
RWTH-ACS / cricket
cricket is a virtualization solution for GPUs
☆215 Updated last month
pkusys / TGS
Artifacts for our NSDI'23 paper TGS
☆89 Updated last year
cjg / GVirtuS
This repository is an archive. Refer to https://github.com/gvirtus/GVirtuS
☆45 Updated 3 years ago
alibaba / GPU-scheduler-for-deep-learning
GPU-scheduler-for-deep-learning
☆210 Updated 4 years ago
SJTU-IPADS / PhoenixOS
Fast OS-level support for GPU checkpoint and restore
☆245 Updated last month
Mellanox / gpu_direct_rdma_access
example code for using DC QP for providing RDMA READ and WRITE operations to remote GPU memory
☆145 Updated last year
Mellanox / nccl-rdma-sharp-plugins
RDMA and SHARP plugins for nccl library
☆210 Updated last week
coldfunction / qCUDA
qCUDA: GPGPU Virtualization at a New API Remoting Method with Para-virtualization
☆129 Updated 3 years ago
microsoft / NPKit
NCCL Profiling Kit
☆145 Updated last year
coreweave / nccl-tests
NVIDIA NCCL Tests for Distributed Training
☆116 Updated this week
eth-easl / orion
An interference-aware scheduler for fine-grained GPU sharing
☆150 Updated 9 months ago
tkestack / vcuda-controller
☆537 Updated last year
nchong / cudahook
Intercepting CUDA runtime calls with LD_PRELOAD
☆42 Updated 11 years ago
Project-HAMi / HAMi-core
HAMi-core compiles libvgpu.so, which enforces a hard limit on the GPU in a container
☆244 Updated 2 weeks ago
ai-dynamo / nixl
NVIDIA Inference Xfer Library (NIXL)
☆688 Updated this week
NVIDIA / nvbandwidth
A tool for bandwidth measurements on NVIDIA GPUs.
☆553 Updated 6 months ago
Mellanox / nv_peer_memory
☆377 Updated last year
eniac / paella
Paella: Low-latency Model Serving with Virtualized GPU Scheduling
☆62 Updated last year
antgroup / glake
GLake: optimizing GPU memory management and IO transmission.
☆483 Updated 7 months ago
sakjain92 / Fractional-GPUs
Splits a single NVIDIA GPU into multiple partitions with complete compute and memory isolation (with respect to performance) between the partitions
☆160 Updated 6 years ago
DMTCP-CRAC / CRAC-early-development
☆24 Updated last year
pokerfaceSad / GPUMounter
A Kubernetes plugin that enables dynamically adding or removing GPU resources for a running Pod
☆128 Updated 3 years ago
NVIDIA / cuda-checkpoint
CUDA checkpoint and restore utility
☆377 Updated last month
casys-kaist / glet
☆53 Updated 10 months ago
NVIDIA / DCGM
NVIDIA Data Center GPU Manager (DCGM) is a project for gathering telemetry and measuring the health of NVIDIA GPUs
☆601 Updated 2 weeks ago
zartbot / shallowsim
DeepSeek-V3/R1 inference performance simulator
☆170 Updated 7 months ago
uccl-project / uccl
Ultra and Unified CCL
☆630 Updated this week
bytedance / InfiniStore
KV cache store for distributed LLM inference
☆346 Updated last month
AlibabaPAI / llumnix
Efficient and easy multi-instance LLM serving
☆502 Updated last month