sakjain92 / Fractional-GPUsLinks

Splits single Nvidia GPU into multiple partitions with complete compute and memory isolation (wrt to performace) between the partitions

☆159

Alternatives and similar repositories for Fractional-GPUs

Users that are interested in Fractional-GPUs are comparing it to the libraries listed below

Sorting:

SymbioticLab / Salus
Fine-grained GPU sharing primitives
☆143Updated this week
NTHU-LSALAB / Gemini
An efficient GPU resource sharing system with fine-grained control for Linux platforms.
☆84Updated last year
alibaba / GPU-scheduler-for-deep-learning
GPU-scheduler-for-deep-learning
☆210Updated 4 years ago
nchong / cudahook
Intercepting CUDA runtime calls with LD_PRELOAD
☆41Updated 11 years ago
google / nccl-fastsocket
NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud.
☆118Updated last year
cjg / GVirtuS
This repository is an archive. Refer to https://github.com/gvirtus/GVirtuS
☆44Updated 3 years ago
Mellanox / nccl-rdma-sharp-plugins
RDMA and SHARP plugins for nccl library
☆199Updated last month
microsoft / hivedscheduler
Kubernetes Scheduler for Deep Learning
☆263Updated 3 years ago
stanford-futuredata / gavel
Code for "Heterogenity-Aware Cluster Scheduling Policies for Deep Learning Workloads", which appeared at OSDI 2020
☆128Updated last year
Mellanox / nv_peer_memory
☆361Updated last year
netx-repo / PipeSwitch
PipeSwitch: Fast Pipelined Context Switching for Deep Learning Applications
☆127Updated 3 years ago
kubedl-io / morphling
Automatic tuning for ML model deployment on Kubernetes
☆80Updated 9 months ago
SymbioticLab / Tiresias
Tiresias is a GPU cluster manager for distributed deep learning training.
☆155Updated 5 years ago
Bruce-Lee-LY / cuda_hook
Hooked CUDA-related dynamic libraries by using automated code generation tools.
☆160Updated last year
kzhang28 / Optimus
An Efficient Dynamic Resource Scheduler for Deep Learning Clusters
☆42Updated 7 years ago
geoffxy / habitat
🔮 Execution time predictions for deep neural network training iterations across different GPUs.
☆63Updated 2 years ago
yalue / cuda_scheduling_examiner_mirror
A tool for examining GPU scheduling behavior.
☆86Updated 11 months ago
aws / aws-ofi-nccl
This is a plugin which lets EC2 developers use libfabric as network provider while running NCCL applications.
☆181Updated last week
coreweave / nccl-tests
NVIDIA NCCL Tests for Distributed Training
☆100Updated last week
stanford-mast / INFaaS
Model-less Inference Serving
☆90Updated last year
eniac / paella
Paella: Low-latency Model Serving with Virtualized GPU Scheduling
☆60Updated last year
microsoft / superbenchmark
A validation and profiling tool for AI infrastructure
☆325Updated last week
uwsampl / nexus
☆82Updated last month
NVIDIA / MagnumIO
Magnum IO community repo
☆95Updated 2 months ago
facebookresearch / param
PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for…
☆147Updated last week
microsoft / NPKit
NCCL Profiling Kit
☆139Updated last year
NVIDIA / nvtx-plugins
Python bindings for NVTX
☆66Updated 2 years ago
awslabs / lorien
☆43Updated last year
king-jingxiang / pod-gpushare-metrics-exporter
Forked form
☆11Updated 4 years ago
petuum / adaptdl
Resource-adaptive cluster scheduler for deep learning training.
☆447Updated 2 years ago